论文标题

红细胞分割,与数据集的不平衡细胞分离和分类重叠

Red Blood Cell Segmentation with Overlapping Cell Separation and Classification on Imbalanced Dataset

论文作者

Naruenatthanaset, Korranat, Chalidabhongse, Thanarat H., Palasuwan, Duangdao, Anantrasirichai, Nantheera, Palasuwan, Attakorn

论文摘要

对血液涂片图像的自动红细胞(RBC)分类有助于血液学家分析RBC实验室的时间和成本降低。但是,重叠的细胞可能会导致错误的预测结果,因此在分类之前必须将它们分为多个单个RBC。为了将多个类别分类为深度学习,在医学成像中很常见,因为正常样本总是比稀有病样本高。本文提出了一种从血液涂片图像中细分和分类的RBC的新方法,特别是解决细胞重叠和数据失衡问题的方法。我们的分割过程着重于重叠的细胞分离,首先估计椭圆形代表RBC。该方法检测凹点,然后使用有向椭圆的拟合找到椭圆。 20个血液涂片图像的准确性为0.889。分类需要平衡的培训数据集。但是,一些RBC类型很少见。该数据集的不平衡比为34.538,对于20,875个单独的RBC样品,12个RBC类别为34.538。与许多其他应用程序相比,使用机器学习与不平衡数据集的RBC分类更具挑战性。我们分析了解决这个问题的技术。最佳准确性和F1得分分别为0.921和0.8679,使用EditighteNET-B1和增强。实验结果表明,使用增强的重量平衡技术有可能通过改善少数群体的F1得分来应对不平衡问题,而数据增强则显着改善了整体分类性能。

Automated red blood cell (RBC) classification on blood smear images helps hematologists to analyze RBC lab results in a reduced time and cost. However, overlapping cells can cause incorrect predicted results, and so they have to be separated into multiple single RBCs before classifying. To classify multiple classes with deep learning, imbalance problems are common in medical imaging because normal samples are always higher than rare disease samples. This paper presents a new method to segment and classify RBCs from blood smear images, specifically to tackle cell overlapping and data imbalance problems. Focusing on overlapping cell separation, our segmentation process first estimates ellipses to represent RBCs. The method detects the concave points and then finds the ellipses using directed ellipse fitting. The accuracy from 20 blood smear images was 0.889. Classification requires balanced training datasets. However, some RBC types are rare. The imbalance ratio of this dataset was 34.538 for 12 RBC classes from 20,875 individual RBC samples. The use of machine learning for RBC classification with an imbalanced dataset is hence more challenging than many other applications. We analyzed techniques to deal with this problem. The best accuracy and F1-score were 0.921 and 0.8679, respectively, using EfficientNet-B1 with augmentation. Experimental results showed that the weight balancing technique with augmentation had the potential to deal with imbalance problems by improving the F1-score on minority classes, while data augmentation significantly improved the overall classification performance.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源