中国激光, 2024, 51 (3): 0307108, 网络出版: 2024-02-19  

基于Transformer的宫颈异常细胞自动识别方法

Automatic Identification of Cervical Abnormal Cells Based on Transformer
作者单位
1 中国科学技术大学工程科学学院精密机械与精密仪器系,安徽 合肥 230027
2 中国科学技术大学苏州高等研究院,江苏 苏州 215123
3 中国科学技术大学微电子学院,安徽 合肥 230027
摘要
宫颈异常细胞与正常细胞在形态上存在较大相似性且细胞尺寸变化较大,这使得宫颈异常细胞的精准检测变得非常困难。鉴于此,开发了一种基于Transformer模型的宫颈异常细胞自动识别模型,以帮助病理学家作出更准确的诊断。提出了两种创新性方法,一是一种改进的Transformer编码器结构,通过引入深度(DW)卷积来高效获取图像的特征,捕捉图像中的全局依赖信息;二是自适应的动态交并比(IOU)阈值,在模型训练的不同阶段使用不同的IOU阈值,实现尽可能多的有效检测,提升模型的收敛速度和检测精度。在宫颈异常细胞数据集上,通过消融实验,证明了改进的Transformer编码器和动态IOU阈值的有效性。此外,与已有的宫颈异常细胞识别方法相比,所提出的方法在平均精度指标上有明显的提高。实验结果表明,所提出的方法能够高效且准确地识别宫颈异常细胞,且能辅助病理专家提高诊断准确率和效率,具有应用到临床的潜力。
Abstract
Objective

Cervical cancer is one of the most common malignant tumors and poses a serious threat to human health. However, because the onset of cervical cancer is gradual, early and effective screening is crucial. Traditional screening methods rely on manual examinations by pathologists, a process that is time-consuming, labor-intensive, error-prone, and often lacks an adequate number of pathologists for cervical cytology screening, making it challenging to meet the current demands for cervical cancer screening. In recent years, several deep-learning-based methods have been developed for screening abnormal cervical cells. However, because abnormal cervical cells develop from normal cells, they exhibit morphological similarities, making differentiation challenging. Pathologists typically need to reference normal cells in images to accurately distinguish them from abnormal cells. These factors limit the accuracy of abnormal cervical cell screening. This study proposes a Transformer-based approach for abnormal cervical cell screening that leverages the powerful global feature extraction and long-range dependency capabilities of Transformer. This method effectively enhances the detection accuracy of abnormal cervical cells, improving screening efficiency and alleviating the burden on medical professionals.

Methods

This study introduces a novel Transformer-based method for abnormal cervical cell detection that leverages the powerful global information extraction capabilities of Transformer to mimic the screening process of pathologists. The proposed method incorporates two innovative structures. The first is an improved Transformer encoder, which consists of multiple blocks stacked together. Each block comprises two parts: a multi-head self-attention layer and a feedforward neural network layer. The multi-head self-attention layer captures the correlation of the input data at different levels and scales, enabling the model to better understand the structure of the input sequence. The feedforward neural network layer includes multiple fully connected layers and activation functions and introduces nonlinear transformations to help the model adapt to complex data distributions. We also introduce Depthwise (DW)convolution and Dropout layers to the encoder. DW convolution layer performs convolution operations with separate kernels for each input channel, capturing features within the channels without introducing inter-channel dependencies. Dropout layer reduces the tendency of neural networks to overfit the training data, thereby enhancing the generalization of the model to unseen data. Additionally, we design a dynamic intersection-over-union (IOU) threshold method that adaptively adjusts the IOU threshold. In the initial stages of training, the model can obtain as many effective detections as possible, whereas in later stages, it can filter out most false positive predictions, thereby improving the detection accuracy of the model. Using the proposed method, the model can obtain precise information regarding the location of abnormal cells.

Results and Discussions

To validate the effectiveness of our proposed method, we compare it with common general-purpose object detection methods. The average accuracy (AP) and AP50 of our method are 26.1% and 46.8%, respectively, surpassing those of all general object detection models (Table 1). In particular, our method outperforms other comparative models by a significant margin in AP metrics, demonstrating that our model not only detects normal-sized targets but can also detect extremely small targets. Additionally, in a comparison with attFPN, a network specifically designed for abnormal cervical cell detection, our method surpasses attFPN in terms of AP by 1.1% (Table 2). Visual inspection of the detection results reveals that our method more accurately identifies target regions with lower false-positive and false-negative rates (Fig.5). Ablation experiments indicate that adopting the improved Transformer encoder method increases AP and AP50 by 1.8% and 2.3%, respectively, compared with the original model. The use of dynamic IOU thresholds results in a 0.6% increase in AP and a 0.9% increase in AP50 compared with the original model (Table 4). Furthermore, a comparison between the dynamic and fixed IOU thresholds in terms of loss and AP during the training process shows that the model with dynamic IOU thresholds experiences a faster loss reduction and achieves a higher AP in the later stages of training (Fig.6).

Conclusions

This study introduces an automatic identification method for abnormal cervical cells utilizing Transformer as the backbone. We further propose an enhanced Transformer encoder structure and a dynamically adjustable IOU threshold. Various comparative experiments on datasets demonstrate that the proposed method outperforms existing approaches in terms of accuracy and other metrics, thereby achieving precise identification of abnormal cervical cells. Through ablation experiments, it is proven that both proposed modules enhance the accuracy of the model in identifying abnormal cervical cells. Overall, the proposed method significantly improves the efficiency of medical image screening, saving medical time and resources, facilitating timely detection of cancerous lesions, and presenting considerable clinical and practical value. Future research may focus on the application of semi-supervised and unsupervised learning in the field of medical imaging to enhance image utilization, improve model detection performance, and better meet clinical requirements.

张峥, 陈明销, 李新宇, 程逸, 申书伟, 姚鹏. 基于Transformer的宫颈异常细胞自动识别方法[J]. 中国激光, 2024, 51(3): 0307108. Zheng Zhang, Mingxiao Chen, Xinyu Li, Yi Chen, Shuwei Shen, Peng Yao. Automatic Identification of Cervical Abnormal Cells Based on Transformer[J]. Chinese Journal of Lasers, 2024, 51(3): 0307108.

引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!