基于非局部网络与通道注意力机制的颈动脉狭窄分类模型

李博涛; 郑瀚; 罗继昌; 张桃红; 焦力群; 金紫薇

doi:10.13374/j.issn2095-9389.2024.09.12.001

基于非局部网络与通道注意力机制的颈动脉狭窄分类模型

Carotid stenosis classification model based on non-local networks and channel attention mechanism

摘要

摘要: 颈动脉狭窄是缺血性脑卒中的主要成因之一，目前数字减影造影技术(DSA)被称为颈动脉狭窄诊断的金标准，但传统的诊断方式需要由病理学家手动筛选分析DSA影像，存在着筛查速度慢、容易出错及对专业诊断人员的依赖等问题. 人工智能为我们提供了辅助诊断手段. 但目前的识别往往诊断出一处狭窄就完成识别，而实际影像有时存在不止一处的问题，为提高影像对多处狭窄的识别能力，本文提出了一个非局部通道注意力网络（Non-Local Channel Attention Net，NLCANet）对颈动脉狭窄进行准确分类. 该模型主要由两个模块构成：非局部多尺度特征融合模块（Non Local Multi-Scale Fusion module，NLMSF）和通道注意力模块（Multi-Level Channel Attention module，MLCA）. 非局部多尺度特征融合模块NLMSF利用非局部网络的思想来模拟空间注意力操作，同时，为了更好的提取多尺度特征，在非局部网络中还加入了多尺度特征融合的模块，对颈动脉影像分类起到重要作用；通道注意力模块MLCA通过高效的利用影像中的通道特征，为模型分类提供了更多的语义信息. 我们通过使用提取关键帧的技术，建立颈动脉狭窄数据集，将本文模型与其他主流的医学影像分类模型在该颈动脉狭窄数据集上进行对比. 我们的模型达到了最好的效果，模型的分类准确率要高于其他主流的模型至少2%.

Abstract: Carotid artery stenosis is one of the primary culprits behind ischemic stroke, a leading cause of morbidity and mortality worldwide. Precise diagnosis of this condition is crucial for effective patient management and treatment. Currently, Digital subtraction angiography (DSA) is the gold standard for diagnosing carotid artery stenosis, offering clear and detailed images that enable radiologists to visualize the degree of arterial narrowing. However, traditional diagnostic approaches often rely on radiologists to manually scrutinize and interpret these images—an inherently time-consuming process that is prone to human error and heavily dependent on professional expertise. In this context, the advent of artificial intelligence has brought forth promising auxiliary diagnostic tools aimed at enhancing the efficiency and accuracy of medical image analysis. Despite significant advancements, many current recognition systems have limited capacity, typically detecting only a single area of stenosis within an image. Such limitations can lead to incomplete or inaccurate diagnoses, potentially compromising patient outcomes. To address these challenges and improve diagnostic performance for carotid artery stenosis, this study introduces a novel approach: the Non-local channel attention network (NLCANet), an advanced network architecture specifically designed to enhance the classification accuracy of carotid artery stenosis. By leveraging non-local attention mechanisms and channel-wise feature extraction, NLCANet provides a more robust and nuanced approach to image analysis, ensuring more precise and reliable diagnostic results. This innovation aims to advance the field of medical image classification and significantly improve clinical outcomes by enabling faster and more accurate diagnoses. The proposed model is built upon two integral components that operate synergistically: the Non-local multi-scale fusion (NLMSF) module and the Multi-level channel attention (MLCA) module. The NLMSF module draws inspiration from non-local networks, which are capable of capturing long-range dependencies and contextual relationships within an image. By simulating spatial attention operations, the NLMSF module enables the model to focus on critical regions across the entire image, thereby incorporating both local detail and global context. Furthermore, it integrates a multiscale feature fusion strategy—particularly important in medical imaging, where stenotic regions often vary in size and complexity. In parallel, the MLCA module enhances the model’s ability to prioritize and utilize channel-wise information. Medical images typically contain rich multidimensional data distributed across channels, with each channel contributing differently to the classification task. The MLCA module is designed to effectively identify and weight channels that carry the most diagnostically relevant semantic information. This attention mechanism improves the model’s capacity to differentiate between varying degrees of stenosis, thereby reducing the risk of misclassification and improving diagnostic accuracy. By combining both spatial and channel-level attention mechanisms, NLCANet achieves a more comprehensive understanding of medical images, resulting in superior classification performance. We constructed a carotid artery stenosis dataset by applying key frame-extraction techniques to DSA sequences and conducted comparative experiments against several mainstream medical image classification models. Experimental results show that NLCANet achieves the best performance, with classification accuracy at least 2% higher than that of the other benchmark models. These findings demonstrate the effectiveness and clinical potential of NLCANet as an accurate, efficient, and reliable diagnostic tool for carotid artery stenosis.

HTML全文

参考文献(40)

施引文献

资源附件(0)