基于YOLOv8的掘进机零部件轻量化识别模型

徐俊; 赵小虎; 王杰

doi:10.13374/j.issn2095-9389.2025.08.04.003

基于YOLOv8的掘进机零部件轻量化识别模型

A lightweight recognition model for tunneling machine components based on YOLOv8

摘要

摘要: 目标检测与混合现实(Mixed reality，MR)技术的结合在矿用设备维修领域展现出广阔的应用前景，为满足MR设备对检测模型轻量化与高效性的要求，针对目前混合现实辅助维修中的目标识别无法兼顾准确性与实时性的问题，本文提出一种基于YOLOv8的轻量化识别模型YOLOv8-CLRS. 通过引入C2f-HC结构降低浮点运算次数，提高识别速度，采用线性可变形卷积模块LDConv进一步提高网络模型的实时性，并利用RepGFPN重构Neck网络实现对特征的高效融合与优化，同时采用更注重目标形状的Shape-IoU损失函数使得边界框回归更加准确. 在自制的掘进机零部件数据集上的测试结果表明，改进后的模型精确度提高了0.4%，参数量减少了30%，帧率提高了13.9%，在保证准确率的前提下实现了轻量化设计，能够有效满足混合现实辅助维修矿用设备系统的应用要求.

Abstract: The combination of object detection and mixed reality (MR) technology has shown broad application prospects in the field of mining equipment maintenance. In order to meet the requirements of MR equipment for lightweight and efficient detection models, and to address the problem of target recognition in current mixed reality assisted maintenance that cannot balance accuracy and real-time performance, we propose a lightweight recognition model YOLOv8-CLRS based on YOLOv8. First, we introduce a heterogeneous kernel-based convolution (HetConv) module to replace the native C2f structure. This redesign significantly reduces computational complexity and floating-point operations, resulting in faster inference speeds while effectively maintaining rich feature representations. Second, we integrate linearly deformable convolution modules into the backbone network, replacing standard convolutions. This enhancement improves the model’s ability to adapt to objects with diverse geometries and dynamic spatial layouts, thereby increasing robustness in cluttered and variable industrial settings. Third, we reconstruct the neck of the network using a reparameterized generalized feature pyramid network, which promotes more efficient multi-scale feature fusion and strengthens semantic interaction across different feature levels. In addition, we refine the bounding box regression process by incorporating shape-IoU, a novel loss function that emphasizes geometric shape alignment between predictions and ground truths. This results in superior localization performance, particularly for non-rectangular or intricately shaped mechanical components. The proposed model was rigorously evaluated on a custom-built dataset containing images of key components of a tunneling machine, captured under diverse conditions including varying illumination, occlusion, and viewpoints. The experimental results show a 0.4% improvement in precision, a 30% reduction in parameter count, and a 13.9% increase in inference speed compared to the baseline YOLOv8 model. These performance enhancements confirm the model’s strong suitability for real-time MR applications that require both reliability and efficiency. This study not only presents a viable and efficient solution for MR-assisted industrial maintenance systems but also offers valuable insights into lightweight model design that could benefit a wide range of edge-computing applications in industry. The proposed architecture effectively balances the trade-off between computational load and detection accuracy, addressing a major barrier to the widespread adoption of AI-enhanced MR systems in real-world maintenance environments.

HTML全文

参考文献(25)

施引文献

资源附件(0)