基于改进RepViT的岩石薄片岩性识别方法

    Lithology Recognition Method for Rock Thin Sections Based on an Improved RepViT

    • 摘要: 【研究目的】针对现有的岩性识别中存在参数量冗余致推理延迟高、多尺度矿物特征捕捉不足、浅层基础特征易丢失的问题。【研究方法】设计一种基于改进RepViT(Re-parameterized Vision Transformer)的岩石薄片图像分类模型CI-RepViT(Compact and Informative RepViT)。通过多尺度特征提取构建适合岩石特征提取的多尺度感受野体系,有效解决不同尺度矿物特征提取的问题;引入Identity分支,缓解残差梯度衰减并保留浅层基础特征,实现颗粒边缘等关键特征向深层的无损传递;将注意力机制升级为ECA注意力机制,省去SE的全连接层,减少参数量冗余,在提升特征筛选精度的同时降低计算开销。【研究结果】结果显示,参数量降低了0.64M,准确率提升了2.61%,通过对比ConvNextV2、GoogLeNet等常见的六种模型,CI-RepViT在准确率、精确率等关键指标上表现更优。【结论】为复杂岩性条件下岩石薄片的高效、轻量化识别提供技术支撑。

       

      Abstract: ObjectiveTo address the problems in existing lithology recognition methods, including redundant parameters leading to high inference latency, insufficient multi-scale mineral feature extraction, and the loss of shallow fundamental features.Methods A rock thin-section image classification model named CI-RepViT (Compact and Informative RepViT), based on an improved Re-parameterized Vision Transformer (RepViT), is designed. A multi-scale feature extraction strategy is adopted to construct a multi-scale receptive field system suitable for rock feature representation, effectively enhancing the extraction of mineral features at different scales. An Identity branch is introduced to alleviate residual gradient attenuation and preserve shallow fundamental features, enabling lossless propagation of critical information such as grain boundaries to deeper layers. In addition, the attention mechanism is upgraded to Efficient Channel Attention (ECA), which removes the fully connected layers used in Squeeze-and-Excitation (SE) blocks, thereby reducing parameter redundancy and computational overhead while improving feature selection accuracy.Results The results demonstrate that the proposed model reduces the parameter count by 0.64 M and improves classification accuracy by 2.61%. Through comparisons with six commonly used models, including ConvNeXtV2 and GoogLeNet, CI-RepViT achieves superior performance in key metrics such as accuracy and precision.Conclusion This study provides technical support for efficient and lightweight rock thin-section recognition under complex lithological conditions.

       

    /

    返回文章
    返回