电光与控制, 2023, 30 (11): , 网络出版: 2024-01-20  

基于非下采样轮廓波变换的遥感地物分割算法

A Remote Sensing Ground Object Segmentation Algorithm Based on Non-subsampled Contourlet Transform
作者单位
武汉工程大学智能机器人湖北省重点实验室, 武汉 430000
摘要
针对遥感地物图像具有背景复杂且种类众多的特点, 利用传统算法进行分割会导致边缘模糊、信息丢失及分割精度低的问题, 提出了一种基于改进DeepLabV3+网络的语义分割算法。首先, 在主干网络中引入改进后的特征提取网络CHRNet; 其次, 使用非下采样轮廓波变换(NSCT)算法重构空洞空间金字塔池化(ASPP)模块中的全局池化操作; 最后, 在模型编码和解码阶段添加无参数的注意力机制SimAM, 加强模块间的特征传递, 提高特征利用率。实验表明, 在PASCAL VOC2012和WHDLD数据集上, 改进算法的平均交并比(MIoU)分别达到了81.56%和64.2%, 较原有算法分别提升了约4.61和2.8个百分点, 改进算法在保证分割速率的同时, 提升了分割精度。
Abstract
Remote sensing ground object images have the characteristics of complex background and numerous varieties, and traditional segmentation algorithms will lead to edge blur, information loss and low segmentation accuracy.To solve the problems, a semantic segmentation algorithm based on the improved DeepLabV3+ network is proposed.Firstly, the improved feature extraction network CHRNet is introduced into the backbone network.Secondly, the Non-Subsampled Contourlet Transform (NSCT) algorithm is used to reconstruct the global pooling operation in the Atrous Spatial Pyramid Pooling (ASPP) module.Finally, the parameter-free attention mechanism SimAM is added in model encoding and decoding stages to enhance feature transfer among modules and improve feature utilization ratio.The experimental results show that Mean Intersection over Union (MIoU) of the improved algorithm is 81.56% on PASCAL VOC2012 data set and 64.2% on WHDLD data set, which are about 4.61 percentage points and 2.8 percentage points higher than those of the original algorithm.The improved algorithm can enhance segmentation accuracy while ensuring segmentation speed.
参考文献

[1] WANG G H, ZHAI Q Y, LIN J H.Multi-scale network for remote sensing segmentation[J].IET Image Processing, 2022, 16(6):1742-1751.

[2] LIU Z W, LUO P, WANG X G, et al.Deep learning face attributes in the wild[C]//Proceedings of the IEEE International Conference on Computer Vision.Santiago:IEEE, 2015: 3730-3738.

[3] LONG J, SHELHAMER E, DARRELL T.Fully convolutional networks for semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Boston:IEEE, 2015:3431-3440.

[4] PENG C, ZHANG X Y, YU G, et al.Large kernel matters-improve semantic segmentation by global convolutional network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Hawaii:IEEE, 2017: 4353-4361.

[5] ZHAO H S, SHI J P, QI X J, et al.Pyramid scene parsing network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Hawaii:IEEE, 2017:2881-2890.

[6] CHEN L C, PAPANDREOU G, SCHROFF F, et al.Rethinking atrous convolution for semantic image segmentation[R].Los Alamos: arXiv Preprint, 2017:arXiv:1760.05587.

[7] CHEN L C, ZHU Y K, PAPANDREOU G, et al.Encoder-decoder with atrous separable convolution for semantic image segmentation[C]//Proceedings of the European Conference on Computer Vision (ECCV).Cham:Springer, 2018:833-851.

[8] CHOLLET F.Xception:deep slearning with depthwise separable convolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Hawaii:IEEE, 2017:1251-1258.

[9] 张文博,瞿珏,王崴,等.融合多尺度特征的改进DeepLabV3+图像语义分割算法[J].电光与控制,2022,29(11):12-16, 30.

[10] RONNEBERGER O, FISCHER P, BROX T.U-Net:convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention.Cham:Springer, 2015:234-241.

[11] BADRINARAYANAN V, KENDALLl A, CIPOLLA R.SegNet:a deep convolutional encoder-decoder architecture for image segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495.

[12] ZEILER M D, FERGUS R.Visualizing and understanding convolutional networks[C]//European Conference on Computer Vision.Cham:Springer, 2014:818-833.

[13] NEWELL A, YANG K Y, DENG J.Stacked hourglass networks for human pose estimation[C]//European Conference on Computer Vision.Cham:Springer, 2016:483-499.

[14] SUN K, XIAO B, LIU D, et al.Deep high-resolution representation learning for human pose estimation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Long Beach:IEEE, 2019:5586-5696.

[15] WU H S, LIANG C X, LIU M S, et al.Optimized HRNet for image semantic segmentation[J].Expert Systems with Applications, 2021, 174:114532.

[16] SUN K, ZHAO Y, JIANG B R, et al.High-resolution representations for labeling pixels and regions[R].Los Alamos: arXiv Preprint, 2019: arXiv:1904.04514.

[17] 刘洋,张国军.基于NSCT及RT的遥感图像阴影消除方法[J].计算机仿真,2022,39(5):465-469.

[18] YANG L X, ZHANG R Y, LI L D, et al.SimAM:a simple, parameter-free attention module for convolutional neural networks[C]//International Conference on Machine Learning.New York:PMLR, 2021:11863-11874.

[19] FU J, LIU J, TIAN H J, et al.Dual attention network for scene segmentation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Los Angeles:IEEE, 2019:3146-3154.

[20] LI H C, XIONG P F, AN J, et al.Pyramid attention network for semantic segmentation[R].Los Alamos: arXiv Preprint, 2018:arXiv:1805.10180.

[21] SANDLER M, HOWARD A, ZHU M L, et al.MobileNetV2:inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Salt Lake:IEEE, 2018:4510-4520.

[22] 何雪东,宣士斌,王款,等.融合累积分布函数和通道注意力机制的DeepLabV3+图像分割算法[J].计算机应用,2023,43(3):936-942.

[23] 马素刚,陈期梅,侯志强,等.基于密集连接和特征增强的语义分割算法[J].计算机工程,2023,49(3):263-270.

[24] ZHOU Z W, SIDDIQUEE R M M, TAJBAKHSH N, et al.UNet++:a nested U-NET architecture for medical image segmentation[C]//Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support.Cham:Springer, 2018:3-11.

[25] SUN Y, BI F K, GAO Y T, et al.A multi-attention UNet for semantic segmentation in remote sensing images[J].Symmetry, 2022, 14(5):906-921.

闵锋, 彭伟明, 况永刚, 毛一新, 郝琳琳. 基于非下采样轮廓波变换的遥感地物分割算法[J]. 电光与控制, 2023, 30(11): . MIN Feng, PENG Weiming, KUANG Yonggang, MAO Yixin, HAO Linlin. A Remote Sensing Ground Object Segmentation Algorithm Based on Non-subsampled Contourlet Transform[J]. Electronics Optics & Control, 2023, 30(11): .

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!