激光与光电子学进展, 2019, 56 (13): 131007, 网络出版: 2019-07-11   

基于多尺度特征提取和全连接条件随机场的图像语义分割方法 下载: 1298次

Image Semantic Segmentation Based on Multi-Scale Feature Extraction and Fully Connected Conditional Random Fields
作者单位
1 河北工业大学人工智能与数据科学学院, 天津 300401
2 河北省大数据计算重点实验室, 天津 300401
图 & 表

图 1. 特征融合前的单分支网络结构

Fig. 1. Single-branch network structure before feature fusion

下载图片 查看原文

图 2. 多尺度特征融合

Fig. 2. Multi-scale feature fusion

下载图片 查看原文

图 3. FullCRF优化语义粗分割结果

Fig. 3. FullCRF optimization semantic rough segmentation result

下载图片 查看原文

图 4. 分割结果对比图

Fig. 4. Comparison of segmentation results

下载图片 查看原文

表 1特征融合前单个分支编码器部分的参数设置表

Table1. Parameter setting table of single branch encoder before feature fusion

RGB encoderDepth encoder
Conv block1:3×3 Conv 643×3 Conv 642×2 maxpoolingConv block2:3×3 Conv 1283×3 Conv 1282×2 maxpoolingConv block3:3×3 Conv 2563×3 Conv 2562×2 maxpoolingConv block1:3×3 Conv 643×3 Conv 642×2 maxpoolingConv block2:3×3 Conv 1283×3 Conv 1282×2 maxpoolingConv block3:3×3 Conv 2563×3 Conv 2562×2 maxpooling
Conv block4:3×3 Conv 5123×3 Conv 5123×3 Conv 5122×2 maxpoolingConv block5:3×3 Conv 5123×3 Conv 5123×3 Conv 5122×2 maxpoolingConv block4:3×3 Conv 5123×3 Conv 5123×3 Conv 5122×2 maxpoolingConv block5:3×3 Conv 5123×3 Conv 5123×3 Conv 512

查看原文

表 2不同网络在NYUv2数据集上的结果

Table2. Results of different networks on NYUv2 dataset

MethodInputdata typePA /%MA /%MIoU /%
Method in Ref. [6]RGB60.042.229.2
Method in Ref. [6]Depth57.135.224.2
Method in Ref. [25]RGB-depth60.3-28.6
Method in Ref. [26]RGB-depth63.831.5-
Method in Ref. [6]RGB-depth61.542.430.5
Method in Ref. [22]RGB-depth65.642.227.8
MSF-CRFRGB-depth66.944.230.2

查看原文

表 340个类别的类别精度对比表

Table3. Comparison of classification accuracy of 40 categories

DatasetWallFloorCabinetBedChairSofaTableDoor
FuseNet89.295.767.975.774.671.049.334.8
MSF-CRF91.896.571.073.773.583.149.527.1
DatasetWindowBookshelfPictureCounterBlindsDeskShelfCurtain
FuseNet52.948.068.156.467.215.112.656.5
MSF-CRF53.860.766.663.545.626.017.358.5
DatasetDresserPillowMirrorFloormatClothesCeilingBooksFridge
FuseNet28.444.330.738.822.975.521.211.9
MSF-CRF45.349.354.919.015.969.210.721.0
DatasetTVPaperTowelShowerBoxWhite boardPersonNightstand
FuseNet39.15.723.034.9732.523.235.1
MSF-CRF50.84.329.630.63.324.349.454.0
DatasetToiletSinkLampBathtubBagOther structOther furnitureOther prop
FuseNet75.032.440.151.91.619.810.845.7
MSF-CRF78.732.940.250.11.09.318.746.8

查看原文

表 440个类别的IoU对比表

Table4. Comparison of IoU of 40 categories

DatasetWallFloorCabinetBedChairSofaTableDoor
FuseNet59.570.844.759.341.247.531.819.6
MSF-CRF57.270.445.063.743.850.235.415.4
DatasetWindowBookshelfPictureCounterBlindsDeskShelfCurtain
FuseNet27.530.044.134.442.511.35.834.8
MSF-CRF32.730.848.038.536.317.06.143.1
DatasetDresserPillowMirrorFloormatClothesCeilingBooksFridge
FuseNet23.729.624.329.58.542.314.88.9
MSF-CRF32.134.342.517.09.439.89.514.0
DatasetTVPaperTowelShowerBoxWhite boardPersonNightstand
FuseNet31.53.818.520.3422.414.826.6
MSF-CRF39.13.721.826.12.420.732.940.1
DatasetToiletSinkLampBathtubBagOther structOther furnitureOther prop
FuseNet49.124.328.841.11.111.17.921.9
MSF-CRF50.121.231.239.80.97.313.425.0

查看原文

董永峰, 杨雨訢, 王利琴. 基于多尺度特征提取和全连接条件随机场的图像语义分割方法[J]. 激光与光电子学进展, 2019, 56(13): 131007. Yongfeng Dong, Yuxin Yang, Liqin Wang. Image Semantic Segmentation Based on Multi-Scale Feature Extraction and Fully Connected Conditional Random Fields[J]. Laser & Optoelectronics Progress, 2019, 56(13): 131007.

本文已被 3 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!