Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones

Zhao Shuanfeng; Huang Tao*; Xu Qian; Geng Longlong

doi: 10.3788/LOP57.021012

Laser & Optoelectronics Progress, 2020, 57(2): 021012. Published online: 2020-01-03

Affiliation

School of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, Shaanxi 710054, China

Figures & Tables

Fig. 1. Principle of binocular depth estimation

Fig. 2. Structural diagram of unsupervised monocular depth estimation

Fig. 3. Model of image reconstruction

Fig. 4. Loss functions of each component during training. (a) Structural similarity loss between reconstructed image and original image; (b) absolute-difference loss between reconstructed image and original image; (c) total image reconstruction loss; (d) disparity smoothness loss; (e) left-right disparity consistency loss; (f) total loss of our model

Fig. 5. Platform of drone experiment. (a) Drone; (b) connection of NVIDIA Jetson TX2 and Pixhawk

Fig. 6. Examples of depth maps predicted on KITTI dataset. (a) Input image; (b) ground truth depth map; (c) depth map predicted by Ref. [15]; (d) depth map predicted by Ref. [20]; (e) depth map predicted by our model based on VGG-16; (f) depth map predicted by our model based on ResNet-50

Fig. 7. Examples of depth maps predicted in real outdoor scenes. (a) Input images; (b) ground truth depth maps
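
The Fig. 4 caption names the terms tracked during training: a structural-similarity (SSIM) term and an absolute-difference term that together form the image reconstruction loss, a disparity smoothness term, and a left-right disparity consistency term. As a rough illustration only, the NumPy sketch below shows one common way such terms are combined. This page does not give the formulas, so the 3x3 SSIM window, the weights `alpha`, `w_smooth`, and `w_lr`, the sampling sign convention, and all function names are assumptions, not the authors' implementation.

```python
import numpy as np


def avg_pool3(x):
    """3x3 mean filter with 'valid' borders on a 2-D array."""
    h, w = x.shape
    out = np.zeros((h - 2, w - 2), dtype=np.float64)
    for dy in range(3):
        for dx in range(3):
            out += x[dy:dy + h - 2, dx:dx + w - 2]
    return out / 9.0


def ssim_loss(a, b, c1=0.01 ** 2, c2=0.03 ** 2):
    """Mean of (1 - SSIM) / 2 over 3x3 windows, for images scaled to [0, 1]."""
    mu_a, mu_b = avg_pool3(a), avg_pool3(b)
    var_a = avg_pool3(a * a) - mu_a ** 2
    var_b = avg_pool3(b * b) - mu_b ** 2
    cov = avg_pool3(a * b) - mu_a * mu_b
    ssim = ((2 * mu_a * mu_b + c1) * (2 * cov + c2)) / (
        (mu_a ** 2 + mu_b ** 2 + c1) * (var_a + var_b + c2))
    return float(np.mean(np.clip((1 - ssim) / 2, 0, 1)))


def l1_loss(a, b):
    """Mean absolute difference between two images."""
    return float(np.mean(np.abs(a - b)))


def reconstruction_loss(recon, target, alpha=0.85):
    """Weighted sum of SSIM and L1 terms; alpha is an assumed weight."""
    return alpha * ssim_loss(recon, target) + (1 - alpha) * l1_loss(recon, target)


def smoothness_loss(disp):
    """Mean absolute horizontal plus vertical disparity gradient (no edge weighting)."""
    gx = np.abs(disp[:, 1:] - disp[:, :-1])
    gy = np.abs(disp[1:, :] - disp[:-1, :])
    return float(gx.mean() + gy.mean())


def lr_consistency_loss(disp_left, disp_right):
    """L1 gap between the left disparity and the right disparity sampled at x - d_left.

    The sign convention and nearest-neighbour sampling are assumptions.
    """
    h, w = disp_left.shape
    xs = np.arange(w)[None, :] - disp_left
    xs = np.clip(np.rint(xs).astype(int), 0, w - 1)
    rows = np.arange(h)[:, None]
    return float(np.mean(np.abs(disp_left - disp_right[rows, xs])))


def total_loss(recon, target, disp_left, disp_right, w_smooth=0.1, w_lr=1.0):
    """Sum of the three terms with assumed weights w_smooth and w_lr."""
    return (reconstruction_loss(recon, target)
            + w_smooth * smoothness_loss(disp_left)
            + w_lr * lr_consistency_loss(disp_left, disp_right))
```

With a perfect reconstruction and constant, mutually consistent disparity maps, every term in this sketch is zero; any mismatch between the reconstructed and original image raises the reconstruction term.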


Table 1. Comparison of experimental results on KITTI dataset

Method	Supervised	Error (lower is better)			Accuracy (higher is better)			Time /s
		E_REL	E_RMSE	log E_RMSE	δ<1.25	δ<1.25²	δ<1.25³	
Ref. [12]	Yes	0.203	6.307	0.282	0.702	0.890	0.958	0.051
Ref. [15]	Yes	0.202	6.523	0.275	0.678	0.895	0.965	0.045
Ref. [19]	No	0.208	6.856	0.283	0.678	0.885	0.957	0.062
Ref. [20]	No	0.159	5.789	0.234	0.796	0.923	0.963	0.057
Ours (VGG-16)	No	0.148	5.496	0.226	0.812	0.912	0.960	0.056
Ours (ResNet-50)	No	0.124	5.331	0.219	0.847	0.945	0.975	0.048

Table 2. Comparison of experimental results on Make3D dataset

Method	Supervised	Error (lower is better)			Accuracy (higher is better)			Time /s
		E_REL	E_RMSE	log E_RMSE	δ<1.25	δ<1.25²	δ<1.25³	
Ref. [12]	Yes	0.417	8.526	0.403	0.692	0.899	0.948	0.068
Ref. [15]	Yes	0.462	9.972	0.456	0.656	0.887	0.945	0.048
Ref. [19]	No	0.443	8.326	0.398	0.662	0.885	0.932	0.074
Ref. [20]	No	0.387	7.895	0.354	0.704	0.899	0.946	0.054
Ours (VGG-16)	No	0.361	8.102	0.377	0.727	0.905	0.958	0.061
Ours (ResNet-50)	No	0.328	7.529	0.348	0.751	0.924	0.962	0.053
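
The column headings in the two tables correspond to error and threshold-accuracy measures widely used for depth estimation. The sketch below shows how such quantities are typically computed from a predicted and a ground-truth depth map. The exact definitions used in the paper are not given on this page, so the natural logarithm in the log RMSE, the masking of pixels without ground truth, and the function and key names are assumptions.

```python
import numpy as np


def depth_metrics(pred, gt):
    """Common depth-estimation metrics over pixels with valid ground truth.

    Assumes pred is strictly positive wherever gt > 0.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    valid = gt > 0  # assumed convention: zero marks missing ground truth
    pred, gt = pred[valid], gt[valid]

    # Threshold accuracy uses the larger of the two depth ratios per pixel.
    ratio = np.maximum(pred / gt, gt / pred)
    return {
        "rel": float(np.mean(np.abs(pred - gt) / gt)),
        "rmse": float(np.sqrt(np.mean((pred - gt) ** 2))),
        "log_rmse": float(np.sqrt(np.mean((np.log(pred) - np.log(gt)) ** 2))),
        "delta1": float(np.mean(ratio < 1.25)),
        "delta2": float(np.mean(ratio < 1.25 ** 2)),
        "delta3": float(np.mean(ratio < 1.25 ** 3)),
    }
```

Under these definitions the error columns shrink toward 0 and the three δ columns grow toward 1 as predictions improve, which matches the "lower is better" and "higher is better" labels in the table headers.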


Zhao Shuanfeng, Huang Tao, Xu Qian, Geng Longlong. Unsupervised Monocular Depth Estimation for Autonomous Flight of Drones[J]. Laser & Optoelectronics Progress, 2020, 57(2): 021012.
