Deep-DSO: Improving Mapping of Direct Sparse Odometry Using CNN-Based Single-Image Depth Estimation
Metadatos
Mostrar el registro completo del ítemAutor
Herrera-Granda, Erick P.; Torres-Cantero, Juan C.; Herrera-Granda, Israel D.; Lucio-Naranjo, José FEditorial
MDPI
Materia
CNN direct sparse odometry Monocular visual odometry Monocular 3D reconstruction
Fecha
2025-10-19Referencia bibliográfica
Herrera-Granda, E.P.; Torres-Cantero, J.C.; Herrera-Granda, I.D.; Lucio-Naranjo, J.F.; Rosales, A.; Revelo-Fuelagán, J.; Peluffo-Ordóñez, D.H. Deep-DSO: Improving Mapping of Direct Sparse Odometry Using CNN-Based Single-Image Depth Estimation. Mathematics 2025, 13, 3330. https://doi.org/10.3390/math13203330
Patrocinador
CropID Project (Ref. AS No. 25)Resumen
In recent years, SLAM, visual odometry, and structure-from-motion approaches have
widely addressed the problems of 3D reconstruction and ego-motion estimation. Of the
many input modalities that can be used to solve these ill-posed problems, the pure visual
alternative using a single monocular RGB camera has attracted the attention of multiple
researchers due to its low cost and widespread availability in handheld devices. One
of the best proposals currently available is the Direct Sparse Odometry (DSO) system,
which has demonstrated the ability to accurately recover trajectories and depth maps using
monocular sequences as the only source of information. Given the impressive advances in
single-image depth estimation using neural networks, this work proposes an extension of
the DSO system, named DeepDSO. DeepDSO effectively integrates the state-of-the-art NeW
CRF neural network as a depth estimation module, providing depth prior information
for each candidate point. This reduces the point search interval over the epipolar line.
This integration improves the DSO algorithm’s depth point initialization and allows each
proposed point to converge faster to its true depth. Experimentation carried out in the TUMMono dataset demonstrated that adding the neural network depth estimation module to
the DSO pipeline significantly reduced rotation, translation, scale, start-segment alignment,
end-segment alignment, and RMSE errors.





