Show simple item record

dc.contributor.advisor        Torres Cantero, Juan Carlos
dc.contributor.advisor        Peluffo-Ordóñez, Diego Hernán
dc.contributor.author         Herrera-Granda, Erick P.
dc.contributor.other          Universidad de Granada. Programa de Doctorado en Tecnologías de la Información y Comunicación [es_ES]
dc.date.accessioned           2024-04-18T06:33:47Z
dc.date.available             2024-04-18T06:33:47Z
dc.date.issued                2024
dc.identifier.citation        Herrera-Granda, Erick P. Real-time monocular 3D reconstruction of scenarios using artificial intelligence techniques. Granada: Universidad de Granada, 2024. [https://hdl.handle.net/10481/90846] [es_ES]
dc.identifier.isbn            9788411952583
dc.identifier.uri             https://hdl.handle.net/10481/90846
dc.description.abstract       This research presents a comprehensive study on monocular 3D reconstruction of environments using only RGB images, acquired through a monocular sensor, as input. The objectives were to develop a suitable taxonomy, review seminal algorithms, compare open-source methods, and develop a novel 3D reconstruction system that combines the principal classic techniques with artificial intelligence to improve overall system performance. An exhaustive literature review led to a proposed taxonomy with three classifications: direct vs. indirect, dense vs. sparse, and classic vs. machine learning. This resulted in 10 categories used to classify 42 notable monocular SLAM, SFM, and VO systems against 11 identified criteria. Subsequently, through rigorous benchmarking, ten prominent open-source algorithms spanning the taxonomy were implemented to discern each method's advantages and limitations. The TUM-Mono dataset, considered the most complete benchmark with its 50 outdoor and indoor sequences, was used for evaluation. Statistical analysis revealed that sparse-direct methods significantly outperformed the others, with DSO excelling. In addition, the results showed that integrating machine learning modules into the SLAM pipeline contributes significantly to system performance and final reconstruction quality. Consequently, DSO was selected for enhancement by integrating the state-of-the-art NeW-CRFs single-image depth estimation CNN module, which introduced depth prior knowledge to refine DSO's depth initialization and tracking. Using the TUM-Mono dataset, the new DeepDSO method was benchmarked against DSO and CNN-DSO. DeepDSO surpassed both across various metrics, including translation error, rotation error, scale error, alignment error, and RMSE. Statistical tests confirmed DeepDSO's superiority, with an RMSE of 0.0624, an error reduction of close to 13.35% with respect to the original DSO system. DeepDSO pushes the boundaries of monocular VO by strategically integrating machine learning-based depth estimation, and the taxonomy and comparative analysis provide guidelines for appropriate algorithm selection and implementation. This study validates the benefits of implementing artificial intelligence within SLAM, VO, and SFM systems and lays the groundwork for further depth initialization and point-tracking optimizations. [es_ES]
dc.description.sponsorship    Thesis, Univ. Granada [es_ES]
dc.description.sponsorship    SDAS Research Group [es_ES]
dc.format.mimetype            application/pdf [en_US]
dc.language.iso               eng [es_ES]
dc.publisher                  Universidad de Granada [es_ES]
dc.rights                     Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri                 http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.title                      Real-time monocular 3D reconstruction of scenarios using artificial intelligence techniques [es_ES]
dc.type                       doctoral thesis [es_ES]
europeana.type                TEXT [en_US]
europeana.dataProvider        Universidad de Granada. Spain. [es_ES]
europeana.rights              http://creativecommons.org/licenses/by-nc-nd/3.0/ [en_US]
dc.rights.accessRights        open access [es_ES]
dc.type.hasVersion            VoR [es_ES]
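
The abstract reports a DeepDSO RMSE of 0.0624 and an error reduction "close to 13.35%" with respect to the original DSO. As a quick arithmetic check, the DSO baseline implied by those two figures can be recovered as follows; note that the baseline value below is inferred from the two reported numbers, not quoted from the thesis. A minimal Python sketch:

    # Sanity-check the error-reduction figures quoted in the abstract.
    # Reported: DeepDSO RMSE = 0.0624, roughly 13.35% lower than DSO's.
    # The DSO baseline is inferred from those two numbers (an assumption),
    # not taken from the thesis itself.
    deepdso_rmse = 0.0624
    reduction = 0.1335                    # "close to 13.35%"

    implied_dso_rmse = deepdso_rmse / (1.0 - reduction)
    print(f"implied DSO baseline RMSE: {implied_dso_rmse:.4f}")  # ~0.0720

    # Recover the relative reduction from the two RMSE values:
    recovered = (implied_dso_rmse - deepdso_rmse) / implied_dso_rmse
    print(f"relative RMSE reduction: {recovered:.2%}")           # ~13.35%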
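
The abstract also notes that the NeW-CRFs module "introduced depth prior knowledge to refine DSO's depth initialization". The sketch below illustrates only the general idea of seeding candidate-point inverse depths from a predicted depth map: the function name, the fallback constant, and the NumPy interface are assumptions made for illustration and do not reproduce the thesis code or DSO's actual internals.

    import numpy as np

    def init_inverse_depths(pixels, depth_map, fallback_idepth=1.0):
        """Seed candidate-point inverse depths from a CNN depth prior.

        pixels     : (N, 2) integer (x, y) coordinates of candidate points.
        depth_map  : (H, W) per-pixel depth predicted for the keyframe,
                     e.g. by a single-image depth network such as NeW-CRFs.
        Points with an invalid prediction (depth <= 0) fall back to an
        uninformed constant, as a prior-less initialization would.
        """
        xs, ys = pixels[:, 0], pixels[:, 1]
        depths = depth_map[ys, xs]
        idepths = np.full(len(pixels), fallback_idepth, dtype=np.float64)
        valid = depths > 0
        idepths[valid] = 1.0 / depths[valid]
        return idepths

    # Toy usage with a synthetic "prediction" standing in for the CNN output:
    rng = np.random.default_rng(0)
    depth_map = rng.uniform(0.5, 10.0, size=(480, 640))
    pixels = rng.integers(0, [640, 480], size=(100, 2))
    print(init_inverse_depths(pixels, depth_map)[:5])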


Files in this item

[PDF]

This item appears in the following collection(s)

  • Tesis
    Theses defended at the Universidad de Granada


Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International.