Object-Centric Masked Image Modelling for Self-Supervised Pre-Training in Remote Sensing object Detection

Sivakumaran, AR.; Shiva Prasanna, K.; Sai Sreeja, L.; Sai Srija, K.

dc.contributor.author	Sivakumaran, AR.
dc.contributor.author	Shiva Prasanna, K.
dc.contributor.author	Sai Sreeja, L.
dc.contributor.author	Sai Srija, K.
dc.date.accessioned	2025-04-22T10:54:31Z
dc.date.available	2025-04-22T10:54:31Z
dc.date.issued	2024-12-31
dc.identifier.citation	AR. Sivakumaran, K. Shiva Prasanna, L. Sai Sreeja, K. Sai Srija (2024). Object-Centric Masked Image Modelling for Self-Supervised Pre-Training in Remote Sensing object Detection,Vol.15(5).265-276. ISSN 1989-9572	es_ES
dc.identifier.issn	1989-9572
dc.identifier.uri	https://hdl.handle.net/10481/103727
dc.description.abstract	The proliferation of remote sensing technologies has led to an increasing demand for effective object detection in satellite and aerial imagery, with applications ranging from environmental monitoring to urban planning. Traditional methods for analyzing such imagery often rely on manual inspection, which is both time-consuming and prone to human error. While recent advancements in automated object detection have improved efficiency, these systems frequently suffer from limitations in accurately identifying and classifying objects due to their reliance on simplistic masking techniques and insufficient context understanding. In this work, we propose a novel Object-Centric Masked Image Modelling (OCMIM) algorithm designed to enhance self-supervised pre-training for remote sensing object detection. The OCMIM algorithm comprises two key components: the Object-Centric Data Generator (OCDG) and the Attention-Guided Mask Generator (AGMG). The OCDG component empowers the model to capture comprehensive object-level context information, accommodating various scales and multiple categories, thus enriching the pre-training process. Complementing this, the AGMG focuses on improving the reconstruction of object regions by intelligently masking the most attention-worthy regions instead of employing random masking, thereby enabling more accurate object detection and classification. Our proposed OCMIM algorithm leverages the strengths of existing pre-trained models such as Mask R-CNN (M-RCNN) and RetinaNet, enhancing their performance through the integration of OCDG and AGMG. For evaluation purposes, we utilized several pre-trained models, including M-RCNN and RetinaNet, and conducted experiments on diverse datasets such as NWPU, DIAR, and UCAS. Given the extensive training time required for these models, we specifically employed M-RCNN in conjunction with OCMIM for detailed experiments on the NWPU dataset.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Universidad de Granada	es_ES
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Remote sensing technology	es_ES
dc.subject	Object Detection	es_ES
dc.subject	Object-Centric Masked Image Modelling	es_ES
dc.subject	Attention-Guided Mask Generator	es_ES
dc.title	Object-Centric Masked Image Modelling for Self-Supervised Pre-Training in Remote Sensing object Detection	es_ES
dc.type	journal article	es_ES
dc.rights.accessRights	open access	es_ES
dc.type.hasVersion	VoR	es_ES

Ficheros en el ítem

Nombre:: B10+Object+Centric.pdf
Tamaño:: 760.4Kb
Formato:: PDF

Este ítem aparece en la(s) siguiente(s) colección(ones)

Vol. 15.5 (2024)

Mostrar el registro sencillo del ítem

Excepto si se señala otra cosa, la licencia del ítem se describe como Attribution-NonCommercial-NoDerivatives 4.0 Internacional