Mostrar el registro sencillo del ítem
Object-Centric Masked Image Modelling for Self-Supervised Pre-Training in Remote Sensing object Detection
dc.contributor.author | Sivakumaran, AR. | |
dc.contributor.author | Shiva Prasanna, K. | |
dc.contributor.author | Sai Sreeja, L. | |
dc.contributor.author | Sai Srija, K. | |
dc.date.accessioned | 2025-04-22T10:54:31Z | |
dc.date.available | 2025-04-22T10:54:31Z | |
dc.date.issued | 2024-12-31 | |
dc.identifier.citation | AR. Sivakumaran, K. Shiva Prasanna, L. Sai Sreeja, K. Sai Srija (2024). Object-Centric Masked Image Modelling for Self-Supervised Pre-Training in Remote Sensing object Detection,Vol.15(5).265-276. ISSN 1989-9572 | es_ES |
dc.identifier.issn | 1989-9572 | |
dc.identifier.uri | https://hdl.handle.net/10481/103727 | |
dc.description.abstract | The proliferation of remote sensing technologies has led to an increasing demand for effective object detection in satellite and aerial imagery, with applications ranging from environmental monitoring to urban planning. Traditional methods for analyzing such imagery often rely on manual inspection, which is both time-consuming and prone to human error. While recent advancements in automated object detection have improved efficiency, these systems frequently suffer from limitations in accurately identifying and classifying objects due to their reliance on simplistic masking techniques and insufficient context understanding. In this work, we propose a novel Object-Centric Masked Image Modelling (OCMIM) algorithm designed to enhance self-supervised pre-training for remote sensing object detection. The OCMIM algorithm comprises two key components: the Object-Centric Data Generator (OCDG) and the Attention-Guided Mask Generator (AGMG). The OCDG component empowers the model to capture comprehensive object-level context information, accommodating various scales and multiple categories, thus enriching the pre-training process. Complementing this, the AGMG focuses on improving the reconstruction of object regions by intelligently masking the most attention-worthy regions instead of employing random masking, thereby enabling more accurate object detection and classification. Our proposed OCMIM algorithm leverages the strengths of existing pre-trained models such as Mask R-CNN (M-RCNN) and RetinaNet, enhancing their performance through the integration of OCDG and AGMG. For evaluation purposes, we utilized several pre-trained models, including M-RCNN and RetinaNet, and conducted experiments on diverse datasets such as NWPU, DIAR, and UCAS. Given the extensive training time required for these models, we specifically employed M-RCNN in conjunction with OCMIM for detailed experiments on the NWPU dataset. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Universidad de Granada | es_ES |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Remote sensing technology | es_ES |
dc.subject | Object Detection | es_ES |
dc.subject | Object-Centric Masked Image Modelling | es_ES |
dc.subject | Attention-Guided Mask Generator | es_ES |
dc.title | Object-Centric Masked Image Modelling for Self-Supervised Pre-Training in Remote Sensing object Detection | es_ES |
dc.type | journal article | es_ES |
dc.rights.accessRights | open access | es_ES |
dc.type.hasVersion | VoR | es_ES |