Self-Supervised Learning on Small In-Domain Datasets Can Overcome Supervised Learning in Remote Sensing
Metadata
Show full item recordAuthor
Sánchez Fernández, Andrés J.; Moreno Álvarez, Sergio; Rico Gallego, Juan Antonio; Tabik, SihamEditorial
Institute of Electrical and Electronics Engineers
Materia
Deep learning Fraction estimation Land use and land cover (LULC)
Date
2024-07-02Referencia bibliográfica
A. J. Sanchez-Fernandez, S. Moreno-Álvarez, J. A. Rico-Gallego and S. Tabik, "Self-Supervised Learning on Small In-Domain Datasets Can Overcome Supervised Learning in Remote Sensing," in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 17, pp. 12797-12810, 2024, doi: 10.1109/JSTARS.2024.3421622
Sponsorship
Project SmartFoRest (in part by MCIN/AEI/10.13039/501100011033 and in part by the European Union NextGenerationEU/ PRTR under Grant TED2021-129690B-I00); Spanish Ministry of Science and Innovation under Grant PID2019-110315RB-I00 (APRISA); European Regional Development Fund (ERDF/FEDER); Junta de Extremadura under Grant IB20040; Project Thematic Center on Mountain Ecosystem & Remote sensing, Deep learning-AI e-Services University of Granada-Sierra Nevada, Ministry of Science and Innovation through the FEDER funds from the Spanish Pluriregional Operational Program 2014-2020 (POPE) under Grant LifeWatch-2019-10-UGR-01, Grant LifeWatch-ERIC action line, Grant LifeWatch-2019-10-UGR-01_WP-8, Grant LifeWatch-2019-10-UGR-01_WP-7, and Grant LifeWatch-2019-10-UGR-01_WP-4Abstract
The availability of high-resolution satellite images has accelerated the creation of new datasets designed to tackle broader remote sensing (RS) problems. Although popular tasks, such as scene classification, have received significant attention, the recent release of the Land-1.0 RS datasetmarks the initiation of endeavors to estimate land-use and land-cover (LULC) fraction values per RGB satellite image. This challenging problem involves estimating LULC composition, i.e., the proportion of different LULC classes from satellite imagery, with major applications in environmental monitoring, agricultural/urban planning, and climate change studies. Currently, supervised deep learning models—the state-of-the-art in image classification—require large volumes of labeled training data to provide good generalization. To face the challenges posed by the scarcity of labeled RS data, self-supervised learning (SSL) models have recently emerged, learning directly from unlabeled data by leveraging the underlying structure. This is the first article to investigate the performance of SSL in LULC fraction estimation on RGB satellite patches using in-domain knowledge. We also performed a complementary analysis on LULC scene classification. Specifically, we pretrained Barlow Twins, MoCov2, SimCLR, and SimSiam SSL models with ResNet-18 using the Sentinel2GlobalLULC smallRS dataset and then performed transfer learning to downstream tasks on Land-1.0. Our experiments demonstrate that SSL achieves competitive or slightly better results when trained on a smaller high-quality in-domain dataset of 194 877 samples compared to the supervised model trained on ImageNet-1k with 1 281 167 samples. This outcome highlights the effectiveness of SSL using in-distribution datasets, demonstrating efficient learning with fewer but more relevant data.