Spectral unmixing as a preprocessing step for SVM-based material identification in historical manuscripts
Metadatos
Mostrar el registro completo del ítemAutor
López Baldomero, Ana Belén; Martínez Domingo, Miguel Ángel; George, Sony; Valero Benito, Eva MaríaEditorial
Nature Portfolio
Materia
hyperspectral imaging cultural heritage historical manuscripts material identification machine learning ink classification
Fecha
2025-10-06Referencia bibliográfica
López-Baldomero, A.B., Martínez-Domingo, M.Á., George, S. et al. Spectral unmixing as a preprocessing step for SVM-based material identification in historical manuscripts. npj Herit. Sci. 13, 492 (2025). https://doi.org/10.1038/s40494-025-02029-7
Patrocinador
Open access funding provided by NTNU Norwegian University of Science and Technology (incl St. Olavs Hospital - Trondheim University Hospital). This research was supported by Grant PID2021-124446NB-I00 funded by MICIU/AEI/10.13039/501100011033 and by ERDF, EU, by the Ministry of Universities (Spain) [grant number FPU2020-05532], and byMICIU through a short-term mobility grant [grant number EST24/00040].Resumen
When performing material identification from hyperspectral images, a common challenge is the mixing of spectral signals at boundaries between materials. This study investigates spectral unmixing as a preprocessing step to improve machine learning-based classification of inks and writing supports in documents. Hyperspectral data of mock-ups and historical samples were acquired in the VNIR and SWIR ranges, including metallo-gallate, carbon-containing, and non-carbon-containing inks (sepia or mixtures with iron gall) applied to paper and parchment. A subtractive mixing model with automatic endmember extraction was used to generate presence maps and exclude pixels below a concentration threshold. Three support vector machine classifiers were trained using (1) unprocessed reflectance spectra, (2) reconstructed spectra from unmixing, and (3) pure unmixed spectra. Reconstructed spectra provided the best overall performance and classification maps, while unmixed spectra outperformed in ink identification, particularly bleed-through detection. Unmixing also revealed areas of lower classification confidence, offering potential for broader hyperspectral applications.





