Spectral unmixing as a preprocessing step for SVM-based material identification in historical manuscripts López Baldomero, Ana Belén Martínez Domingo, Miguel Ángel George, Sony Valero Benito, Eva María hyperspectral imaging cultural heritage historical manuscripts material identification machine learning ink classification When performing material identification from hyperspectral images, a common challenge is the mixing of spectral signals at boundaries between materials. This study investigates spectral unmixing as a preprocessing step to improve machine learning-based classification of inks and writing supports in documents. Hyperspectral data of mock-ups and historical samples were acquired in the VNIR and SWIR ranges, including metallo-gallate, carbon-containing, and non-carbon-containing inks (sepia or mixtures with iron gall) applied to paper and parchment. A subtractive mixing model with automatic endmember extraction was used to generate presence maps and exclude pixels below a concentration threshold. Three support vector machine classifiers were trained using (1) unprocessed reflectance spectra, (2) reconstructed spectra from unmixing, and (3) pure unmixed spectra. Reconstructed spectra provided the best overall performance and classification maps, while unmixed spectra outperformed in ink identification, particularly bleed-through detection. Unmixing also revealed areas of lower classification confidence, offering potential for broader hyperspectral applications. 2025-10-07T07:21:05Z 2025-10-07T07:21:05Z 2025-10-06 journal article López-Baldomero, A.B., Martínez-Domingo, M.Á., George, S. et al. Spectral unmixing as a preprocessing step for SVM-based material identification in historical manuscripts. npj Herit. Sci. 13, 492 (2025). https://doi.org/10.1038/s40494-025-02029-7 https://hdl.handle.net/10481/106849 10.1038/s40494-025-02029-7 eng http://creativecommons.org/licenses/by-nc-nd/4.0/ open access Attribution-NonCommercial-NoDerivatives 4.0 Internacional Nature Portfolio