Ink classification in historical documents using hyperspectral imaging and machine learning methods
Metadatos
Mostrar el registro completo del ítemAutor
López Baldomero, Ana Belén; Buzzelli, Marco; Moronta-Montero, Francisco; Martínez Domingo, Miguel Ángel; Valero Benito, Eva MaríaEditorial
Elsevier
Materia
Ink classification historical documents hyperspectral imaging material identification machine learning approach data fusion cultural heritage
Fecha
2025-02Referencia bibliográfica
López-Baldomero, A. B. et al. (2025). Ink classification in historical documents using hyperspectral imaging and machine learning methods. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 125916. https://doi.org/10.1016/j.saa.2025.125916
Patrocinador
MICIU/AEI/10.13039/501100011033 PID2021-124446NB-I00; ERDF, EU; Ministry of Universities (Spain) [FPU2020-05532]; “ESF Investing in your future” [PRE2022-101352]; Ministry for Universities and Research (MUR) (Italy)Resumen
Ink identification using only spectral reflectance information poses significant challenges due to material degradation, aging, and spectral overlap between ink classes. This study explores the use of hyperspectral imaging and machine learning techniques to classify three distinct types of inks: pure metallo-gallate, carbon-containing, and non-carbon-containing inks. Six supervised classification models, including five traditional algorithms (Support Vector Machines, K-Nearest Neighbors, Linear Discriminant Analysis, Random Forest, and Partial Least Squares Discriminant Analysis) and one Deep Learning-based model, were evaluated. The methodology integrates data fusion from different imaging systems, sample extraction, ground truth creation, and a post-processing step to increase uniformity. The evaluation was performed using both mock-up samples and historical documents, achieving micro-averaged accuracy above 90% for all models. The best performance was obtained using the DL-based model (98% F1-score), followed by the Support Vector Machine model. In the case study documents, the overall performance of the traditional model was better. This study highlights the potential of hyperspectral imaging combined with machine learning for non-invasive ink identification and mapping, even under challenging conditions, contributing to the conservation and analysis of historical manuscripts.