Towards Improving Skin Cancer Diagnosis by Integrating Microarray and RNA-Seq Datasets
Metadatos
Mostrar el registro completo del ítemAutor
Gálvez Gómez, Juan Manuel; Castillo Secilla, Daniel; Herrera Maldonado, Luis Javier; Valenzuela Cansino, Olga; Caba Pérez, Octavio; Prados Salazar, José Carlos; Ortuño, Francisco M.; Rojas Ruiz, IgnacioEditorial
Institute of Electrical and Electronics Engineers (IEEE)
Materia
Gene expression Transcriptomic technologies Machine learning
Fecha
2019-10-11Referencia bibliográfica
J. M. Gálvez et al., "Towards Improving Skin Cancer Diagnosis by Integrating Microarray and RNA-Seq Datasets," in IEEE Journal of Biomedical and Health Informatics, vol. 24, no. 7, pp. 2119-2130, July 2020, doi: 10.1109/JBHI.2019.2953978
Patrocinador
Government of Andalusia under Grant P12–TIC–2082 as part of the development of the research project “Advanced Computer Systems in Applications in the field of Biotechnology and Bioinformatics”; Government of Spain under Grant RTI2018-101674-B-I00; Feder Andalusia Operational Program Framework under Grant B-TIC-414-UGR18Resumen
Many clinical studies have revealed the high biological similarities existing among different skin pathological states. These similarities create difficulties in the efficient diagnosis of skin cancer, and encourage to study and design new intelligent clinical decision support systems. In this sense, gene expression analysis can help find differentially expressed genes (DEGs) simultaneously discerning multiple skin pathological states in a single test. The integration of multiple heterogeneous transcriptomic datasets requires different pipeline stages to be properly designed: from suitable batch merging and efficient biomarker selection to automated classification assessment. This article presents a novel approach addressing all these technical issues, with the intention of providing new sights about skin cancer diagnosis. Although new future efforts will have to be made in the search for better biomarkers recognizing specific skin pathological states, our study found a panel of 8 highly relevant multiclass DEGs for discerning up to 10 skin pathological states: 2 healthy skin conditions a priori, 2 cataloged precancerous skin diseases and 6 cancerous skin states. Their power of diagnosis over new samples was widely tested by previously well-trained classification models. Robust performance metrics such as overall and mean multiclass F1-score outperformed recognition rates of 94% and 80%, respectively. Clinicians should give special attention to highlighted multiclass DEGs that have high gene expression changes present among them, and understand their biological relationship to different skin pathological states.