A hypothesis-driven method based on machine learning for neuroimaging data analysis
Metadatos
Mostrar el registro completo del ítemAutor
Gorriz Sáez, Juan Manuel; García Puntonet, Carlos; Ramírez Pérez De Inestrosa, Javier; SIPBA groupEditorial
Elsevier
Materia
General Linear Model Linear Regression Model Support Vector Regression permutation tests Magnetic resonance imaging Random Field Theory
Fecha
2022-09-09Referencia bibliográfica
J.M. Gorriz... [et al.]. A hypothesis-driven method based on machine learning for neuroimaging data analysis, Neurocomputing, Volume 510, 2022, Pages 159-171, ISSN 0925-2312, [https://doi.org/10.1016/j.neucom.2022.09.001]
Patrocinador
MCIN/AEI; FEDER ``Una manera de hacer Europa" RTI2018-098913-B100; Junta de Andalucia; European Commission CV20-45250 A-TIC-080-UGR18 B-TIC586-UGR20 P20-00525; research project ACACIA US-1264994; European Commission; Junta de Andalucia (Consejeria de Economia, Conocimiento, Empresas y Universidad)Resumen
There remains an open question about the usefulness and the interpretation of machine learning (ML)
approaches for discrimination of spatial patterns of brain images between samples or activation states.
In the last few decades, these approaches have limited their operation to feature extraction and linear
classification tasks for between-group inference. In this context, statistical inference is assessed by randomly
permuting image labels or by the use of random effect models that consider between-subject variability.
These multivariate ML-based statistical pipelines, whilst potentially more effective for detecting
activations than hypotheses-driven methods, have lost their mathematical elegance, ease of interpretation,
and spatial localization of the ubiquitous General linear Model (GLM). Recently, the estimation of
the conventional GLM parameters has been demonstrated to be connected to an univariate classification
task when the design matrix in the GLM is expressed as a binary indicator matrix. In this paper we
explore the complete connection between the univariate GLM and ML-based regressions. To this purpose
we derive a refined statistical test with the GLM based on the parameters obtained by a linear Support
Vector Regression (SVR) in the inverse problem (SVR-iGLM). Subsequently, random field theory (RFT) is
employed for assessing statistical significance following a conventional GLM benchmark. Experimental
results demonstrate how parameter estimations derived from each model (mainly GLM and SVR) result
in different experimental design estimates that are significantly related to the predefined functional task.
Moreover, using real data from a multisite initiative the proposed ML-based inference demonstrates statistical
power and the control of false positives, outperforming the regular GLM.