A connection between pattern classification by machine learning and statistical inference with the General Linear Model
release_iyjb7gf4jvcg3lpqlcevhab3t4
by
Juan-Manuel Gorriz,
John Suckling,
Javier Ramirez,
Carmen Jimenez-Mesa,
Fermin Segovia
2021 Volume 26, Issue 11, p1-1
Abstract
A connection between the general linear model (GLM) with frequentist statistical testing and machine learning (MLE) inference is derived and illustrated. Initially, the estimation of GLM parameters is expressed as a Linear Regression Model (LRM) of an indicator matrix; that is, in terms of the inverse problem of regressing the observations. Both approaches, i.e. GLM and LRM, apply to different domains, the observation and the label domains, and are linked by a normalization value in the least-squares solution. Subsequently, we derive a more refined predictive statistical test: the linear Support Vector Machine (SVM), that maximizes the class margin of separation within a permutation analysis. This MLE-based inference employs a residual score and associated upper bound to compute a better estimation of the actual (real) error. Experimental results demonstrate how parameter estimations derived from each model result in different classification performance in the equivalent inverse problem. Moreover, using real data, the MLE-based inference including model-free estimators demonstrates an efficient trade-off between type I errors and statistical power.
In text/plain
format
Archived Files and Locations
application/pdf
27.0 MB
file_thtizj2gvrbmtc4wpvycdhhd7e
|
ieeexplore.ieee.org (publisher) web.archive.org (webarchive) |
access all versions, variants, and formats of this works (eg, pre-prints)
Crossref Metadata (via API)
Worldcat
SHERPA/RoMEO (journal policies)
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar