A1 Refereed original research article in a scientific journal
Prediction with a flexible finite mixture-of-regressions
Authors: Ilmari Ahonen, Jaakko Nevalainen, Denis Larocque
Publisher: Elsevier B.V.
Publication year: 2019
Journal: Computational Statistics and Data Analysis
Journal name in source: Computational Statistics and Data Analysis
Volume: 132
First page : 212
Last page: 224
Number of pages: 13
ISSN: 0167-9473
eISSN: 1872-7352
DOI: https://doi.org/10.1016/j.csda.2018.01.012
Finite mixture regression (FMR) is widely used for modeling data that originate from heterogeneous populations. In these settings, FMR can offer increased predictive power compared to more traditional one-class models. However, existing FMR methods rely heavily on mixtures of linear models, where the linear predictor must be given as an input. A flexible FMR model is presented using a combination of the random forest learner and a penalized linear FMR. The performance of the new method is assessed by predictive log-likelihood in extensive simulation studies. The method is shown to achieve equal performance with the existing FMR methods when the true regression functions are in fact linear and superior performance in cases where at least one of the regression functions is nonlinear. The method can handle a large number of covariates, and its predictive ability is not greatly affected by surplus variables.