MIXING IT UP: NEW METHODS FOR FINITE MIXTURE MODELLING OF MULTI-SPECIES DATA IN ECOLOGY

FRANCIS K. C. HUI
2015 Bulletin of the Australian Mathematical Society  
In this thesis, new methods are proposed for using finite mixture models to analyse multi-species data in ecology. Developments range from theoretical results to empirical studies, offering contributions to the literatures of finite mixture models, species distribution models, variable selection, cluster analysis and ordination. To begin, a comparison on several real datasets demonstrates that mixture models offer better predictions of how species communities respond to the environment,
more » ... to modelling species separately. This is achieved by borrowing strength across species-organisms with similar environmental responses are clustered together, forming a small number of archetypal responses. These results have appeared in [5] . A major challenge in applying mixture models generally is model selection, and two important contributions are made on this front. On how to choose the number of mixture components, complete likelihood information criteria (despite being a popular approach) are shown to potentially underfit the true number of components. As an alternative, a new observed likelihood information criterion is proposed, which is proven to be order consistent (see [3] ). On how to choose the variables to enter into each component, two new penalties are proposed that exploit the grouped structure of covariates in mixture of regression models, leading to desirable asymptotic and finite sample properties (see [4] ). The performance of all penalised likelihood methods depends critically on the choice of tuning parameter. In the case of adaptive LASSO regression, a new information criterion is proposed for tuning parameter selection that, unlike previous criteria, explicitly accounts for the effect of penalisation on the bias-variance tradeoff. The proposed criterion is shown to outperform many currently used criteria in selecting the tuning parameter (see [2] ).
doi:10.1017/s0004972715000945 fatcat:2uj3x7jsrzcsjixwt256et7k7e