Conditional Two Level Mixture with Known Mixing Proportions: Applications to School and Student Level Overweight and Obesity Data from Birmingham, England

Authors

  • Shakir Hussain, School of Health and Population Science, University of Birmingham, Birmingham, UK
  • Mehdi AL-Alak, Central Organization for Statistics, Baghdad, Iraq
  • Ghazi Shukur, Department of Economic and Statistics, Linnaeus University, Sweden

DOI:

https://doi.org/10.6000/1929-6029.2014.03.03.9

Keywords:

Parametric Expectation Maximization, Multilevel Mixture, Conditional Multilevel Mixture Known Mix, Overweight and Obesity Data

Abstract

Two Level (TL) models allow the total variation in the outcome to be decomposed into level-one (individual) and level-two (group) variance components. Two Level Mixture (TLM) models can be used to explore unobserved heterogeneity, that is, latent classes in which the outcome follows qualitatively different relationships.
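As an illustration of this variance decomposition, a random-intercept two-level model can be written as below. The notation is a standard textbook form and an assumption here, not taken from the paper: $y_{ij}$ is the outcome for student $i$ in school $j$, $x_{ij}$ a student-level covariate, $u_j$ the school-level random effect and $e_{ij}$ the student-level residual.

```latex
\begin{align}
  y_{ij} &= \beta_0 + \beta_1 x_{ij} + u_j + e_{ij},
  \qquad u_j \sim N(0, \sigma_u^2), \quad e_{ij} \sim N(0, \sigma_e^2), \\
  \operatorname{Var}(y_{ij} \mid x_{ij}) &= \sigma_u^2 + \sigma_e^2, \\
  \rho &= \frac{\sigma_u^2}{\sigma_u^2 + \sigma_e^2}.
\end{align}
```

Under the usual assumption that $u_j$ and $e_{ij}$ are independent, $\rho$ (the intraclass correlation) is the share of the total variance attributable to the group level.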

In this paper, we extend the standard TL model by introducing constraints that guide the TLM algorithm towards a more appropriate partitioning of the data. Our constraint-based method combines the mixing proportions, estimated by parametric Expectation Maximization (EM) applied to the outcome, with the random component from the TL model. Using this prior information yields a new conditional two-level mixture (TLMc) approach. The new framework has three advantages: (1) it avoids the trial-and-error tactic used by TLM for choosing the model with the best Bayesian Information Criterion (BIC); (2) it permits meaningful parameter estimates for distinct classes in the coefficient space; and (3) it allows smaller residual variances. We show the benefit of our method using overweight and obesity data derived from the Body Mass Index (BMI) of students in year 6. We apply these methods to hierarchical BMI data to estimate student multiple deprivation effects and school Club effects.
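To make the idea of estimating mixing proportions by parametric EM concrete, here is a minimal sketch for a two-component univariate Gaussian mixture. It is not the authors' TLMc procedure: the function names, the starting values, and the synthetic BMI-like data (component means 17 and 24, mixing proportions 0.7 and 0.3) are all assumptions made purely for illustration.

```python
import math
import random


def normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def em_two_gaussians(y, n_iter=500, tol=1e-9):
    """EM for a two-component Gaussian mixture (illustrative sketch).

    Returns (pi, mu, sigma): mixing proportions, means and standard
    deviations of the two components.
    """
    n = len(y)
    ys = sorted(y)
    half = n // 2
    # Crude starting values: means of the lower and upper halves of the data.
    mu = [sum(ys[:half]) / half, sum(ys[half:]) / (n - half)]
    mean_all = sum(ys) / n
    sd_all = math.sqrt(sum((v - mean_all) ** 2 for v in ys) / n)
    sigma = [sd_all, sd_all]
    pi = [0.5, 0.5]
    prev_ll = -math.inf

    for _ in range(n_iter):
        # E-step: posterior probability that each observation belongs
        # to each component, given the current parameters.
        resp = []
        ll = 0.0
        for v in y:
            w = [pi[k] * normal_pdf(v, mu[k], sigma[k]) for k in range(2)]
            total = w[0] + w[1]
            resp.append((w[0] / total, w[1] / total))
            ll += math.log(total)

        # M-step: update proportions, means and standard deviations
        # using the responsibilities as weights.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            pi[k] = nk / n
            mu[k] = sum(r[k] * v for r, v in zip(resp, y)) / nk
            var = sum(r[k] * (v - mu[k]) ** 2 for r, v in zip(resp, y)) / nk
            sigma[k] = math.sqrt(var)

        # Stop once the log-likelihood (evaluated in the E-step) stabilises.
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll

    return pi, mu, sigma


# Synthetic, BMI-like data (hypothetical values, not the Birmingham data).
random.seed(0)
y = [random.gauss(17.0, 1.5) for _ in range(700)] + \
    [random.gauss(24.0, 2.5) for _ in range(300)]

pi, mu, sigma = em_two_gaussians(y)
print("mixing proportions:", [round(p, 3) for p in pi])
print("component means:", [round(m, 2) for m in mu])
```

In a workflow of the kind the abstract describes, proportions estimated this way would then be held fixed as prior information when fitting the two-level mixture, rather than being searched over by comparing BIC values across candidate models.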

Published

2014-08-05

How to Cite

Hussain, S., AL-Alak, M., & Shukur, G. (2014). Conditional Two Level Mixture with Known Mixing Proportions: Applications to School and Student Level Overweight and Obesity Data from Birmingham, England. International Journal of Statistics in Medical Research, 3(3), 298–308. https://doi.org/10.6000/1929-6029.2014.03.03.9

Section

General Articles