Determinants factors in predicting life expectancy using machine learning

Бесплатный доступ

Introduction. Life expectancy is, by definition, the average number of years a person can expect to live from birth to death. It is therefore the best indicator for assessing the health of human beings, but also a comprehensive index for assessing the level of economic development, education and health systems . From our extensive research, we have found that most existing studies contain qualitative analyses of one or a few factors. There is a lack of quantitative analyses of multiple factors, which leads to a situation where the predominant factor influencing life expectancy cannot be identified with precision. However, with the existence of various conditions and complications witnessed in society today, several factors need to be taken into consideration to predict life expectancy. Therefore, various machine learning models have been developed to predict life expectancy. The aim of this article is to identify the factors that determine life expectancy. Materials and Methods. Our research uses the Pearson correlation coefficient to assess correlations between indicators, and we use multiple linear regression models, Ridge regression, and Lasso regression to measure the impact of each indicator on life expectancy . For model selection, the Akaike information criterion, the coefficient of variation and the mean square error were used. R2 and the mean square error were used. Results. Based on these criteria, multiple linear regression was selected for the development of the life expectancy prediction model, as this model obtained the smallest Akaike information criterion of 6109.07, an adjusted coefficient of 85 % and an RMSE of 3.85. Conclusion and Discussion. At the end of our study, we concluded that the variables that best explain life expectancy are adult mortality, infant mortality, percentage of expenditure, measles, under-five mortality, polio, total expenditure, diphtheria, HIV / AIDS, GDP, longevity of 1.19 years, resource composition, and schooling. The results of this analysis can be used by the World Health Organization and the health sectors to improve society.

Еще

Life expectancy, machine learning, machine learning models

Короткий адрес: https://sciup.org/142236332

IDR: 142236332   |   DOI: 10.23947/2687-1653-2022-22-4-373-383

Список литературы Determinants factors in predicting life expectancy using machine learning

  • Arias E. United States Life Tables, 2009. National Vital Statistics Reports. 2014; 62:1-63.
  • Yafei Wu, Ke Hu, Yaofeng Han, et al. Spatial Characteristics of Life Expectancy and Geographical Detection of Its Influencing Factors in China. International Journal of Environmental Research and Public Health. 2020;17:906. https://doi.org/10.3390/ijerph17030906
  • Ming Wen, Danan Gu. Air Pollution Shortens Life Expectancy and Health Expectancy for Older Adults: The Case of China. The Journals of Gerontology: Series A. 2012;67:1219-1229. https://doi.org/10.1093/gerona/gls094
  • Cervantes PAM, López NR, Rambaud SC. The Relative Importance of Globalization and Public Expenditure on Life Expectancy in Europe: An Approach Based on MARS Methodology. International Journal of Environmental Research and Public Health. 2020;17:8614. http://dx.doi.org/10.3390/iierph17228614
  • Reynolds MM, Avendano M. Social Policy Expenditures and Life Expectancy in High-Income Countries. American Journal of Preventive Medicine. 2018;54:72-79. https://doi.org/10.1016/i.amepre.2017.09.001
  • Sede IP, Ohemeng W. Socio-economic determinants of life expectancy in Nigeria (1980-2011). Health Economics Review. 2015;5:1-11.
  • Daquan Huang, Shuimiao Yang, Tao Liu. Life Expectancy in Chinese Cities: Spatially Varied Role of Socioeconomic Development, Population Structure, and Natural Conditions. International Journal of Environmental Research and Public Health. 2020;17:6597. http://dx.doi.org/10.3390/iierph17186597
  • Okamoto, K. Life Expectancy at Age 65 and Environmental Factors: An Ecological Study in Japan. Archives of Gerontology and Geriatrics. 2006;43:85-91. http://dx.doi.org/10.1016/i.archger.2005.09.005
  • WHO. Ambient Air Pollution: A Global Assessment of Exposure and Burden of Disease. World Health Organization, 2016. 121 p. https://apps.who.int/iris/handle/10665/250141
  • Xinjie Zha, Yuan Tian, Xing Gao, et al. Quantitatively Evaluate the Environmental Impact Factors of the Life Expectancy in Tibet, China. Environmental Geochemistry and Health. 2019;41:1507-1520. https ://link. springer. com/article/10.1007/s10653-018-0211-z
  • Nkalu CN, Edeme RK. Environmental Hazards and Life Expectancy in Africa: Evidence from GARCH Model. SAGE Open; 2019, 9. https://doi.org/10.1177/2158244019830500
  • Inglehart Ronald, Christian Welzel. How Development Leads to Democracy What We Know About Modernization. Foreign Affairs. 2009;88:33-48.
  • Cockerham WC. The Social Determinants of the Decline of Life Expectancy in Russia and Eastern Europe: A Lifestyle Explanation. Journal of Health and Social Behavior. 1997;38:117-130.
  • Jessica Y Ho, Arun S Hendi. Recent Trends in Life Expectancy across High Income Countries: Retrospective Observational Study. BMJ. 2018;362:k2562. https://doi.org/10.1136/bmi.k2562
  • Penuelas J, Krisztin T, Obersteiner M, et al. Country-Level Relationships of the Human Intake of N and P, Animal and Vegetable Food, and Alcoholic Beverages with Cancer and Life Expectancy. International Journal of Environmental Research and Public Health. 2020;17:7240. https://doi.org/10.3390/iierph17197240
  • Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine Learning in Medicine: A Practical Introduction. BMC Medical Research Methodology. 2019;19:1-18. https://bmcmedresmethodol.biomedcentral.com/ articles/10.1186/s12874-019-0681-4
  • Malpe V, Tugaonkar P. Machine Learning Trends in Medical Sciences. In: Proc. 2nd International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud), August 2018. P. 495-499.
  • KumarRaiarshi. WHO. Life Expectancy. Statistical Analysis on Factors Influencing Life Expectancy. https://www.kaggle.com/datasets/kumaraiarshi/life-expectancy-who
  • Thu Pham-Gia, Vartan Choulakian. Distribution of the Sample Correlation Matrix and Applications. Open Journal of Statistics. 2014;4:48571.10.4236/ois.2014.45033
  • Svensson K. Predicting Life Expectancy Using Machine Learning. 2018. https://www.semanticscholar.org/paper/Predicting-Life-Expectancy-Using-Machine-Learning-Svensson/984adbb5aee 16d38a6686895dda2afd3087b2261
  • Müller AC, Guido S. Introduction to Machine Learning with Python.,1st ed. O'Reilly Media, Inc.; 2016.
  • Ki-Young Lee, Kyu-Ho Kom, Jeong-Jin Kang, et al. Comparison and Analysis of Linear Regression and Artificial g Neural Network. International Journal of Applied Engineering Research. 2017;12:9820-9825.
  • Paraqape RS, Challacombe SJ. HIV/AIDS in India: An Overview of the Indian Epidemic. Oral Diseases. 2016;22:10-14. http://dx.doi.org/10.1111/odi. 12457
  • Haebong Woo. Patterns and Evolution of Life Span Inequality Using the Gini Coefficient. Pogön Sahoe Yön'gu. 2013;33:419-451. 10.15709/hswr.2013.33.4.419. https://www.researchgate.net/publication/275246163 Patterns and Evolution of Life Span Inequality Using the Gini Coefficient
  • Chen Wu. Human Capital, Life Expectancy, and the Environment. Journal of International Trade and Economic Development. 2017;26:885-906.
  • Carolina Cosculluela-Martínez, Raquel Ibar Alonso, Geoffrey JD Hewings. Life Expectancy Index: Age Structure of the Population and Environmental Change. Social Indicators Research. 2019; 142:507-522.
  • Muhamad Haroon Shah, Nianyong Wang, Irfan Ullah, et al. Does Environment Quality and Public Spending on Environment Promote Life Expectancy in China? Evidence from a Nonlinear Autoregressive Distributed Lag Approach. International Journal of Health Planning and Management. 2021;36:545-560. http://dx.doi.org/10.1002/hpm.3100
  • Mariani F, Perez-Barahona A, Raffin, N. Life Expectancy and the Environment. Journal of Economic Dynamics and Control. 2010;34:798-815.
  • Tuljapurkar Sh, Horvitz CC. From Stage to Age in Variable Environments: Life Expectancy and Survivorship. Ecology. 2006;87:1497-1509. http://dx.doi.org/10.1890/0012-9658(2006)87|1497:FSTAIV|2.0.CO;2
  • Kampa M, Castanas E. Human Health Effects of Air Pollution. Environmental Pollution. 2008;151:362-367. http://dx.doi.org/10.1016/j.envpol.2007.06.012
  • Tagaris E, Kuo-Jen Liao, DeLucia AJ, et al. Potential Impact of Climate Change on Air Pollution-Related Human Health Effects. Environmental Science and Technology. 2009;43:4979-4988.
  • Anderson JO, Thundiyil JG, Stolbach, A. Clearing the Air: A Review of the Effects of Particulate Matter Air Pollution on Human Health. Journal of Medical Toxicology. 2012;8:166-175. http://dx.doi.org/10.1007/s13181-011-0203-1
  • Wuffle A, Brians CL, Coulter K. Taking the Temperature: Implications for the Adoption of Election Day Registration, State level Voter Turnout, and Life Expectancy. PS: Political Science & Politics. 2012;45:78-82.
  • Brunner E, Maruyama K. SP4-32 Health and Sustainability: An International Ecological Study of Caibon Dioxide Emissions and Life Expectancy. Journal of Epidemiology and Community Health. 2011;65 :A442-A443.
  • Clootens N. Public Debt, Life Expectancy, and the Environment. Environmental Modeling & Assessment. 2017;22:267-278.
  • Tetzlaff F, Epping J, Sperlich S, et al. Widening Income Inequalities in Life Expectancy? Time Trend Analysis Using German Health Insurance Data. Journal of Epidemiology and Community Health. 2020;74:592-597. 10.1136/jech-2019-212966
Еще
Статья научная