Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/29158
Title: Using Machine Learning To Predict Type 2 Diabetes With Self-Controllable Lifestyle Risk Factors
Authors: Zhao, Xubin
Advisor: Zargoush, Manaf
Department: Computational Engineering and Science
Keywords: Machine Learning, Diabetes, Lifestyle Risk Factors
Publication Date: 2023
Abstract: Globally, the prevalence of diabetes has seen a significant increase, rising from 211 million in 1990 (3.96% of the global population at that time) to 476 million in 2017 (6.31% of the global population in 2017). Extensive research has been conducted to study the causes of diabetes from a data-driven approach, leading to the development of prospective models for predicting future diabetes risks. These studies have highlighted the strong correlation between diabetes and various biomarker factors, such as BMI, age, and certain blood test measures. However, there is a lack of research that focuses on building prospective models to predict future diabetes risks based on lifestyle factors. Therefore, this thesis aims to employ popular machine learning methods to investigate whether it is possible to predict future diabetes using prospective models that incorporate self-controllable lifestyle factors. Our analysis produced remarkable results, with the biomarker model achieving an average validation AUC score of 0.78, while the lifestyle model reached 0.70. Notably, lifestyle features demonstrate a greater predictive capacity for short-term new-onset diabetes when compared to the long-term endpoint. The biomarker model identified visceral fat as the most significant risk factor, whereas income level and employment emerged as the top risk factors in the lifestyle model. This thesis represents an innovative approach to diabetes prediction by leveraging lifestyle factors, providing valuable data-driven insights into the root causes of diabetes. It addresses a critical research gap by highlighting the significant role of lifestyle factors in predicting the future onset of diabetes, particularly within the context of parametric modeling.
URI: http://hdl.handle.net/11375/29158
Appears in Collections:Open Access Dissertations and Theses

Files in This Item:
File Description SizeFormat 
ZHAO_XUBIN_202311_MSc.pdf
Access is allowed from: 2024-11-03
1.13 MBAdobe PDFView/Open
Show full item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue