A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Using Machine Learning Techniques to Identify Key Risk Factors for Diabetes and Undiagnosed Diabetes
[article]
2021
arXiv
pre-print
This paper reviews a wide selection of machine learning models built to predict both the presence of diabetes and the presence of undiagnosed diabetes using eight years of National Health and Nutrition Examination Survey (NHANES) data. Models are tuned and compared via their Brier Scores. The most relevant variables of the best performing models are then compared. A Support Vector Machine with a linear kernel performed best for predicting diabetes, returning a Brier score of 0.0654 and an AUROC
arXiv:2105.09379v1
fatcat:3ugmxnepivdvvph4hbldu2ipc4