XGBoost in handling missing values for life insurance risk prediction

Deandra Aulia Rusdah, Hendri Murfi
2020 SN Applied Sciences  
Insurance risk prediction is carried out to classify the levels of risk in insurance industries. From the machine learning point of view, the problem of risk level prediction is a multi-class classification. To classify the risk, a machine learning model will predict the level of applicant's risk based on historical data. In the insurance applicant's historical data, there will be the possibility of missing values so that it is necessary to deal with these problems to provide better
more » ... XGBoost is a machine learning method that is widely used for classification problems and can handle missing values without an imputation preprocessing. This paper analyzed the performance of the XGBoost model in handling the missing values for risk prediction in life insurance. The simulations show that the XGBoost model without any imputation preprocessing gives a comparable accuracy to one of the XGBoost models with an imputation preprocessing.
doi:10.1007/s42452-020-3128-y fatcat:n7rg6crdxbdz3bu3awasbeb6ky