Extreme Gradient Boosting Model-based Forecasting of Big Data Online Sales Record

Gagan Sharma, Sunil Patil
2022 SAMRIDDHI: A Journal of Physical Sciences, Engineering and Technology
Big data plays a crucial role in helping many online e-commerce businesses generate more sales. It is a large collection of data and information that many organizations use to forecast which products, costs, and advertisements will maximize their business profits. This paper applies an extreme gradient boosting (XGBoost) model to forecast the sales growth of online products, specifically books and magazines, from massive datasets present in online
... ng. PySpark is used for data analysis as the most suitable and compatible framework. The results show that the proposed model achieves higher forecasting accuracy and a lower error rate than other models. A comparative visualization and conclusion are presented in terms of the proposed system's prediction accuracy, error rate, and efficiency.
doi:10.18090/samriddhi.v14i01.18 fatcat:qdlkfyuwv5f3vbsxx3b3hizxq4