Clustering and classification to characterize daily electricity demand
시간단위 전력사용량 시계열 패턴의 군집 및 분류분석

Dain Park, Sanghoo Yoon
2017 Journal of the Korean Data and Information Science Society  
The purpose of this study is to identify the pattern of daily electricity demand through clustering and classification. The hourly data was collected by KPS (Korea Power Exchange) between 2008 and 2012. The time trend was eliminated for conducting the pattern of daily electricity demand because electricity demand data is times series data. We have considered k-means clustering, Gaussian mixture model clustering, and functional clustering in order to find the optimal clustering method. The
more » ... fication analysis was conducted to understand the relationship between external factors, day of the week, holiday, and weather. Data was divided into training data and test data. Training data consisted of external factors and clustered number between 2008 and 2011. Test data was daily data of external factors in 2012. Decision tree, random forest, Support vector machine, and Naive Bayes were used. As a result, Gaussian model based clustering and random forest showed the best prediction performance when the number of cluster was 8.
doi:10.7465/jkdi.2017.28.2.395 fatcat:rp4rthdrerdkdhpm7oscjspwqi