A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Improvement of training set structure in fusion data cleaning using Time-Domain Global Similarity method
2017
Journal of Instrumentation
Traditional data cleaning identifies dirty data by classifying original data sequences, which is a class-imbalanced problem since the proportion of incorrect data is much less than the proportion of correct ones for most diagnostic systems in Magnetic Confinement Fusion (MCF) devices. When using machine learning algorithms to classify diagnostic data based on class-imbalanced training set, most classifiers are biased towards the major class and show very poor classification rates on the minor
doi:10.1088/1748-0221/12/10/c10004
fatcat:m6rsqtxb2fhvbgbmtqxoo2dzwq