A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
An unsupervised classification process for large datasets using web reasoning
2016
Proceedings of the International Workshop on Semantic Big Data - SBD '16
Determining valuable data among large volumes of data is one of the main challenges in Big Data. We aim to extract knowledge from these sources using a Hierarchical Multi-Label Classification process called Semantic HMC. This process automatically learns a label hierarchy and classifies items from very large data sources. Five steps compose the Semantic HMC process: Indexation, Vectorization, Hierarchization, Resolution and Realization. The first three steps construct automatically the label
doi:10.1145/2928294.2928301
dblp:conf/sigmod/PeixotoHCBS16
fatcat:f6ina2i74rfmrpzk76z45wmrnu