NELL2RDF: Reading the Web, and Publishing it as Linked Data [article]

José M. Giménez-García, Maísa Duarte, Antoine Zimmermann, Christophe Gravier, Estevam R. Hruschke Jr., Pierre Maret
2018 arXiv   pre-print
NELL is a system that continuously reads the Web to extract knowledge in form of entities and relations between them. It has been running since January 2010 and extracted over 50,000,000 candidate statements. NELL's generated data comprises all the candidate statements together with detailed information about how it was generated. This information includes how each component of the system contributed to the extraction of the statement, as well as when that happened and how confident the system
more » ... s in the veracity of the statement. However, the data is only available in an ad hoc CSV format that makes it difficult to exploit out of the context of NELL. In order to make it more usable for other communities, we adopt Linked Data principles to publish a more standardized, self-describing dataset with rich provenance metadata.
arXiv:1804.05639v1 fatcat:bmchwzpa7nbdnlcx73x6kj4gpy