Cosema: Content-based Semantic Annotator release_bwc4gxycebfaxie52yxsrwct6m

by Angela Fogarolli

Released as a article-journal .

Abstract

In this paper, we present a library for creating automatic annotations for entities and concepts inside any textual content. The tool is based on DBpedia. In particular, the annotations are generated using the DBpedia link structure as a source of knowledge for Word Sense Disambiguation. DBpedia is used as a reference to obtain information on lexicographic relationships. By using such information in combination with statistical information extraction techniques, it is possible to deduce concepts related to the terms extracted from a corpus. Moreover, by combining statistical information extraction with named entity recognition and the use of the OKKAM ENS infrastructure, it is also possible to obtain unique annotations for entities in the content. The advantage of this approach, in addition of improving information retrieval and categorization capabilities, consists in the fact that the generate concept and entity annotations can be referred to with unique identifiers around the Web. For this reason different description for the same entity or concept can be semantically aggregated from the Web.
In text/plain format

Archived Files and Locations

application/pdf   67.5 kB
file_dt3bqc7x6nhprhk3i2rhljem64
biss.pensoft.net (web)
web.archive.org (webarchive)
application/pdf   66.5 kB
file_zijafbvnw5ddbdw4or6tfsbtji
ris.utwente.nl (web)
web.archive.org (webarchive)
application/pdf   152.4 kB
file_jle5ccx3tndj5kvgewtkayzgdq
web.archive.org (webarchive)
www.thinkmind.org (web)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   unknown
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: b01f9c8d-3d5e-4074-ad58-2c24ecb123a6
API URL: JSON