Cosema: Content-based Semantic Annotator release_bwc4gxycebfaxie52yxsrwct6m

by Angela Fogarolli

Entity Metadata (schema)

abstracts[] {'sha1': '731e562d4ece7856c9bafd1ad6e290777f64602d', 'content': 'In this paper, we present a library for creating automatic annotations for entities and concepts inside any textual content. The tool is based on DBpedia. In particular, the annotations are generated using the DBpedia link structure as a source of knowledge for Word Sense Disambiguation. DBpedia is used as a reference to obtain information on lexicographic relationships. By using such information in combination with statistical information extraction techniques, it is possible to deduce concepts related to the terms extracted from a corpus. Moreover, by combining statistical information extraction with named entity recognition and the use of the OKKAM ENS infrastructure, it is also possible to obtain unique annotations for entities in the content. The advantage of this approach, in addition of improving information retrieval and categorization capabilities, consists in the fact that the generate concept and entity annotations can be referred to with unique identifiers around the Web. For this reason different description for the same entity or concept can be semantically aggregated from the Web.', 'mimetype': 'text/plain', 'lang': None}
container
container_id
contribs[] {'index': 0, 'creator_id': None, 'creator': None, 'raw_name': 'Angela Fogarolli', 'given_name': None, 'surname': None, 'role': 'author', 'raw_affiliation': None, 'extra': None}
ext_ids {'doi': None, 'wikidata_qid': None, 'isbn13': None, 'pmid': None, 'pmcid': None, 'core': None, 'arxiv': None, 'jstor': None, 'ark': None, 'mag': None, 'doaj': None, 'dblp': None, 'oai': None, 'hdl': None}
files[] {'state': 'active', 'ident': 'dt3bqc7x6nhprhk3i2rhljem64', 'revision': '6ee560c0-90a2-4e34-8401-cf22da83a0c9', 'redirect': None, 'extra': None, 'edit_extra': None, 'size': 67505, 'md5': '1918c60f7aa53a114a29306187c057fb', 'sha1': '37be274c2fd1ec202466002dca1fda58369099e5', 'sha256': '9e71255dd453198e3a2373d2ae0ffbf536cc4edeb51cbb6885a184240ae5c613', 'urls': [{'url': 'https://biss.pensoft.net/lib/ajax_srv/generate_pdf.php?document_id=37223&readonly_preview=1&file_id=0', 'rel': 'web'}, {'url': 'https://web.archive.org/web/20200216123955/https://biss.pensoft.net/lib/ajax_srv/generate_pdf.php?document_id=37223&readonly_preview=1&file_id=0', 'rel': 'webarchive'}], 'mimetype': 'application/pdf', 'content_scope': None, 'release_ids': ['bwc4gxycebfaxie52yxsrwct6m'], 'releases': None}
{'state': 'active', 'ident': 'zijafbvnw5ddbdw4or6tfsbtji', 'revision': '903c4354-a702-4c7e-83c6-be95a6f4855b', 'redirect': None, 'extra': None, 'edit_extra': None, 'size': 66540, 'md5': 'ceaff7341a8871d900fd6e6055f5e09d', 'sha1': 'ccdd7dc6baddf7180f19c566ea3ee011b3361a87', 'sha256': '0d9bc2c6cf531c26b8f23d666a59b88b52e0f9dc5c1e940f5993ad8cbbcdc159', 'urls': [{'url': 'https://ris.utwente.nl/ws/files/121371765/BISS_article_37223.pdf', 'rel': 'web'}, {'url': 'https://web.archive.org/web/20200307000640/https://ris.utwente.nl/ws/files/121371765/BISS_article_37223.pdf', 'rel': 'webarchive'}], 'mimetype': 'application/pdf', 'content_scope': None, 'release_ids': ['bwc4gxycebfaxie52yxsrwct6m'], 'releases': None}
{'state': 'active', 'ident': 'jle5ccx3tndj5kvgewtkayzgdq', 'revision': '0b5c7440-95a2-4ee5-b4ac-0092c107cb3f', 'redirect': None, 'extra': None, 'edit_extra': None, 'size': 152417, 'md5': '0b59549b6df91e668445586ad4697ecb', 'sha1': '7fee66e58fc928d48d88a3c920aad834d951397e', 'sha256': '955973ae76920760f9bcedbe4e3c9534bd82fb2bf24fafcc315c80c2bdbd72bd', 'urls': [{'url': 'https://web.archive.org/web/20180420154422/http://www.thinkmind.org/download.php?articleid=semapro_2010_5_50_50111', 'rel': 'webarchive'}, {'url': 'http://www.thinkmind.org/download.php?articleid=semapro_2010_5_50_50111', 'rel': 'web'}], 'mimetype': 'application/pdf', 'content_scope': None, 'release_ids': ['bwc4gxycebfaxie52yxsrwct6m'], 'releases': None}
filesets []
issue
language
license_slug
number
original_title
pages
publisher
refs[] {'index': 0, 'target_release_id': None, 'extra': {'authors': ['V Uren', 'P Cimiano', 'J Iria', 'S Handschuh', 'M Vargasvera', 'E Motta', 'F Ciravegna'], 'issue': None, 'volume': None}, 'key': 'b0', 'year': 2006, 'container_name': None, 'title': 'Semantic annotation for knowledge management: Requirements and a survey of the state of the art', 'locator': None}
{'index': 1, 'target_release_id': None, 'extra': {'authors': ['T Berners-Lee', 'J Hendler', 'O Lassila']}, 'key': 'b1', 'year': 2001, 'container_name': None, 'title': 'The Semantic Web', 'locator': None}
{'index': 2, 'target_release_id': None, 'extra': {'authors': ['D Fensel']}, 'key': 'b2', 'year': 2003, 'container_name': None, 'title': 'Ontologies: A Silver Bullet for Knowledge Management and Electronic Commerce', 'locator': None}
{'index': 3, 'target_release_id': None, 'extra': {'authors': ['P Hayes', 'Semantics'], 'url': 'http://www.w3.org/TR/rdf-mt/'}, 'key': 'b3', 'year': 2004, 'container_name': None, 'title': None, 'locator': None}
{'index': 4, 'target_release_id': None, 'extra': None, 'key': 'b4', 'year': 2004, 'container_name': None, 'title': 'RDF Vocabulary Description Language 1.0: RDF Schema, W3C', 'locator': None}
{'index': 5, 'target_release_id': None, 'extra': {'authors': ['P Patel-Schneider', 'P Hayes', 'I Horrocks'], 'url': 'http://www.w3.org/TR/owl-semantics/'}, 'key': 'b5', 'year': 2003, 'container_name': None, 'title': 'Web Ontology Language (OWL) Abstract Syntax and Semantics', 'locator': None}
{'index': 6, 'target_release_id': None, 'extra': {'authors': ['C Bizer', 'J Lehmann', 'G Kobilarov', 'S Auer', 'C Becker', 'R Cyganiak', 'S Hellmann'], 'url': 'http://jens-lehmann.org/files/2009dbpediajws.pdf'}, 'key': 'b6', 'year': 2009, 'container_name': None, 'title': 'Dbpedia : a crystallization point for the web of data', 'locator': None}
{'index': 7, 'target_release_id': None, 'extra': {'authors': ['M Dowman', 'V Tablan', 'H Cunningham', 'B Popov']}, 'key': 'b7', 'year': 2005, 'container_name': None, 'title': 'Web-assisted annotation, semantic indexing and search of television and radio news', 'locator': None}
{'index': 8, 'target_release_id': None, 'extra': {'authors': ['E Camon', 'M Magrane', 'D Barrell', 'D Binns', 'W Fleischmann', 'P Kersey', 'N Mulder', 'T Oinn', 'J Maslen', 'A Cox', 'R Apweiler'], 'issue': None, 'url': 'http://dx.doi.org/10.1101/gr.461403', 'volume': '13'}, 'key': 'b8', 'year': 2003, 'container_name': None, 'title': 'The gene ontology annotation (goa) project: implementation of go in swiss-prot, trembl, and interpro', 'locator': None}
{'index': 9, 'target_release_id': None, 'extra': {'authors': ['V Hennie Brugman', 'L Hollink']}, 'key': 'b9', 'year': 2008, 'container_name': None, 'title': 'A common multimedia annotation framework for cross linking cultural heritage digital collections', 'locator': None}
{'index': 10, 'target_release_id': None, 'extra': {'authors': ['S Handschuh', 'S Staab']}, 'key': 'b10', 'year': 2002, 'container_name': None, 'title': 'Authoring and annotation of web pages in cream', 'locator': None}
{'index': 11, 'target_release_id': None, 'extra': {'authors': ['S Cucerzan'], 'url': 'http://acl.ldc.upenn.edu/D/D07/D07-1074.pdf'}, 'key': 'b11', 'year': 2007, 'container_name': None, 'title': 'Large-scale named entity disambiguation based on wikipedia data', 'locator': None}
{'index': 12, 'target_release_id': None, 'extra': {'authors': ['A Fogarolli']}, 'key': 'b12', 'year': 2009, 'container_name': None, 'title': 'Word sense disambiguation based on wikipedia link structure', 'locator': None}
{'index': 13, 'target_release_id': None, 'extra': {'authors': ['A Krizhanovsky'], 'url': 'http://arxiv.org/abs/cs/0606097v1'}, 'key': 'b13', 'year': None, 'container_name': None, 'title': 'Synonym search in wikipedia: Synarcher," arxiv.org, search for synomyms in Wikipedia using hyperlinks and categories', 'locator': None}
{'index': 14, 'target_release_id': None, 'extra': {'authors': ['S Chernov', 'T Iofciu', 'W Nejdl', 'X Zhou']}, 'key': 'b14', 'year': 2006, 'container_name': None, 'title': 'Extracting semantic relationships between wikipedia categories', 'locator': None}
{'index': 15, 'target_release_id': None, 'extra': {'authors': ['P Schonhofen']}, 'key': 'b15', 'year': 2006, 'container_name': None, 'title': 'Identifying document topics using the wikipedia category network', 'locator': None}
{'index': 16, 'target_release_id': None, 'extra': {'authors': ['A.-M Vercoustre', 'J Thom', 'J Pehcevski']}, 'key': 'b16', 'year': 2008, 'container_name': None, 'title': 'Entity ranking in wikipedia', 'locator': None}
{'index': 17, 'target_release_id': None, 'extra': {'authors': ['Y Watanabe', 'M Asahara', 'Y Matsumoto'], 'url': 'http://www.aclweb.org/anthology/D/D07/D07-1068'}, 'key': 'b17', 'year': 2007, 'container_name': None, 'title': 'A graph-based approach to named entity categorization in Wikipedia using conditional random fields', 'locator': None}
{'index': 18, 'target_release_id': None, 'extra': {'authors': ['Z Syed', 'T Finin', 'A Joshi']}, 'key': 'b18', 'year': 2008, 'container_name': None, 'title': 'Wikipedia as an ontology for describing documents', 'locator': None}
{'index': 19, 'target_release_id': None, 'extra': {'authors': ['S Adafre', 'M De Rijke']}, 'key': 'b19', 'year': 2005, 'container_name': None, 'title': 'Discovering missing links in wikipedia', 'locator': None}
{'index': 20, 'target_release_id': None, 'extra': {'authors': ['J Voss'], 'url': 'http://eprints.rclis.org/archive/00003610/'}, 'key': 'b20', 'year': 2005, 'container_name': None, 'title': 'Measuring wikipedia', 'locator': None}
{'index': 21, 'target_release_id': None, 'extra': {'authors': ['J Kamps', 'M Koolen'], 'url': 'http://dblp.uni-trier.de/db/conf/ecir/ecir2008.html#KampsK08', 'volume': '4956'}, 'key': 'b21', 'year': 2008, 'container_name': None, 'title': 'The importance of link evidence in wikipedia', 'locator': None}
{'index': 22, 'target_release_id': None, 'extra': {'authors': ['Y Ollivier', 'P Senellart']}, 'key': 'b22', 'year': 2007, 'container_name': None, 'title': 'Finding related pages using Green measures: An illustration with Wikipedia', 'locator': None}
{'index': 23, 'target_release_id': None, 'extra': {'authors': ['R Mihalcea'], 'url': 'http://www.cs.unt.edu/∼rada/papers/mihalcea.naacl07.pdf'}, 'key': 'b23', 'year': 2007, 'container_name': None, 'title': 'Using wikipedia for automatic word sense disambiguation', 'locator': None}
{'index': 24, 'target_release_id': None, 'extra': {'authors': ['R Mihalcea', 'A Csomai'], 'url': 'http://84.11.13.37/Volumes/CIKM07/docs/p233.pdf'}, 'key': 'b24', 'year': 2007, 'container_name': None, 'title': 'Wikify!: linking documents to encyclopedic knowledge', 'locator': None}
{'index': 25, 'target_release_id': None, 'extra': {'authors': ['P Bouquet', 'H Stoermer', 'C Niederee', 'A Mana'], 'volume': None}, 'key': 'b25', 'year': 2008, 'container_name': None, 'title': 'Entity name system: The back-bone of an open and scalable web of data', 'locator': None}
release_date
release_stage
release_type article-journal
release_year
subtitle
title Cosema: Content-based Semantic Annotator
version
volume
webcaptures []
withdrawn_date
withdrawn_status
withdrawn_year
work_id x5tomcpksrgl3j72uwvcb6loe4
As JSON via API

Extra Metadata (raw JSON)

longtail_oa True