BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking

Axel-Cyrille Ngonga Ngomo, Michael Röder, Diego Moussallem, Ricardo Usbeck, René Speck
2018 Proceedings of the 11th International Conference on Natural Language Generation  
The manual creation of gold standards for named entity recognition and entity linking is time- and resource-intensive.  ...  We hence present BENGAL, a novel automatic generation of such gold standards as a complement to manually created benchmarks.  ...  The authors gratefully acknowledge financial support from the German Federal Ministry of Education and Research within Eurostars, a joint programme of EUREKA and the European Community under the project  ... 
doi:10.18653/v1/w18-6541 dblp:conf/inlg/NgomoRMUS18 fatcat:rrbpmqvbhvgt7kqn2ixzjlrwbe

BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking [article]

Axel-Cyrille Ngonga Ngomo, Michael Röder, Diego Moussallem, Ricardo Usbeck, René Speck
2018 arXiv   pre-print
The manual creation of gold standards for named entity recognition and entity linking is time- and resource-intensive.  ...  We hence present BENGAL, a novel automatic generation of such gold standards as a complement to manually created benchmarks.  ...  The authors gratefully acknowledge financial support from the German Federal Ministry of Education and Research within Eurostars, a joint programme of EUREKA and the European Community under the project  ... 
arXiv:1710.08691v3 fatcat:bqytsstipven5acziwoi2yjpre

Open Knowledge Extraction Challenge 2017 [chapter]

René Speck, Michael Röder, Sergio Oramas, Luis Espinosa-Anke, Axel-Cyrille Ngonga Ngomo
2017 Communications in Computer and Information Science  
The challenge makes use of small gold standard datasets that consist of manually curated documents and large silver standard datasets that consist of automatically generated synthetic documents.  ...  This year, the challenge goes in the third round and consists of three tasks which include named entity identification, typing and disambiguation by linking to a knowledge base depending on the task.  ...  Also this work was partially funded by the Spanish Ministry of Economy and Competitiveness under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502).  ... 
doi:10.1007/978-3-319-69146-6_4 fatcat:35uqvuzc2ndjpk2wc6bqtlg4my

All that Glitters Is Not Gold – Rule-Based Curation of Reference Datasets for Named Entity Recognition and Entity Linking [chapter]

Kunal Jha, Michael Röder, Axel-Cyrille Ngonga Ngomo
2017 Lecture Notes in Computer Science  
In this work, we analyze existing gold standards and derive a set of rules for annotating documents for named entity recognition and entity linking.  ...  First, they do not share a common set of rules pertaining to what is to be marked and linked as an entity.  ...  Acknowledgments This work has been supported by the H2020 project HOBBIT (GA no. 688227) as well as the EuroStars projects DIESEL (project no. 01QE1512C) and QAMEL (project no. 01QE1549C).  ... 
doi:10.1007/978-3-319-58068-5_19 fatcat:3jeakohlfvawva6dqdq74iy7vq

pioNER: Datasets and Baselines for Armenian Named Entity Recognition [article]

Tsolak Ghukasyan, Garnik Davtyan, Karen Avetisyan, Ivan Andrianov
2018 arXiv   pre-print
We present a 163000-token named entity corpus automatically generated and annotated from Wikipedia, and another 53400-token corpus of news sentences with manual annotation of people, organization and location  ...  The corpora were used to train and evaluate several popular named entity recognition models.  ...  Aside from the lack of training data, we also address the absence of a benchmark dataset of Armenian texts for named entity recognition.  ... 
arXiv:1810.08699v1 fatcat:p6fpka5gibcxxesbjha6ee235y

Datasets, GATE Evaluation Framework for Benchmarking Wikipedia-Based NER Systems

Milan Dojchinovski, Tomás Kliegr
2013 International Semantic Web Conference  
Entities recognized in the original datasets were enriched with new annotations: a link to Wikipedia and the most specific type from the DBpedia Ontology.  ...  We present a wikifier evaluation framework consisting of software support and two datasets (News and Tweets), which were derived from datasets previously published at WEKEX 2011 and MSM Challenge 2013.  ...  This research was supported by the European Union's 7th Framework Programme via the LinkedTV project (FP7-287911) and CTU in Prague grant (SGS13/100/OHK3/1T/18).  ... 
dblp:conf/semweb/DojchinovskiK13 fatcat:3625pvlgrvbanne5e6yhbgsumm

Slot Filling for Extracting Reskilling and Upskilling Options from the Web [chapter]

Albert Weichselbraun, Roger Waldvogel, Andreas Fraefel, Alexander van Schie, Philipp Kuntschik
2022 Lecture Notes in Computer Science  
We also introduce a German gold standard that comprises 169 documents and over 3800 annotations for benchmarking the necessary content extraction, entity linking, entity recognition and slot filling tasks  ...  Afterwards, entity recognition and entity linking methods draw upon a domain ontology to locate relevant entities such as skills, occupations and topics.  ...  Integrating background knowledge from a proprietary ontology allows the application of graph-based entity linking methods for the identification of known entities, which are complemented by entity recognition  ... 
doi:10.1007/978-3-031-08473-7_25 fatcat:tcvofd3gajbhtfzh747plh7heu

Automatic Entity Recognition and Typing from Massive Text Corpora

Xiang Ren, Ahmed El-Kishky, Chi Wang, Jiawei Han
2015 Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15  
These methods can automatically identify token spans as entity mentions in documents and label their types (e.g., people, product, food) in a scalable way.  ...  To unlock the value of these unstructured text data from various domains, it is of great importance to gain an understanding of entities and their relationships.  ...  His research focuses on knowledge acquisition from text data and mining linked data. He is the recipient of C. L. and Jane W. Acknowledgments Research was sponsored in part by the U.S.  ... 
doi:10.1145/2783258.2789988 pmid:26705508 pmcid:PMC4688010 dblp:conf/kdd/RenEWH15 fatcat:2z3gzinvrbawpkgziwpsxcl6p4

CMEE-IL: Code Mix Entity Extraction in Indian Languages from Social Media Text @ FIRE 2016 - An Overview

Pattabhi R. K. Rao, Sobha Lalitha Devi
2016 Forum for Information Retrieval Evaluation  
However, there is no benchmark data available where all these systems could be compared on the same data for respective languages in this new generation user generated text.  ...  Entity recognition and extraction has gained increased attention in the Indian research community.  ...  text normalization and named entity recognition for English.  ... 
dblp:conf/fire/RaoD16 fatcat:owk3dfmomvc7bm2mqyesrrzftq

What did you Mention? A Large Scale Mention Detection Benchmark for Spoken and Written Text [article]

Yosi Mass, Lili Kotlerman, Shachar Mirkin, Elad Venezian, Gera Witzling, Noam Slonim
2018 arXiv   pre-print
We describe a large, high-quality benchmark for the evaluation of Mention Detection tools.  ...  The benchmark contains annotations of both named entities as well as other types of entities, annotated on different types of text, ranging from clean text taken from Wikipedia, to noisy spoken data.  ...  NEEL (Named Entity rEcognition and Linking) and ERD (Entity Recognition and Disambiguation; Carmel et al., 2014).  ... 
arXiv:1801.07507v3 fatcat:npchvjpamndwnn2zhn3iufonpa

Fine-grained Entity Recognition with Reduced False Negatives and Large Type Coverage [article]

Abhishek Abhishek, Sanya Bathla Taneja, Garima Malik, Ashish Anand, Amit Awekar
2019 arXiv   pre-print
Fine-grained Entity Recognition (FgER) is the task of detecting and classifying entity mentions to a large set of types spanning diverse domains such as biomedical, finance and sports.  ...  Our extensive empirical experimentation warrants the quality of the generated datasets. Along with this, we also provide a manually annotated dataset for benchmarking FgER systems.  ...  Aryabartta Sahu at Department of Computer Science and Engineering, IIT Guwahati. Abhishek is supported by MHRD fellowship, Government of India.  ... 
arXiv:1904.13178v1 fatcat:vehug6uoefcshen6iguqvqj6ju

An Unsupervised Language-Independent Entity Disambiguation Method and its Evaluation on the English and Persian Languages [article]

Majid Asgari-Bidhendi, Behrooz Janfada, Amir Havangi, Sayyed Ali Hossayni, Behrouz Minaei-Bidgoli
2021 arXiv   pre-print
Entity linking mainly consists of two tasks: recognition and disambiguation of named entities. Most studies address these two tasks separately or focus only on one of them.  ...  Entity Linking is one of the essential tasks of information extraction and natural language understanding.  ...  Acknowledgments The authors certify that they have NO affiliations with or involvement in any organization or entity with any financial interest, or non-financial interest in the subject matter or materials  ... 
arXiv:2102.00395v1 fatcat:rsdlr777ljb3zknyoyqjnyjbcy

PoetryLab: An Open Source Toolkit for the Analysis of Spanish Poetry Corpora

Elena González-Blanco, Salvador Ros Muñoz, Javier De la Rosa, Álvaro Pérez Pozo, Laura Hernández, Mirella De Sisto, Aitor Díaz, Omar Khalil, José Luis Rodríguez, Leire Leguina
2020 Zenodo  
The effort crystallized in the PoetryLab, an extensible open-source toolkit for syllabification, scansion, enjambment detection, rhyme detection, and historical named entity recognition for Spanish poetry  ...  To tackle the issue in the realm of the Spanish poetic tradition, our approach consisted in designing a set of tools that any scholar could use to automatically enrich the analysis of Spanish poetry.  ...  Carry out the detection of literary phenomena relied on linguistic characteristics Normalization of historical spelling and variants generation • Tagging of names automatically detected and proposed names  ... 
doi:10.5281/zenodo.4299614 fatcat:odndzxejx5cljhk64nhnq2wceu

On the Importance of Drill-Down Analysis for Assessing Gold Standards and Named Entity Linking Performance

Fabian Odoni, Philipp Kuntschik, Adrian M.P. Braşoveanu, Albert Weichselbraun
2018 Procedia Computer Science  
Rigorous evaluations and analyses of evaluation results are key towards improving Named Entity Linking systems.  ...  We present three use cases in order to demonstrate the usefulness of Orbis for both research and production systems: (i) improving Named Entity Linking tools; (ii) detecting gold standard errors; and  ...  Acknowledgements The research presented in this paper has been conducted as part of the DISCOVER Project (www.htwchur.ch/discover), funded by the Swiss Commission for Technology and Innovation (CTI).  ... 
doi:10.1016/j.procs.2018.09.004 fatcat:lfzfdn3u7jbknljxanfyimnvnu

GERBIL – Benchmarking Named Entity Recognition and Linking consistently

Michael Röder, Ricardo Usbeck, Axel-Cyrille Ngonga Ngomo, Ruben Verborgh
2018 Semantic Web Journal  
Our approach to this problem opens a way to address the deprecation of URIs of existing gold standards for named entity recognition and entity linking, a feature which is currently not supported by the  ...  In the domains of named entity recognition and entity linking, the large number of systems and their orthogonal evaluation w.r.t. measures and datasets has led to an unclear landscape regarding the abilities  ...  This work was supported by the German Federal Ministry of Education and Research under the project number 03WKCJ4D and the Eurostars projects DIESEL (E!9367) and QAMEL (E!  ... 
doi:10.3233/sw-170286 fatcat:2z3uaekzovhhte6vemt62ctr24