Filters








1,000 Hits in 5.5 sec

Finding and Typing New Named Entities in Tibetan from Chinese-Tibetan Parallel Corpora

Lirong Qiu
2014 International Journal of Multimedia and Ubiquitous Engineering  
In this paper, we describe a method for Chinese-Tibetan bilingual named entity recognition using easily obtainable bilingual dictionary and parallel political corpora.  ...  We present two distinct steps for NER, one step identifying entity candidates in Tibetan, and the second step typing the entity into the semantic class.  ...  Acknowledgements Our work is supported by the National nature science foundation of China (No. 61103161), the Program for New Century Excellent Talents in University (NCET-12-0579) and the "985" special  ... 
doi:10.14257/ijmue.2014.9.9.16 fatcat:45ogb7tk4zcxzhwdkhp34ot3u4

Parallel sequence tagging for concept recognition [article]

Lenz Furrer
2020 arXiv   pre-print
Named Entity Recognition (NER) and Normalisation (NEN) are core components of any text-mining system for biomedical texts.  ...  Availability and Implementation: Source code freely available for download at https://github.com/OntoGene/craft-st. Supplementary data are available at arXiv online.  ...  Acknowledgements We would like to thank the organisers of the CRAFT shared task 2019 for the well-organised competition with high-quality annotations and prompt support.  ... 
arXiv:2003.07424v2 fatcat:n3pp3zfefvakxg7qxmdco45zqy

A descriptive algorithm for a wine tasting lexicon corpus

Margarita Goded Rambaud
2009 Scire. Representación y organización del conocimiento  
Acknowledgements I am indebted to Alfredo Poves Luelmo, from the Universidad Complutense de Madrid, for his help comparing corpora and with the calculation of the figures in table II.  ...  Notes (1) Collins Cobuild Dictionary. Editor's note  ...  in the hierarchy may have an extended meaning covering some or all sensory modalities below in such a hierarchy.  ... 
doi:10.54886/scire.v15i2.3711 fatcat:lwojjefdpvewfb5z5ec23fdxdy

Animacy Detection with Voting Models

Joshua L. Moore, Christopher J. C. Burges, Erin Renshaw, Wen-tau Yih
2013 Conference on Empirical Methods in Natural Language Processing  
Animacy detection is a problem whose solution has been shown to be beneficial for a number of syntactic and semantic tasks.  ...  We present a state-of-the-art system for this task which uses a number of simple classifiers with heterogeneous data sources in a voting scheme.  ...  Acknowledgments We wish to thank Andrzej Pastusiak for his help with the labeling tool.  ... 
dblp:conf/emnlp/MooreBRY13 fatcat:jveplqen4fc2blsomcvv73sveq

D3.1 TV programme annotation model

Raphael Troncy, Ismail Harrando, Benoit Huet, Jean Carrive, Steffen Lalande, Michael Stormbom, Tiina Lindh-Knuutila, Lauri Saarikoski, Kim Viljanen
2019 Zenodo  
This deliverable describes also a number of tools that perform named entity recognition and disambiguation on both automatic transcription and true subtitles of TV programs.  ...  Furthermore, this deliverable describes two tools that enable to convert the legacy metadata coming from both INA and Yle into RDF, the W3C standard for representing knowledge graph on the web, following  ...  the European Union's Horizon 2020 research and innovation programme via the project MeMAD (GA 780069).  ... 
doi:10.5281/zenodo.4796668 fatcat:4yfr45rvojfzrha2lcgkmkpxtu

University of Pennsylvania

Breck Baldwin, Mike Collins, Jason Eisner, Adwait Ratnaparkhi, Joseph Rosenzweig, Anoop Sarkar
1995 Proceedings of the 6th conference on Message understanding - MUC6 '95  
For the first few months, tools were built and the system was extended at weekly 'hack sessions.'  ...  The table of first names overlaps with place names and time words. For example, Canada and Tuesday are women's names. In such cases, the evidence from the table is discarded.  ... 
doi:10.3115/1072399.1072416 dblp:conf/muc/BaldwinCERRS95 fatcat:uqj4x5my7zgrpez2arnzccbbsq

A descriptive algorithm for a wine tasting lexicon corpus

Margarita Goded Rambaud
2007 Ibersid - Journal of Information and Documentation Systems  
He demonstrated that a verb that has its basic meaning in a sensory modality to the left in the hierarchy, may have an extended meaning covering some or all sensory modalities below in such a hierarchy  ...  As a result, this differentiation is invalidated since his definition for entities includes now both objects and relationships.  ... 
doi:10.54886/ibersid.v1i.3324 fatcat:f5nctuo2vzdq3fgjmqbaalogga

Automatically generated NE tagged corpora for English and Hungarian

Eszter Simon, Dávid Márk Nemeskey
2012 Named Entity Workshop  
Since our method is mainly languageindependent, we used it to generate corpora for English and Hungarian. The corpora are freely available.  ...  Supervised Named Entity Recognizers require large amounts of annotated text. Since manual annotation is a highly costly procedure, reducing the annotation cost is essential.  ...  The authors are grateful to Attila Zséder for his work on Wikipedia parsing and to András Kornai for his insightful comments.  ... 
dblp:conf/aclnews/SimonN12 fatcat:ykx4xmy6hrb77ns2uuseuyetu4

An Automatically Generated Annotated Corpus for Albanian Named Entity Recognition

Klesti Hoxha, Artur Baxhaku
2018 Cybernetics and Information Technologies  
Named Entity Recognition (NER) is an important task in many NLP pipelines. It has become especially important for knowledge bases that power many of the nowadays information retrieval systems.  ...  In order to cope with the high demand for annotated training corpora for supervised NER systems, automatic generation approaches have been proposed.  ...  This extends the gazetteer with a list of aliases for many of the included entities. Finally, the gazetteer is extracted.  ... 
doi:10.2478/cait-2018-0009 fatcat:atzksu2vmrgvlgpljlbpgzlnmu

New York University

Ralph Grishman, John Sterling
1993 Proceedings of the 5th conference on Message understanding - MUC5 '93  
I n mixed-case text, we used capitalization as the primary indication of the beginning of a name ; in monocase text, we employed BBN's part-of-speech tagger and looked for proper noun tags .  ...  ,"un") and a word defined in the dictionary . If all of these procedures fail, we assign a default definition .  ...  SPONSORSHI P The development of the entire PROTEUS system has been sponsored primarily b y the Advanced Research Projects Agency as part of the Strategic Computing Program , under Contract N00014-85-K-0163 and  ... 
doi:10.3115/1072017.1072036 dblp:conf/muc/GrishmanS93 fatcat:pjyvbhgxargm3fugiixrv3eqxu

CRL/Brandeis

Jim Cowie, Louise Guthrie, Wang Jin, William Ogden, James Pustejovsky, Rong Wang, Takahiro Wakao, Scott Waterman, Yorick Wilks
1993 Proceedings of a workshop on held at Fredericksburg, Virginia September 19-23, 1993 -  
To extend the system's coverage of English and Japanese some of the content of these lexical structures was derived from machine readable dictionaries.  ...  The CRL/Brandeis group have implemented statistical methods for focusing on the relevant parts of texts, programs which recognize and mark names of people, places and organizations and also dates.  ...  The actual identification of appropriate entities was much higher (60%) and for entity name recognition (54%).  ... 
doi:10.3115/1119149.1119173 dblp:conf/tipster/CowieGJOPWWWW93 fatcat:cedhymxglffvxcsypirgtsyg34

A modular framework for biomedical concept recognition

David Campos, Sérgio Matos, José Oliveira
2013 BMC Bioinformatics  
) and on entity normalization (F1-measure for overlap name matching and correct identifier included in the returned list of identifiers: species 88%, cell 71%, cellular components 72%, gene and proteins  ...  for overlap matching: species 95%, cell 92%, cellular components 83%, gene and proteins 76%, chemicals 65%, biological processes and molecular functions 63%, disorders 85%, and anatomical entities 82%  ...  Chemicals: a dictionary of chemical names was built using the ChEBI database of molecular entities [46] ; Species: the dictionary provided by LINNAEUS [29] was extended by adding the entries from the  ... 
doi:10.1186/1471-2105-14-281 pmid:24063607 pmcid:PMC3849280 fatcat:xmjcjbm74jbudpf3gcqjiib46m

Text analytics for life science using the Unstructured Information Management Architecture

R. Mack, S. Mukherjea, A. Soffer, N. Uramoto, E. Brown, A. Coden, J. Cooper, A. Inokuchi, B. Iyer, Y. Mass, H. Matsuzawa, L. V. Subramaniam
2004 IBM Systems Journal  
Acknowledgments BioTeKS is in large part a systems integration effort that builds on technologies and expertise developed  ...  Dictionaries can be customized for specific categories of entities. POS (part of speech) tagger Assigns parts of speech to tokens, using context.  ...  Uses dictionary lookup for drug names, quantities, and dosage qualifiers and FST rules for identifying phrasal combinations of these elements.  ... 
doi:10.1147/sj.433.0490 fatcat:altfinouzbdy7mrcqx2kzdzecy

Learning to Recognize Phenotype Candidates in the Auto-Immune Literature Using SVM Re-Ranking

Nigel Collier, Mai-vu Tran, Hoang-quynh Le, Quang-Thuy Ha, Anika Oellrich, Dietrich Rebholz-Schuhmann, Luis M. Rocha
2013 PLoS ONE  
Using partial matching the best micro-averaged F-score for phenotypes and five other entity classes was 79.9%.  ...  This paper presents novel techniques for identifying potential complex phenotype mentions by exploiting a hybrid model based on machine learning, rules and dictionary matching.  ...  Open-source tools for NER include BANNER [21] , ABNER [20] , LINGPIPE [31] , the GENIA tagger [32] and NERSuite, a named entity recognition toolkit based on CRFSuite [33] .  ... 
doi:10.1371/journal.pone.0072965 pmid:24155869 pmcid:PMC3796529 fatcat:jcy2fr7yrjgwngtfqvr4vhczdq

AIDA-light: High-Throughput Named-Entity Disambiguation

Dat Ba Nguyen, Johannes Hoffart, Martin Theobald, Gerhard Weikum
2014 The Web Conference  
On the other hand, methods that rely on rich context features and joint inference for mapping names onto entities pay the price of being much slower.  ...  State-of-the-art methods for Named Entity Disambiguation (NED) face major tradeoffs regarding efficiency/scalability vs. accuracy.  ...  be many candidate entities for common mentions (e.g., common first names, last names, etc.).  ... 
dblp:conf/www/NguyenHTW14 fatcat:tqgfmzmxhzhmtjgvrysfxbfehi
« Previous Showing results 1 — 15 out of 1,000 results