Filters








26 Hits in 3.7 sec

A Probabilistic Annotation Model for Crowdsourcing Coreference

Silviu Paun, Jon Chamberlain, Udo Kruschwitz, Juntao Yu, Massimo Poesio
2018 Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing  
This paper addresses one crucial hurdle on the way to make this possible, by introducing a new model of annotation for aggregating crowdsourced anaphoric annotations.  ...  Crowdsourcing has been proposed as an alternative; but this approach has not been widely used for coreference.  ...  Datasets The largest coreference dataset with crowdsourced annotations is the Phrase Detectives corpus.  ... 
doi:10.18653/v1/d18-1218 dblp:conf/emnlp/PaunCKYP18 fatcat:6mhyg2gytvas7ehrvckg4lq2vm

A Mention-Ranking Model for Abstract Anaphora Resolution

Ana Marasovic, Leo Born, Juri Opitz, Anette Frank
2017 Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing  
This corpus presents a greater challenge due to a mixture of nominal and pronominal anaphors and a greater range of confounders.  ...  We also report first benchmark results on an abstract anaphora subset of the ARRAU corpus.  ...  Event coreference is restricted to a subclass of events, and usually focuses on coreference between verb (phrase) and noun (phrase) mentions of similar abstractness levels (e.g. purchase -acquire) with  ... 
doi:10.18653/v1/d17-1021 dblp:conf/emnlp/MarasovicBOF17 fatcat:tsi2bejulvcjbhx2yua7zjemvu

Optimising crowdsourcing efficiency: Amplifying human computation with validation

Jon Chamberlain, Udo Kruschwitz, Massimo Poesio
2018 it - Information Technology  
Crowdsourcing has revolutionised the way tasks can be completed but the process is frequently inefficient, costing practitioners time and money.  ...  A validation model is described, simulated and tested on real data from an online crowdsourcing game to collect data about human language.  ...  The dataset (Phrase Detectives Corpus 1.0) was used to determine what the collective quality of the players were, as well as the quality of individual decisions.  ... 
doi:10.1515/itit-2017-0020 fatcat:smo72cwf7veaxjn5bivmuvz43u

A Mention-Ranking Model for Abstract Anaphora Resolution [article]

Ana Marasović, Leo Born, Juri Opitz, Anette Frank
2017 arXiv   pre-print
This corpus presents a greater challenge due to a mixture of nominal and pronominal anaphors and a greater range of confounders.  ...  We also report first benchmark results on an abstract anaphora subset of the ARRAU corpus.  ...  Event coreference is restricted to a subclass of events, and usually focuses on coreference between verb (phrase) and noun (phrase) mentions of similar abstractness levels (e.g. purchase -acquire) with  ... 
arXiv:1706.02256v2 fatcat:zbu4zwaspzfrrhhfln4eythbjm

Predicting the Relative Difficulty of Single Sentences With and Without Surrounding Context

Elliot Schumacher, Maxine Eskenazi, Gwen Frishkoff, Kevyn Collins-Thompson
2016 Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing  
Not surprisingly, sentences that contain these contextual dependencies take more effort to comprehend: an anaphoric noun phrase, or NP (e.g., "he"), automatically triggers the need to resolve reference  ...  Data Set The study sentences were drawn from a corpus combining the American National Corpus (Reppen et al., 2005) , the New York Times Corpus (Sandhaus, 2008) , and the North American News Text Corpus  ... 
doi:10.18653/v1/d16-1192 dblp:conf/emnlp/SchumacherEFC16 fatcat:626n243jondc3im4adw3hbsewm

Predicting the Relative Difficulty of Single Sentences With and Without Surrounding Context [article]

Elliot Schumacher, Maxine Eskenazi, Gwen Frishkoff, Kevyn Collins-Thompson
2016 arXiv   pre-print
Not surprisingly, sentences that contain these contextual dependencies take more effort to comprehend: an anaphoric noun phrase, or NP (e.g., "he"), automatically triggers the need to resolve reference  ...  Data Set The study sentences were drawn from a corpus combining the American National Corpus (Reppen et al., 2005) , the New York Times Corpus (Sandhaus, 2008) , and the North American News Text Corpus  ... 
arXiv:1606.08425v3 fatcat:nnikdcf3ibafznyql7c3bjjvlm

DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion [article]

Mor Geva, Eric Malmi, Idan Szpektor, Jonathan Berant
2019 arXiv   pre-print
For entity resolution, both anaphoric pronouns ("she", "they", "his") and anaphoric nominals ("the team", "the man") are considered, based on the output of a coreference system.  ...  V A set of POS tags for verbal phrases. Table 16 .  ...  Phenomenon Input Detection Generation Discourse connective (A,B)  ... 
arXiv:1902.10526v3 fatcat:sfdjsmddlvddveky2p2y7a7olq

Learning from Disagreement: A Survey

Alexandra N. Uma, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank, Massimo Poesio
2021 The Journal of Artificial Intelligence Research  
Our analysis of some documents from the Phrase Detectives corpus showed similar results.  ...  The dataset we used, and that we call pdis here, is extracted from the Phrase Detectives 2 corpus for coreference resolution (Poesio et al., 2019) , 3 which used a simplified, binary definition of the  ... 
doi:10.1613/jair.1.12752 fatcat:l2dvmhtqtzelnbdcxdgra4ejwu

Comparing Bayesian Models of Annotation

Silviu Paun, Bob Carpenter, Jon Chamberlain, Dirk Hovy, Udo Kruschwitz, Massimo Poesio
2018 Transactions of the Association for Computational Linguistics  
Lately, model-based analysis of corpus annotations have proven better at all three tasks. But there has been relatively little work comparing them on the same datasets.  ...  The analysis of crowdsourced annotations in natural language processing is concerned with identifying (1) gold standard labels, (2) annotator accuracies and biases, and (3) item difficulties and error  ...  In addition, we include the Phrase Detectives 1.0 (PD) corpus (Chamberlain et al., 2016) , which differs in a number of key ways from the Snow et al. (2008) datasets: It has a much larger number of  ... 
doi:10.1162/tacl_a_00040 fatcat:ekkl7vqqanar7jsvpm5xtb6gcq

Advances in statistical script learning [article]

Karl Pichotta
2017
Finally, motivated by this result, we investigate incorporating features derived from these models into a baseline noun coreference resolution system.  ...  Coreference resolution systems, semantic role labeling, and even syntactic parsing systems could, in principle, benefit from event co-occurrence models.  ...  For a particular dependency path connecting two noun phrases, they calculate the percentage of occurrences of the path in a large newswire corpus which connect two likely-coreferent noun phrases (e.g.  ... 
doi:10.15781/t27w67p0b fatcat:eykrdybuafgeva6mm54pmxdex4

A Framework for Learning Assessment through Multimodal Analysis of Reading Behaviour and Language Comprehension [article]

Santosh Kumar Barnwal
2021 arXiv   pre-print
Anaphor overlap, all sentences .1: Coh-Metrix version 3.0 indices Verb phrase density, incidence 4.  ...  The model covers more than 3 million words and phrases. GloVe embedding is also 300-dimensional vectors. The word vectors are trained on a very large corpus of 840 billion tokens corpus.  ... 
arXiv:2110.11938v1 fatcat:dw6mmccebneazlse2gxvy7ytdu

Représentations et transmission des connaissances à la lumière de l'innovation numérique. Actes du colloque Jeunes Chercheurs 2019 PRAXILING, 7-8 Novembre 2019

Julien Magnier, Anne-Laure Biales, Pierre Bellet, Olivier Le Deuff, Chrysta Pélissier, Didier Ozil, Yigong Guo, Aurélie Doelrasad, José Samaniego, François Mangenot, Arnaud Richard, Mehdi Mirzapour (+13 others)
2019
Anaphore, cataphore et mémoire discursive. Pratiques, 57(1) :15-43. [55] Soon, W. M., Ng, H. T., and Lim, C. Y. (2001). A machine learning approach to coreference resolution of noun phrases.  ...  Supervised noun phrase coreference research : The first fifteen years. In ACL. [46] Ng, V. and Gardent, C. (2002). Improving machine learning approaches to coreference resolution.  ... 
doi:10.18463/toubol.001 fatcat:yutdrmnhynentjlvjcktbbkgwm

Regional Linguistic Data Initiative (ReLDI)

Tanja Samardžić, Nikola Ljubešić, Maja Miličević
2015
Other papers address a broad range of topics, including word-sense disambiguation, corpus analysis, text and author modeling, and linguistic resources.  ...  Acknowledgments We are indebted to Hristo Tanev from the Joint Research Centre of the European Commission, who provided the corpus of news articles in Polish.  ...  ., when an anaphoric pronoun is far away from its preceding coreferent mention).  ... 
doi:10.5167/uzh-127250 fatcat:tuiafgzehzhtfhtf53sz3wafqa

Republic; 2. the Second BSNLP Workshop, held in conjunction with IIS 2009: Intelligent Information Systems

Bulgaria Hissar, Jakub Piskorski, Jan Šnajder, Roman Yangarber, Tomas Krilavičius, Natalia Loukachevitch, Preslav Nakov, Maciej Piasecki, Jakub Piskorski, Kiril Simov, Jan Šnajder, Josef Steinberger (+6 others)
2015 Hissar, Bulgaria 1. the First BSNLP Workshop, held in conjunction with ACL 2007 Conference in   unpublished
Other papers address a broad range of topics, including word-sense disambiguation, corpus analysis, text and author modeling, and linguistic resources.  ...  Acknowledgments We are indebted to Hristo Tanev from the Joint Research Centre of the European Commission, who provided the corpus of news articles in Polish.  ...  ., when an anaphoric pronoun is far away from its preceding coreferent mention).  ... 
fatcat:oayf2lfw7bg3tmawukofn5xdpq

Strategies to Address Data Sparseness in Implicit Semantic Role Labeling

Parvin Sadat Feizabadi
2019
For this purpose, they selected a data set annotated with both semantic roles and coreference chains and extracted coreference chains with anaphoric pronouns which filled a semantic role of a predicate  ...  The precision improves dramatically for the WSD condition, from 0.17 to 1.0.  ... 
doi:10.11588/heidok.00025842 fatcat:6j27mkll7zh2nlyvmd42336b4m
« Previous Showing results 1 — 15 out of 26 results