
Experiments on bridging across languages and genres

Yulia Grishina
2016 Proceedings of the Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016)  
Furthermore, for the complete exploration of extended coreference relations, we exploit an existing near-identity scheme to augment our annotations with near-identity links, and we report on the results  ...  After discussing our annotation guidelines, we describe annotation experiments on the German part of our parallel coreference corpus and show that our interannotator agreement results are reliable, considering  ...  This work was supported by a scholarship from the Friedrich Wingert foundation.  ... 
doi:10.18653/v1/w16-0702 dblp:conf/naacl/Grishina16 fatcat:hcz5htjpmnfupoasf27ot7sxwy

Interesting Linguistic Features in Coreference Annotation of an Inflectional Language [chapter]

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, Magdalena Zawisławska
2013 Lecture Notes in Computer Science  
Starting from the notion of a mention, its borders and potential vs. actual referentiality, we discuss the problem of complete and near-identity, zero subjects and dominant expressions.  ...  This paper reports on linguistic features and decisions that we find vital in the process of annotation and resolution of coreference for highly inflectional languages.  ...  Near-Identity Near-identity is a novel coreference relation defined in [15].  ... 
doi:10.1007/978-3-642-41491-6_10 fatcat:impxf75twvg5teixr4owmgvnvu

Polish Coreference Corpus [chapter]

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, Magdalena Zawisławska
2016 Lecture Notes in Computer Science  
properties related to mentions, clusters and near-identity links.  ...  Finally, we report on our negative experiences concerning the annotation of the near-identity relation. In the conclusion we put forward some guidelines for the future research in the area.  ...  Acknowledgements The work reported here was cofunded by the "Computer-based methods for coreference resolution in Polish texts", a project financed by the Polish National Science Centre (contract number  ... 
doi:10.1007/978-3-319-43808-5_17 fatcat:3pxfb3dovfbxjc665vvhblj2pu

Coreference Annotation Schema for an Inflectional Language [chapter]

Maciej Ogrodniczuk, Magdalena Zawisławska, Katarzyna Głowińska, Agata Savary
2013 Lecture Notes in Computer Science  
Creating a coreference corpus for an inflectional and free-word-order language is a challenging task due to specific syntactic features largely ignored by existing annotation guidelines, such as the absence  ...  of building the first, to our best knowledge, corpus of general coreference of Polish.  ...  (see, e.g., [18]), which require stable and consistent annotation model for all languages involved.  ... 
doi:10.1007/978-3-642-37247-6_32 fatcat:w66m4swncjaurcftvxaxv2qcee

Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR [chapter]

Adèle Désoyer, Frédéric Landragin, Isabelle Tellier, Anaïs Lefeuvre, Jean-Yves Antoine, Marco Dinarelli
2018 Lecture Notes in Computer Science  
One specific aspect of the system is that it has been trained on data that come exclusively from transcribed speech, namely ANCOR (ANaphora and Coreference in ORal corpus), the first large-scale French  ...  We present ANCOR, a French corpus annotated with coreference relations which is freely available and large enough to serve the needs of data-driven approaches  ...  This work was supported by grant ANR-15-CE38-0008 ("DEMOCRAT" project) from the French National Research Agency (ANR), and by APR Centre-Val-de-Loire region ("ANCOR" project).  ... 
doi:10.1007/978-3-319-75477-2_36 fatcat:jkg6tce6lzb6tkwvw7myqstxke

RuCoCo: a new Russian corpus with coreference annotation [article]

Vladimir Dobrovolskii, Mariia Michurina, Alexandra Ivoylova
2022 arXiv   pre-print
We present a new corpus with coreference annotation, Russian Coreference Corpus (RuCoCo).  ...  RuCoCo contains news texts in Russian, part of which were annotated from scratch, and for the rest the machine-generated annotations were refined by human annotators.  ...  Acknowledgements We are grateful to our annotation team from General Linguistics Department of RSUH for their hard work, attentive approach to the project and immense help in discussions.  ... 
arXiv:2206.04925v1 fatcat:4w7f3y5ml5gypgbk4hytvlxsoe

Cross-document coreference: An approach to capturing coreference without context

Kristin Wright-Bettner, Martha Palmer, Guergana Savova, Piet de Groen, Timothy Miller
2019 Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019)  
The THYME colon cancer corpus This annotation effort merged and expanded on document-level annotations created by two prior projects: a temporal relations project 1 (Styler et al., 2014), and a coreference  ...  In this paper, we discuss a cross-document coreference annotation schema that we developed to further automatic extraction of timelines in the clinical domain.  ...  Acknowledgments The work was supported by funding R01LM010090 from the National Library Of Medicine.  ... 
doi:10.18653/v1/d19-6201 dblp:conf/acl-louhi/Wright-BettnerP19 fatcat:rtgnbpce5zdpxebrmyrx47773y

Cross-document Event Identity via Dense Annotation [article]

Adithya Pratapa, Zhengzhong Liu, Kimihiro Hasegawa, Linwei Li, Yukari Yamakawa, Shikun Zhang, Teruko Mitamura
2021 arXiv   pre-print
Such annotation setup reduces the pool of event mentions and prevents one from considering the possibility of quasi-identity relations.  ...  In this paper, we study the identity of textual events from different documents.  ...  We also thank the Mechanical Turk workers for their help in our annotation process.  ... 
arXiv:2109.06417v1 fatcat:6cwedtrjxrg7dkgzbozeteunhy

Bridging Relations in Polish: Adaptation of Existing Typologies

Maciej Ogrodniczuk, Magdalena Zawisławska
2016 Proceedings of the Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016)  
The classification is confronted with existing annotation of other-than-identity relations in a portion of Polish Coreference Corpus.  ...  Findings from the process are intended to facilitate development of annotation guidelines of a new reference-related project.  ...  of the coreference, near-identity and other semantic relations.  ... 
doi:10.18653/v1/w16-0703 dblp:conf/naacl/OgrodniczukZ16 fatcat:x3gvnwvlgbfwhb5um5ellyhhgi

Vagueness and Referential Ambiguity in a Large-Scale Annotated Corpus

Yannick Versley
2008 Research on Language and Computation  
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases.  ...  Data from a large referentially annotated corpus serves to corroborate this point, using a quantitative investigation to assess which effects or problems are likely to be the most prominent.  ...  The referential annotation of TüBa-D/Z was carried out by Karin Naumann and Vera Möller (principal annotators) as well as Kathrin Eichler and Simone Müller (parallel annotation on selected parts).  ... 
doi:10.1007/s11168-008-9059-1 fatcat:6x7gh2mmprhqrggr7rpaxafhhi

Identity, non-identity, and near-identity: Addressing the complexity of coreference

Marta Recasens, Eduard Hovy, M. Antònia Martí
2011 Lingua  
It argues that coreference is best handled when identity is treated as a continuum, ranging from full identity to non-identity, with room for near-identity relations to explain currently problematic cases  ...  treatment of cases in previous coreference annotation efforts.  ...  By problematic we mean those cases that involved disagreements during the coreference annotation process of the AnCora corpus or cases encountered in the other sources that could be argued either way-coreferent  ... 
doi:10.1016/j.lingua.2011.02.004 fatcat:qvqd27vp3rgs5noe5s6rilkbvm

Coreferential Relations in Basque: The Annotation Process

Klara Ceberio, Itziar Aduriz, Arantza Díaz de Ilarraza, Ines Garcia-Azkoaga
2018 Journal of Psycholinguistic Research  
A part of the corpus was tagged by two annotators who marked up the same text independently, and by another annotator that acted as judge, solving problems in case of disagreement.  ...  Due to the fact that Basque is not an Indo-European language, it differs considerably in grammar from the languages spoken in surrounding areas.  ...  Even if the declension mark is not identical we annotate it as identical since it does not give any new semantic information.  ... 
doi:10.1007/s10936-018-9559-6 pmid:29399705 fatcat:sq44lk27dvaj3ax6jqygis67y4

Switch-reference and its role in referential choice in Mbyá Guaraní narratives

Guillaume Thomas, Gregory Antono, Laurestine Bradford, Angelika Kiss, Darragh Winkelman
2021 Corpus Linguistics and Linguistic Theory  
We propose several rules, in order to come to a unified coreference annotating approach. Identity Coreference relations of the type Identity must adhere to strict rules.  ...  Such an approach should reduce potential interrater disagreement. Table 1 presents the coreference relation types I propose to use.  ...  Any NP that could receive coreference information should receive it. (2) NEARFIRST: choose the nearest antecedent.  ... 
doi:10.1515/cllt-2020-0028 fatcat:7awawv5a3nd6fbhfvk4gi6zcyi

Can we Fix the Scope for Coreference? Problems and Solutions for Benchmarks beyond OntoNotes [article]

Amir Zeldes
2021 arXiv   pre-print
; 2. cross-linguistic generalizability; and 3. a separation of identity and scope, which can resolve old problems involving temporal and modal domain consistency.  ...  Current work on automatic coreference resolution has focused on the OntoNotes benchmark dataset, due to both its size and consistency.  ...  Annotating near-identity from coreference disagreements. In Proceedings of LREC 2012, pages 165–172, Istanbul, Turkey, 2012. Tanya Reinhart. Pragmatics and linguistics.  ... 
arXiv:2112.09742v1 fatcat:eo6koz34xrhlzolkubyojjodwq

Analysis and Reference Resolution of Bridge Anaphora across Different Text Genres [chapter]

Iris Hendrickx, Orphée De Clercq, Veronique Hoste
2011 Lecture Notes in Computer Science  
After briefly presenting the annotation guidelines and inter-annotation agreement results, we conduct an in-depth manual analysis of the different types of bridge relations found in our data sets.  ...  This inspired us to investigate to what extent a standard coreference resolution system for Dutch is capable of resolving bridge relations across different text genres and study the effect of adding semantic  ...  Table 3. Manual analysis according to the typology for Near Identity.  ... 
doi:10.1007/978-3-642-25917-3_1 fatcat:hmpc36nka5fzbhzwqxz7c6xupi