Filters








564 Hits in 2.9 sec

Building a Situation-Based Language Knowledge Base [chapter]

Qiang Zhou, Zushun Chen
2005 Lecture Notes in Computer Science  
a large scale Chinese treebank.  ...  We developed a supporting platform to make full use of the abundant information contained in current Chinese semantic lexicons so as to gradually summarize the complete situation descriptions, organize  ...  and a large scale Chinese treebank.  ... 
doi:10.1007/978-3-540-30586-6_36 fatcat:r4nhntq3dranfnkilevpfuqucu

Build a Large-Scale Syntactically Annotated Chinese Corpus [chapter]

Qiang Zhou
2003 Lecture Notes in Computer Science  
This paper reports on our research to build a large-scale Tsinghua Chinese Treebank (TCT). We propose a two-stage approach to reduce manual proofreading labors as much as possible.  ...  treebank.  ...  Acknowledgements This work was supported by the Chinese National Science Foundation (Grant No. 69903007, 60173008), National 973 Foundation (Grant No. 1998030507) and National 863 plan (Grant No. 2001AA114040  ... 
doi:10.1007/978-3-540-39398-6_15 fatcat:sxzftfckbza4tc2je2qthkes2a

Dependency Parsing for Weibo: An Efficient Probabilistic Logic Programming Approach

William Yang Wang, Lingpeng Kong, Kathryn Mazaitis, William W Cohen
2014 Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)  
We present a new GFL/FUDG-annotated Chinese treebank with more than 18K tokens from Sina Weibo (the Chinese equivalent of Twitter).  ...  We formulate the dependency parsing problem as many small and parallelizable arc prediction tasks: for each task, we use a programmable probabilistic firstorder logic to infer the dependency arc of a token  ...  The authors are solely responsible for the contents of the paper, and the opinions expressed in this publication do not reflect those of the funding agencies.  ... 
doi:10.3115/v1/d14-1122 dblp:conf/emnlp/WangKMC14 fatcat:zqvwuwp6irgcfnyoq6x4u4bqfi

Tibetan Information Extraction Technology Integrated with Event Feature and Semantic Role Labelling

Fucheng Wan, Jianhua Xia, Yansong Wang
2017 MATEC Web of Conferences  
For Tibetan language information extraction, through experiments analyzed, syntactic analysis model which is integrated with information of semantics, as well as the evaluation of program can be used in  ...  [17] used the co-occurrence frequency of named entities to calculate the relationship between Tibetan entities, but in the open field, there are a fewer researches on Tibetan event extraction at home  ...  [14] Researched automatic parsing model by combining the rules of sentence analysis and MEM based on the research of Li Xiang, Liu Qun, et al. [15] . Jin Ming, et al.  ... 
doi:10.1051/matecconf/201712801016 fatcat:ltedbe52onbyzflrb5ypmb54dq

Aligning Chinese-English Parallel Parse Trees: Is it Feasible?

Dun Deng, Nianwen Xue
2014 Proceedings of LAW VIII - The 8th Linguistic Annotation Workshop  
This work is done in the context of an annotation project where we construct a parallel treebank by doing word and phrase alignments simultaneously.  ...  We investigate the feasibility of aligning Chinese and English parse trees by examining cases of incompatibility between Chinese-English parallel parse trees.  ...  Any opinions, findings, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect those of the sponsor or any of the people mentioned above.  ... 
doi:10.3115/v1/w14-4904 dblp:conf/acllaw/DengX14 fatcat:klqpyev2pzbazg74ettue72cc4

82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models [article]

Aaron Smith, Bernd Bohnet, Miryam de Lhoneux, Joakim Nivre, Yan Shao, Sara Stymne
2018 arXiv   pre-print
Instead of training a single parsing model for each treebank, we trained models with multiple treebanks for one language or closely related languages, greatly reducing the number of models.  ...  Our system is a pipeline consisting of three components: the first performs joint word and sentence segmentation; the second predicts part-of- speech tags and morphological features; the third predicts  ...  learn representations of tokens in context, and are trained together with a multilayer perceptron that predicts transitions and arc labels based on a few BiLSTM vectors. allow the construction of non-projective  ... 
arXiv:1809.02237v1 fatcat:jhjl2eqnejaixjj5jbzfro3u3u

82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models

Aaron Smith, Bernd Bohnet, Miryam de Lhoneux, Joakim Nivre, Yan Shao, Sara Stymne
2018 Proceedings of the  
Instead of training a single parsing model for each treebank, we trained models with multiple treebanks for one language or closely related languages, greatly reducing the number of models.  ...  Our system is a pipeline consisting of three components: the first performs joint word and sentence segmentation; the second predicts part-ofspeech tags and morphological features; the third predicts dependency  ...  Our parser is extended with a SWAP transition to allow the construction of nonprojective dependency trees (Nivre, 2009) .  ... 
doi:10.18653/v1/k18-2011 dblp:conf/conll/SmithBLNSS18 fatcat:dsxah2jmcvbcta6fzhtvkwnyta

Parsing-based Chinese word segmentation integrating morphological and syntactic information

Xihong Wu, Meng Zhang, Xiaojun Lin
2011 2011 7th International Conference on Natural Language Processing and Knowledge Engineering  
Chinese morphology intensively investigates the constructions and usages of Chinese words, which is helpful to Chinese word segmentation.  ...  Experiments on Penn Chinese Treebank(CTB) 5.0 show that the proposed model obtains competitive performances as the CRFs-based model.  ...  A. The Construction of Chinese Morpheme Corpus It is difficult to construct a Chinese morpheme corpus owing to its large scales and ambiguities.  ... 
doi:10.1109/nlpke.2011.6138178 dblp:conf/nlpke/WuZL11 fatcat:md6tnidy7rdupcm3h62tahsrfa

A Universal Part-of-Speech Tagset [article]

Slav Petrov, Dipanjan Das, Ryan McDonald
2011 arXiv   pre-print
As a result, when combined with the original treebank data, this universal tagset and mapping produce a dataset consisting of common parts-of-speech for 22 different languages.  ...  In addition to the tagset, we develop a mapping from 25 different treebank tagsets to this universal set.  ...  Acknowledgements We would like to thank Joakim Nivre for allowing us to use a preliminary tagset mapping used in the work of . The second author was supported in part by NSF grant IIS-0844507.  ... 
arXiv:1104.2086v1 fatcat:krcodolyxnai5n467h3vgssd44

Dependency Chart Parsing Algorithm Based on Ternary-Span Combination

Meixun JIN, Yong-Hun LEE, Jong-Hyeok LEE
2013 IEICE transactions on information and systems  
This eventually leads to state-of-the-art performance of dependency parsing on the Chinese data of the CoNLL shared task. key words: dependency chart parsing, span-based parsing, factor-based parsing,  ...  This paper presents a new span-based dependency chart parsing algorithm that models the relations between the left and right dependents of a head.  ...  the Chinese treebank of CoNLL'07 (abbreviated as Chinese data).  ... 
doi:10.1587/transinf.e96.d.93 fatcat:fur4nuvsw5aqzj2indccggnhji

Toward Better Chinese Word Segmentation for SMT via Bilingual Constraints

Xiaodong Zeng, Lidia S. Chao, Derek F. Wong, Isabel Trancoso, Liang Tian
2014 Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)  
We propose dealing with the induced word boundaries as soft constraints to bias the continuous learning of a supervised CRFs model, trained by the treebank data (labeled), on the bilingual data (unlabeled  ...  This study investigates on building a better Chinese word segmentation model for statistical machine translation.  ...  The work of Isabel Trancoso was supported by national funds through FCT-Fundação para a Ciêcia e a Tecnologia, under project PEst-OE/EEI/LA0021/2013.  ... 
doi:10.3115/v1/p14-1128 dblp:conf/acl/ZengCWTT14 fatcat:7u2oznufyvghzig3js7ft657qm

CPTAM: Constituency Parse Tree Aggregation Method [article]

Adithya Kulkarni, Nasim Sabetpour, Alexey Markin, Oliver Eulenstein, Qi Li
2022 arXiv   pre-print
Diverse Natural Language Processing tasks employ constituency parsing to understand the syntactic structure of a sentence according to a phrase structure grammar.  ...  Specifically, we propose the first truth discovery solution for tree structures by minimizing the weighted sum of Robinson-Foulds (RF) distances, a classic symmetric distance metric between two trees.  ...  Any opinions, findings, and conclusions, or recommendations expressed in this document are those of the author(s) and should not be interpreted as the views of any U.S. Government. The U.S.  ... 
arXiv:2201.07905v1 fatcat:tnzepapil5gcfmndk2aqsggh3q

Dependency Parsing Using Global Features [chapter]

Tetsuji Nakagawa
2010 Text, Speech and Language Technology  
In an extrinsic evaluation setup, ELMoLex ranked 7 th for Event Extraction, Negation Resolution tasks and 11 th for Opinion Analysis task by F1 score.  ...  In this paper, we present the details of the neural dependency parser and the neural tagger submitted by our team 'ParisNLP' to the CoNLL 2018 Shared Task on parsing from raw text to Universal Dependencies  ...  2014) to whom we owe a lot.  ... 
doi:10.1007/978-90-481-9352-3_5 fatcat:wxfl4um2efe5tigujw427ka3eu

A syntactic component for Vietnamese language processing

Phuong Le-Hong, Azim Roussanaly, Thi-Minh-Huyen Nguyen
2015 Journal of Language Modelling  
We first discuss the construction of a lexicalized tree-adjoining grammar using an automatic extraction approach.  ...  We then present the construction and evaluation of a deep syntactic parser based on the extracted grammar. This is a complete system that produces syntactic structures for Vietnamese sentences.  ...  is also isolating; Chinese is classified in a branch of Sino-Tibetan language family.  ... 
doi:10.15398/jlm.v3i1.89 fatcat:4ql43c6wivhpzjiqjbdiao5b6m

Neural Probabilistic Model for Non-projective MST Parsing [article]

Xuezhe Ma, Eduard Hovy
2017 arXiv   pre-print
On top of the neural network, we introduce a probabilistic structured layer, defining a conditional log-linear model over non-projective trees.  ...  Our parser achieves state-of-the-art parsing performance on nine datasets.  ...  Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of DARPA.  ... 
arXiv:1701.00874v4 fatcat:udsk5df7tjhd5bd6hc4tb352hy
« Previous Showing results 1 — 15 out of 564 results