236 Hits in 6.5 sec

Fast structural query with application to chinese treebank sentence retrieval

Chia-Hsin Huang, Tyng-Ruey Chuang, Hahn-Ming Lee
2004 Proceedings of the 2004 ACM symposium on Document engineering - DocEng '04  
As searching the Chinese Treebank corpora is structural in nature and often deals with structural similarities, conventional XML query languages, such as XPath and XQuery, are inflexible and inefficient  ...  For example, the Chinese Treebank corpus developed at the Institute of Information Science, Academia Sinica, Taiwan, is a semantically annotated corpus that has been used to help parse and study Chinese  ...  ACKNOWLEDGMENTS We would like to thank Dr. Keh-Jian Chen for his suggestions and insightful comments on this work. He also provided us with the CKIP Chinese Treebank corpus for our experiments.  ... 
doi:10.1145/1030397.1030400 dblp:conf/doceng/HuangCL04 fatcat:jju55zzngzdtzj4ipo4gttyff4

Has Computational Linguistics Become More Applied? [chapter]

Kenneth Church
2009 Lecture Notes in Computer Science  
The model gives high accuracy in translating the Queries from English to Arabic solving the translation and transliteration ambiguities and with orthographic query expansion; it gives high degree of accuracy  ...  We evaluate our method comparing the answers given by a traditional information retrieval systemvector space model adjusted for article retrieval, instead of document retrieval-and the answers to 21 questions  ...  The paper concentrates on deriving non-obvious information about clause structure of complex sentences from the Prague Dependency Treebank.  ... 
doi:10.1007/978-3-642-00382-0_1 fatcat:oddvfzds4nfwjam2ccqeaxe2y4

Ranking Algorithms for Word Ordering in Surface Realization

Alessandro Mazzei, Mattia Cerrato, Roberto Esposito, Valerio Basile
2021 Information  
In this paper, we propose to apply general learning-to-rank algorithms to the task of word ordering in the broader context of surface realization.  ...  experiments show promising results, in particular highlighting the performance of the pairwise approach, paving the way for a more transparent surface realization from arbitrary tree- and graph-like structures  ...  s adaptation of SVM to information retrieval [19] ). Lastly, listwise algorithms are able to consider the whole list of documents in the query during the computation of the cost function [24, 25] .  ... 
doi:10.3390/info12080337 fatcat:63yzlggmgfdzfili7t7q6tpody

Asian language processing: current state-of-the-art

Chu-Ren Huang, Takenobu Tokunaga, Sophia Yat Mei Lee
2007 Language Resources and Evaluation  
The challenge is made more formidable by the fact that as a whole, Asian languages range from the language with most speakers in the world (Mandarin Chinese, close to 900 million native speakers) to the  ...  Major Asian languages such as Mandarin Chinese, Hindi, Japanese, Korean, and Thai have benefited from several years of intense language processing research, and fast-developing languages (e.g., Filipino  ...  We would also like to thank all the reviewers, whose prompt action helped us through all the submitted papers with helpful comments.  ... 
doi:10.1007/s10579-007-9041-9 fatcat:ryrtqspk5nggdgzp7o75knch3m

Robust text processing in automated information retrieval

Tomek Strzalkowski
1994 Proceedings of the fourth conference on Applied natural language processing -  
The backbone of our system is a traditional retrieval engine which builds inverted index files from pre-processed documents, and then searches and ranks the documents in response to user queries.  ...  This paper outlines a prototype text retrieval system which uses relatively advanced natural language processing techniques in order to enhance the effectiveness of statistical document retrieval.  ...  ACKNOWLEDGEMENTS We would like to thank Donna Harman of NIST for making her PRISE system available to us.  ... 
doi:10.3115/974358.974396 dblp:conf/anlp/Strzalkowski94 fatcat:3irhkmnwuzgqpif7ld2fizgpeu

A Survey of the Usages of Deep Learning in Natural Language Processing [article]

Daniel W. Otter, Julian R. Medina, Jugal K. Kalita
2019 arXiv   pre-print
Analyzed research areas include several core linguistic processing issues in addition to a number of applications of computational linguistics.  ...  A discussion of the current state of the art is then provided along with recommendations for future research in the field.  ...  and Chinese datasets on the English Penn Treebank.  ... 
arXiv:1807.10854v3 fatcat:ajyv5o743naixeo5c5y6p6tg3e

Message from the general chair

Benjamin C. Lee
2015 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)  
Learning-based Multi-Sieve Co-reference Resolution with Knowledge Lev Ratinov and Dan Roth Saturday 11:00am-11:30am -202 A (ICC) We explore the interplay of knowledge and structure in co-reference resolution  ...  The model is not restricted to nominal  ...  ) and ranking the retrieved sentence pairs according to the relevance between the query and the translation equivalents.  ... 
doi:10.1109/ispass.2015.7095776 dblp:conf/ispass/Lee15 fatcat:ehbed6nl6barfgs6pzwcvwxria

Improving Machine Translation through Linked Data

Ankit Srivastava, Georg Rehm, Felix Sasaki
2017 Prague Bulletin of Mathematical Linguistics  
We conclude with an analysis of best practices for multilingual linked data sets in order to optimise their benefit to multilingual and cross-lingual applications.  ...  With the ever increasing availability of linked multilingual lexical resources, there is a renewed interest in extending Natural Language Processing (NLP) applications so that they can make use of the  ...  Acknowledgements We would like to thank the anonymous reviewers for their insightful and helpful comments.  ... 
doi:10.1515/pralin-2017-0033 fatcat:qs6lpz5wwbcelayobupsfpxmse

Computational linguistics and grammar engineering [article]

Emily M. Bender, Guy Emerson
2021 Zenodo  
computational studies with HPSG; computational resources developed within HPSG; how those resources are deployed, for both practical applications and linguistic research; and finally, a sampling of linguistic  ...  We discuss the relevance of HPSG for computational linguistics, and the relevance of computational linguistics for HPSG, including: the theoretical and computational infrastructure required to carry out  ...  Acknowledgments We would like to thank Stephan Oepen for helpful comments on an early draft of this chapter, Stefan Müller for detailed comments as volume editor and Elizabeth Pankratz for careful copy  ... 
doi:10.5281/zenodo.5599867 fatcat:qfrfqb5fnngdtbbqhhm3dkmmua

Few-Shot Relation Extraction on Ancient Chinese Documents

Bo Li, Jiyu Wei, Yang Liu, Yuze Chen, Xi Fang, Bin Jiang
2021 Applied Sciences  
The paired attention network enhances and extracts relations between support and query instances. Experimental results show that our model achieved promising performance with scarce corpus.  ...  In this work, we aim to develop a relation extractor for ancient Chinese documents to automatically extract the relations by using unstructured data.  ...  Acknowledgments: Thanks to the East Asia Digital Humanities Lab which annotates the TinyACD-RC dataset and supports this research. Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/app112412060 fatcat:w7xyy3m3vffttldcvt7vn7ft3a

A Systematic Literature Review on Text Generation Using Deep Neural Network Models

Noureen Fatima, Ali Shariq Imran, Zenun Kastrati, Sher Muhammad Daudpota, Abdullah Soomro, Sarang Shaikh
2022 IEEE Access  
The application of text generation in various fields has resulted in a lot of interest from the scientific community in this area.  ...  To the best of our knowledge, there is a lack of extensive review and up-to-date body of knowledge of text generation deep learning models.  ...  Table 3 shows the keywords used to perform queries.  ... 
doi:10.1109/access.2022.3174108 fatcat:662z5imliba2zosgrqdhm55dcy

BIOSMILE web search: a web application for annotating biomedical entities and relations

Hong-Jie Dai, Chi-Hsin Huang, Ryan T. K. Lin, Richard Tzong-Han Tsai, Wen-Lian Hsu
2008 Nucleic Acids Research  
After receiving keyword query input, BWS retrieves matching PubMed abstracts and lists them along with snippets by order of relevancy to protein-protein interaction.  ...  To date, BWS has been field tested by over 30 biologists and questionnaires have shown that subjects are highly satisfied with its capabilities and usability. BWS is accessible free of charge at  ...  The search engine accepts users' queries and retrieves matching PubMed abstracts. Each query is wrapped as a remote web service call and sent to the NCBI Entrez Utilities Web Service (12) .  ... 
doi:10.1093/nar/gkn319 pmid:18515840 pmcid:PMC2447743 fatcat:vg6lwdgj6zeslg6pveflfjz4ou

An Analysis of Prepositional-Phrase Attachment Disambiguation

Mohammed H. Hamdan, Imtiaz H. Khan
2018 International Journal of Computational Linguistics Research  
Finally, they created around 12,766 ambiguous sentences with quadruples structured which extracted from a WSJ Penn Treebank.  ...  To conclude, an accuracy of parsing Arabic treebank was lower than other languages such as English and Chinese Treebank, though they have the same size of data.  ... 
doi:10.6025/jcl/2018/9/2/60-80 fatcat:3czwutykxvh6jojm62rnvocmpq

Cross Language Information Retrieval Model for Discovering WSDL Documents Using Arabic Language Query

Prof. Dr., Dr. Ayman, Fahad Kamal
2013 International Journal of Advanced Computer Science and Applications  
Text mining techniques were applied on WSDL content and user's query to be ready for CLIR methods.  ...  , This paper proposes the application of CLIR techniques and IR methods to support Bilingual Web service discovery process the second language that proposed here is Arabic.  ...  queries with different languages "Arabic in our model" and retrieve the suitable service and WSDL document with a translated WSDL version to query writer's language.  ... 
doi:10.14569/ijacsa.2013.040817 fatcat:mnfb5gyvtjdm7gcttascygdf3e

Quantitative Linguistic Investigations across Universal Dependencies Treebanks [chapter]

Chiara Alzetta, Felice Dell'Orletta, Simonetta Montemagni, Petya Osenova, Kiril Simov, Giulia Venturi
2020 Proceedings of the Seventh Italian Conference on Computational Linguistics CLiC-it 2020  
While the first statistical parsers have long been trained on the Penn treebank phrase structures, dependency treebanks, whether natively annotated with dependencies, or converted from phrase structures  ...  Moreover, in order to create a point of contact with the large community working in quantitative linguistics it seemed expedient to create a workshop dedicated to quantitative syntactic measures on treebanks  ...  Thanks are due to the parents and children who participated in the observational studies, and to the researchers who contributed the corpora to the CHILDES archive.  ... 
doi:10.4000/books.aaccademia.8210 fatcat:ctirgjeeirccbgy6aen5ufd7pe
« Previous Showing results 1 — 15 out of 236 results