Filters








161 Hits in 4.7 sec

Restructuring tagged corpora with morpheme adjustment rules

Toshihisa Tashiro, Noriyoshi Uratani, Tsuyoshi Morimoto
1994 Proceedings of the 15th conference on Computational linguistics -   unpublished
.), it is ditficult to use tagged corpora with an incompatible morphological infm'mation system. This paper proposes a me,hod of converting tagged corpora frOlll olte lllorphellle system to allother,  ...  A part-of-speech tagged corpus is a very hnportant knowledge source for natural language processing researchers. 'Poday, several part-of-speech tagged corpora are readily available for research use.  ...  [1] . '.l'his paper proposes another method of acquiring large part-of-speech tagged corpora: restructuring tagged corpora by using morpheme adjustment rules.  ... 
doi:10.3115/991886.991986 fatcat:4ksaxni45rb6jhqorpqjrouu3u

Building an Experimental German User Interface Terminology Linked to SNOMED CT

David Hashemian Nik, Zdenko Kasáč, Zsófia Goda, Anita Semlitsch, Stefan Schulz
2019 Studies in Health Technology and Informatics  
The second step was to fill up the n-gram table with human and machine translations, manually enriched by POS tags. Top-down and bottom-up methods for manual terminology population were used.  ...  Grammar rules were formulated and embedded into a term generator, which then created one-to-many German variants per SNOMED CT description.  ...  It also produces single word compounds, using specific tags and rules as explained above.  ... 
doi:10.3233/shti190202 pmid:31437904 fatcat:xvo2tn7mrjembgavn47hqna2pm

Machine-Translation History and Evolution: Survey for Arabic-English Translations

Nabeel Alsohybe, Neama Dahan, Fadl Ba-Alwi
2017 Current Journal of Applied Science and Technology  
As a result of the rapid changes in information and communication technology (ICT), the world has become a small village where people from all over the world connect with each other in dialogue and communication  ...  It reads the sentence words and provide us with every possible POS tags of the single word.  ...  Stanford parser can distinguish the POS tags of the English sentences successfully with high precision.  ... 
doi:10.9734/cjast/2017/36124 fatcat:wyc7lka3ovfz5eo66yvl33ir2i

Self-Organizing Machine Translation: Example-Driven Induction of Transfer Functions [article]

Patrick Juola
1994 arXiv   pre-print
This system has been used to infer English->French and English->Urdu transfer functions from small corpora.  ...  This paper describes an attempt to merge the Example-Based Machine Translation (EBMT) approach with psycholinguistic principles.  ...  psycholinguistics; Wayne Citrin, for his engineering help; Michael Main, for assistance in formalizing what eventually became Theorem 1; as well as an unrelated faculty member, Karl Winklmann, for similar help with  ... 
arXiv:cmp-lg/9406012v1 fatcat:eeqyzule2vaz3eaeitfikmmk2q

Indian English Evolution and Focusing Visible Through Power Laws

Vineeta Chand, Devin Kapper, Sumona Mondal, Shantanu Sur, Rana Parshad
2017 Languages  
The results demonstrate that IE consistently follows power law frequency distributions and the corpora are each best fit by Mandelbrot's Law.  ...  Age and gender-separated sub-corpora of the most recent corpus show minimal deviation, providing apparent time evidence for emerging IE dialect stability.  ...  We opt to follow the rule of thumb where a model with a ∆ > 10 has no support for consideration [65] .  ... 
doi:10.3390/languages2040026 fatcat:ltqgzsybhjc73ivohc5zg7zaae

Clitics in the wild: Empirical studies on the microvariation of the pronominal, reflexive and verbal clitics in Bosnian, Croatian and Serbian [article]

Zrinka Kolaković, Edyta Jurkiewicz-Rohrbacher, Björn Hansen, Dušica Filipović Đurđević, Nataša Fritz
2022 Zenodo  
It allows to construct a series of hierarchies where the factors relevant for predicting clitic climbing interact with each other.  ...  It fills the gap between the theoretical and normative literature by including solid data on variation found in dialects and spoken language and obtained from massive Web Corpora and speakers' acceptability  ...  He argues that CC depends on restructuring contexts of raising and control verbs (both subject and object), with CC taking place only with restructuring infinitives but not with non-restructuring infinitives  ... 
doi:10.5281/zenodo.5792972 fatcat:h3oqs5gzhrhclbjj7fzoijnpdq

ALA Vol. 4, No. 2 - Full text

Nina GOLOB
2014 Acta Linguistica Asiatica  
Based on the analysis it proposes a layered metalinguistic labelling solution to achieve the greates efficiency with the smallest possible number of labels being employed.  ...  Designing the error tags The next step was to decide how to apply the error tags in the compositions.  ...  The selection of an appropriate word-group for this category, in particular, was a trial and error process to begin with, and irrespective of the fact that the threshold was adjusted, it was difficult  ... 
doi:10.4312/ala.4.2.1-84 fatcat:vnzq32uennephn2w4iinoomapy

Linked open data to represent multilingual poetry collections. A proposal to solve interoperability issues between poetic repertoires

Elena González-Blanco, Gimena del Rio, Clara Martínez Cantón
2016 Zenodo  
Starling data and guidance with respect to this.  ...  reviewers for insightful feedback and helpful comments, Monika Rind-Pawlowski and Irina Nevskaya for their help in deciphering Starling language identifiers and Irina Nevskaya and Anna Dybo for providing us with  ...  interoperable with OLiA-linked corpora Eckle-Kohler et al., 2015) .  ... 
doi:10.5281/zenodo.2551595 fatcat:4nbzl534ebgnbort742fhfoqam

Getting Past the Language Gap: Innovations in Machine Translation [chapter]

Rodolfo Delmonte
2012 Mobile Speech and Advanced Natural Language Solutions  
Feature weights can be adjusted and the process iterated a number of times -typically 20 iterations.  ...  Then, I will dedicate section "Hybrid and Rule-Based MT Systems" to hybrid methods and systems.  ...  During decoding, both the conventional Hiero-style SCFG rules with general tag X and SRL-aware SCFG rules are used in a synchronous Chart Parsing algorithm.  ... 
doi:10.1007/978-1-4614-6018-3_6 fatcat:2njkc6meabhaxosl4wircumfjm

Plagiarism Detection for Indonesian Texts

Lucia D. Krisnawati, Klaus U. Schulz
2013 Proceedings of International Conference on Information Integration and Web-based Applications & Services - IIWAS '13  
This research was conducted with the support of a DAAD-Indonesian German Scholarship Program (DAAD-IGSP).  ...  Darjowidjojo in [41] listed 9 morphophonemic rules while Pisceldo et al. [122] defined 11 morphophonemic rules, 4 rules belong to the first group and 7 rules deal with the second group.  ...  How these affixes occur in a word is governed by morphotactics which is akin to a syntax of a morpheme. Morphotactic rules represent the ordering restrictions in place on the ordering of morphemes.  ... 
doi:10.1145/2539150.2539213 dblp:conf/iiwas/KrisnawatiS13 fatcat:r6p2h4oiq5fi3mhlazokatknrq

Instant annotations in ELAN corpora of spoken and written Komi, an endangered language of the Barents Sea region

Ciprian Gerstenberger, Niko Partanen, Michael Rießler
2017 Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages   unpublished
implemented as Finite State Transducers and Constraint Grammar for rule-based morphosyntactic tagging and disambiguation.  ...  Our aim is to challenge current manual approaches in the annotation of language documentation corpora. 12  ...  for Language Corpora (University of Hamburg).  ... 
doi:10.18653/v1/w17-0109 fatcat:rjneh7qjfvhphgyaz2oqwagfni

Software Architecture for Language Engineering

HAMISH CUNNINGHAM, DONIA SCOTT
2004 Natural Language Engineering  
Sense Tagging: Semantic Tagging with a Lexicon. In Proceedings of the SIGLEX Workshop on Tagging Text with Lexical Semantics, pages 74-78, Washington, DC, 1997. [Waterworth 87] T. Waterworth.  ...  These corpora are often annotated with the intended output of some task (e.g. part-of-speech tagging, or Named Entity recognition and classification, or alignment of translations) and used to train statistical  ...  Annotations matched on the LHS of a rule may be referred to on the RHS by means of labels that are attached to pattern elements. A.1 Grammar of JAPE JAPE is similar to CPSL with a few exceptions.  ... 
doi:10.1017/s1351324904003481 fatcat:xzkpj2edozgidfrknmergcyyga

Extracting and Learning a Dependency-Enhanced Type Lexicon for Dutch [article]

Konstantinos Kogkalidis
2019 arXiv   pre-print
This thesis is concerned with type-logical grammars and their practical applicability as tools of reasoning about sentence syntax and semantics.  ...  An algorithm for the conversion of dependency-annotated sentences into type sequences is then implemented, populating the type logic with concrete, data-driven lexical types.  ...  These cases are easy to solve by simply adjusting the numerals' dependency label into modifier (mod ), when they co-exist with a real determiner.  ... 
arXiv:1909.02955v2 fatcat:qjdwpjelcjgdnewx5wy2xofii4

Linguistic Annotation [chapter]

Martha Palmer, Nianwen Xue
2010 The Handbook of Computational Linguistics and Natural Language Processing  
By adjusting the momentum m, we can adjust this trade-off. A second parameter which we can adjust is the learning rate.  ...  He incorporates bias by adjusting the prior distribution of probabilities over all lexicalized CFG rules.  ...  , 238, 240, 244, 245, 246, 289, 294 sentence error rate (SER), 546 sentential alignment, 569 sentiment tagging, 240 separation types, 413 sequence tagging task, 520 Sequitur, 374, 375 serial, 486, 487  ... 
doi:10.1002/9781444324044.ch10 fatcat:peq2ppl6gnfklh7gtwzbrt5xym

Investigation of the Interaction between the Large and Small Subunits of Potato ADP-Glucose Pyrophosphorylase

Ibrahim Barıs, Aytug Tuncel, Natali Ozber, Ozlem Keskin, Ibrahim Halil Kavakli, Ruth Nussinov
2009 PLoS Computational Biology  
Both the theory and the computational model have been developed in interaction with reaction time experiments, particularly in picture naming or related word production paradigms, with the aim of accounting  ...  This provided us with a matrix for the creation of our first lexical concepts, concepts flagged by way of a verbal label.  ...  receives picture-tagged activation).  ... 
doi:10.1371/journal.pcbi.1000546 pmid:19876371 pmcid:PMC2759521 fatcat:sarzmgeisrc7doszuucfhshtdm
« Previous Showing results 1 — 15 out of 161 results