Filters








650 Hits in 4.5 sec

Different Issues in the Design of a Lemmatizer/Tagger for Basque [article]

I. Aduriz, I. Alegria, J. M. Arriola, X. Artola, Diaz de Illarraza A., N. Ezeiza, K. Gojenola, M. Maritxalar
1995 arXiv   pre-print
This paper presents relevant issues that have been considered in the design of a general purpose lemmatizer/tagger for Basque (EUSLEM).  ...  Due to the characteristics of the language, the tagset here proposed in structured in for levels, so that each level is a refinement of the previous one in the sense that it adds more detailed information  ...  In order to elaborate this project the following basic tools will be used: • The Lexical Database for Basque (LDBB).  ... 
arXiv:cmp-lg/9503020v1 fatcat:hogi5xdv3bf47bcqriqyi5v2wi

SYLLABARIUM: An online application for deriving complete statistics for Basque and Spanish orthographic syllables

Jon Andoni Duñabeitia, Joana Cholin, José Corral, Manuel Perea, Manuel Carreiras
2010 Behavior Research Methods  
For more than a decade, a large body of empirical evidence has shown that, for the recognition of a written word, subword units are accessed at early stages of visual word processing, and the properties  ...  of these subword units have an effect on reading behavior.  ...  The authors express their gratitude to Marc Brysbaert and to two anonymous reviewers for their comments on an earlier draft. Correspondence concerning this article should be addressed to J. A.  ... 
doi:10.3758/brm.42.1.118 pmid:20160291 fatcat:pexhaspvx5h23iqzsjcz5wl7iy

Automatic morphological analysis of Basque

I. Alegria, X. Artola, K. Sarasola, M. Urkia
1996 Literary and Linguistic Computing  
This analyser is a basic tool for current and future work on automatic processing of Basque and its first two applications are a commercial spelling corrector named Xuxen and a general purpose lemmatizer  ...  In order to deal with a wide variety of linguistic data and to be a support for other NLP applications, we have built a Lexical Database (LDBB).  ... 
doi:10.1093/llc/11.4.193 fatcat:rcuxtvb6jnexxardqyvy3ciaqy

Building the Gold Standard for the Surface Syntax of Basque

Itziar Aduriz, María Jesús Aranzabe, Jose Maria Arriola, Arantza Díaz de Ilarraza, Itziar Gonzalez-Dios, Ruben Urizar
2017 Revista de Procesamiento de Lenguaje Natural (SEPLN)  
Tags from the lexical database The analyses produced by the morphosyntatic analyzer for Basque Morfeus are accomplished based on the information included in the lexical database for Basque EDBL.  ...  The information contained in the lexical database for Basque EDBL (Aldezabal et al., 2001) constitutes the basis for our analyzers.  ... 
dblp:journals/pdln/AdurizAAIGU17 fatcat:4jdtltvtfzfbzpp32huk64oske

The role of the frequency of constituents in compound words: Evidence from Basque and Spanish

Jon Andoni Duñabeitia, Manuel Perea, Manuel Carreiras
2007 Psychonomic Bulletin & Review  
An additional set of 48 nonwords of 6 to 12 letters was included for the purposes of the lexical decision task.  ...  Cross-language comparisons are a key issue to understanding the generality of morphological parsing (see Frost et al., 2005, for com parison between Indo-European and Semitic languages).  ...  The dissociation of the frequency effect for the initial and ending lexeme clearly supports a morphemic/lexical decomposition in lexical access.  ... 
doi:10.3758/bf03193108 pmid:18229492 fatcat:kf5m7sqmhjcnrm5jxo5oli6epa

Decision Tree-Based Context Dependent Sublexical Units for Continuous Speech Recognition of Basque [chapter]

K. López de Ipiña, M. Graña, N. Ezeiza, M. Hernández, E. Zulueta, A. Ezeiza
2003 Lecture Notes in Computer Science  
This paper presents a new methodology, based on the classical decision trees, to get a suitable set of context dependent sublexical units for Basque Continuous Speech Recognition (CSR).  ...  In addition, the use of the new context dependent units to build word models was addressed.  ...  The authors would like to thank all the volunteer speakers that has collaborated recording the databases. We thank also all people have collaborated in the development of this work.  ... 
doi:10.1007/978-3-540-24586-5_31 fatcat:4ojdxipkl5hk3pbtp5yxvq7v5i

Across language families: Genome diversity mirrors linguistic variation within Europe

Giuseppe Longobardi, Silvia Ghirotto, Cristina Guardiano, Francesca Tassi, Andrea Benazzo, Andrea Ceolin, Guido Barbujani
2015 American Journal of Physical Anthropology  
Contrary to previous observations, on the European scale, language proved a better predictor of genomic differences than geography.  ...  We corroborated the method and used it to compare patterns of linguistic and genomic variation in Europe.  ...  Finally, and most crucially, for the purpose of calculating Mantel correlations between qualitatively and quantitatively very different entities (56 parameters, 178.000 SNPs), distances seem a necessary  ... 
doi:10.1002/ajpa.22758 pmid:26059462 pmcid:PMC5095809 fatcat:bcixydo3gvbf7evmpxgt5gj6ae

Wordnet-LMF

Claudia Soria, Monica Monachini, Piek Vossen
2009 Proceeding of the 2009 international workshop on Intercultural collaboration - IWIC '09  
Wordnet-LMF was developed in the framework of the EU KYOTO project for the specific purpose of endowing a set of wordnets with a standardized interoperability format allowing the interchange of lexicosemantic  ...  In this paper we present Wordnet-LMF, a dialect of ISO Lexical Markup Framework that instantiates LMF for representing wordnets.  ...  to be used as a working encoding format for storing and access of lexical information in a dedicated database.  ... 
doi:10.1145/1499224.1499246 fatcat:25z6p3lej5doviwk5gp4ofwpam

Placeholders in the English Interlanguage of Bilingual (Basque/Spanish) Children

María del Pilar García Mayo, Amparo Lázaro Ibarrola, Juana M. Liceras
2005 Language Learning  
In this article we provide an explanation for 2 syntactic phenomena whose systematic production has been observed in the English nonnative grammar of 3 different age groups of 58 bilingual (Basque/Spanish  ...  The research reported here is part of a larger longitudinal study on the English interlanguage of bilingual (Basque/Spanish) children.  ...  Specifically, in the process of the acquisition of English by speakers of Spanish and/or Basque, we will pay special attention to the fact that (a) Spanish and Basque pronouns differ from English pronouns  ... 
doi:10.1111/j.0023-8333.2005.00312.x fatcat:unwqqtv4vrgabdwhgsu73lpdwu

Bilingual Lexicography and Corpus Methods. The Example of German-Basque as Language Pair

David Lindemann
2013 Procedia - Social and Behavioral Sciences  
In this paper, we present some research on German-Basque corpus-based lexicography and describe our proposals for a new German-Basque electronic dictionary for Basque-L1 German learners.  ...  In the context of a low-or medium-density language pair, they will have to ask which electronic resources and tools are needed and available, and to evaluate the bilingual glossaries obtained with computational  ...  Acknowledgements This study has been supported by the project IT665-13, funded by the Basque Government. Funding is gratefully acknowledged.  ... 
doi:10.1016/j.sbspro.2013.10.645 fatcat:amhsugq4yzbrrld35lmrqn2huy

Constructing an intelligent dictionary help system

E. AGIRRE, X. ARREGI, X. ARTOLA, A. DIÁZ DE ILARRAZA, K. SARASOLA, A. SOROA
1996 Natural Language Engineering  
[17] , a general conceptual model for describing lexical knowledge is presented, as well as the way to describe each source in terms of the classes and relationships of the general model.  ...  Lexical disambiguation is not a trivial issue and is receiving much attention in recent research.  ... 
doi:10.1017/s1351324997001356 fatcat:3hdub4627fegrkfey544c3ou2y

IsMilkmana superhero likeBatman? Constituent morphological priming in compound words

Jon Andoni Duñabeitia, Itziar Laka, Manuel Perea, Manuel Carreiras
2009 The European Journal of Cognitive Psychology  
We will come back to this issue in the General Discussion. The aim of the present experiments is twofold.  ...  A set of 52 compound words was selected from the Basque E-Hitz database (Perea et al., 2006 ; see Appendix for a complete list of materials).  ...  APPENDIX Word and nonword primes and targets used in the experiments Each triplet consists of (1) the related prime, (2) the unrelated prime, and (3) the target.  ... 
doi:10.1080/09541440802079835 fatcat:botj4qlpnrhl7dub34menazgdu

Bilingual education/bilingualism

1998 Language Teaching  
A recent spate of interest in early verb learning has provided several databases for examining the order in which verbs appear in children's speech.  ...  How these factors interact within the overall context of a kind of de facto additive bilingual situation, in turn, suggests a different approach to the broader debate on the general cognitive/linguistic  ... 
doi:10.1017/s0261444800013446 fatcat:rg7cgdyf6bdg7docj4khh24toa

MULTIMAP: Multilingual visual naming test for the mapping of eloquent areas during awake surgeries [article]

Sandra Gisbert-Munoz, Ileana Quinones, Lucia Amoruso, Polina Timofeeva, Shuang Geng, Sami Boudelaa, Inigo Pomposo, Santiago Gil-Robles, Manuel Carreiras
2020 biorxiv/medrxiv   pre-print
Recognizing that the distinction between nouns and verbs is necessary for detailed and precise language mapping, MULTIMAP consists of a database of 218 standardized color pictures representing both objects  ...  Heterogeneity in the selection criteria for stimuli leads to differences, for example, in the size, color, image quality, and even names associated with pictures, making direct cross-linguistic comparisons  ...  Acknowledgements We would like to thank the BCBL Lab Department for data recordings and Magda Altman for her useful comments on the manuscript. Funding  ... 
doi:10.1101/2020.02.20.957282 fatcat:tukjlpxnqfccrcmb4lrljvuuve

MULTIMAP: Multilingual picture naming test for mapping eloquent areas during awake surgeries

Sandra Gisbert-Muñoz, Ileana Quiñones, Lucia Amoruso, Polina Timofeeva, Shuang Geng, Sami Boudelaa, Iñigo Pomposo, Santiago Gil-Robles, Manuel Carreiras
2020 Behavior Research Methods  
Recognizing that the distinction between nouns and verbs is necessary for detailed and precise language mapping, MULTIMAP consists of a database of 218 standardized color pictures representing both objects  ...  Heterogeneity in the selection criteria for stimuli leads to differences, for example, in the size, color, image quality, and even names associated with pictures, making direct cross-linguistic comparisons  ...  Acknowledgements We would like to thank the BCBL Lab Department for data recordings and Magda Altman for her useful comments on the manuscript. Funding  ... 
doi:10.3758/s13428-020-01467-4 pmid:32901346 fatcat:5eytdpfvojfkxp4yld46kkem4u
« Previous Showing results 1 — 15 out of 650 results