Zipf's laws of meaning in Catalan [article]

Neus Català, Jaume Baixeries, Ramon Ferrer-Cancho, Lluís Padró, Antoni Hernández-Fernández
2021 arXiv   pre-print
We verify these laws in Catalan via the relationship among their exponents and that of the rank-frequency law.  ...  We report the first evidence of two marked regimes for these laws in written language and speech, paralleling the two regimes in Zipf's rank-frequency law in large multi-author corpora discovered in early  ...  Rafel and the technicians of Oficines Lexicogràfiques de l'Institut d'Estudis Catalans (Institute of Catalan Studies) for providing us with the data and helpful comments.  ... 
arXiv:2107.00042v1 fatcat:zxzbhfvspjer5e4ee4hzdgqmca

Zipf's laws of meaning in Catalan

Neus Català, Jaume Baixeries, Ramon Ferrer-i-Cancho, Lluís Padró, Antoni Hernández-Fernández, Diego Raphael Amancio
2021 PLoS ONE  
We verify these laws in Catalan via the relationship among their exponents and that of the rank-frequency law.  ...  We report the first evidence of two marked regimes for these laws in written language and speech, paralleling the two regimes in Zipf's rank-frequency law in large multi-author corpora discovered in early  ...  Rafel and the technicians of Oficines Lexicogràfiques de l'Institut d'Estudis Catalans (Institute of Catalan Studies) for providing us with the data and helpful comments.  ... 
doi:10.1371/journal.pone.0260849 pmid:34914766 pmcid:PMC8675765 fatcat:y3al6esxcjc6jgskkf25ap2kbi

Linguistic Laws in Speech: The Case of Catalan and Spanish

Antoni Hernández-Fernández, Iván G. Torre, Juan-María Garrido, Lucas Lacasa
2019 Entropy  
In this work we consider Glissando Corpus—an oral corpus of Catalan and Spanish—and empirically analyze the presence of the four classical linguistic laws (Zipf's law, Herdan's law, Brevity law, and Menzerath–Altmann's  ...  law) in oral communication, and further complement this with the analysis of two recently formulated laws: lognormality law and size-rank law.  ...  The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.  ... 
doi:10.3390/e21121153 fatcat:6qhdqwtbqzdmpb3hqbigkv6fjy

Emergence of linguistic laws in human voice

Iván González Torre, Bartolo Luque, Lucas Lacasa, Jordi Luque, Antoni Hernández-Fernández
2017 Scientific Reports  
Linguistic laws constitute one of the quantitative cornerstones of modern cognitive sciences and have been routinely investigated in written corpora, or in the equivalent transcription  ...  This means that inferences of statistical patterns of language in acoustics are biased by the arbitrary, language-dependent segmentation of the signal, and virtually precludes the possibility of making  ...  We have found for the first time that human voice manifests the analog of classical linguistic laws found in written texts (Zipf's law, Heaps' law, Menzerath-Altmann law and the law of abbreviation  ... 
doi:10.1038/srep43862 pmid:28272418 pmcid:PMC5341060 fatcat:54kblxuoxnf25m2rmcwanjvpkm

Universal Complex Structures in Written Language [article]

Alvaro Corral, Albert Diaz-Guilera U Politecnica Catalunya
2009 arXiv   pre-print
In terms of language usage, one of the most influential results is Zipf's law of word frequencies. Zipf's law appears to be universal, and may not even be unique to human language.  ...  Here we present an alternative approach that puts Zipf's law in the context of critical phenomena (the cornerstone of complexity in physics) and establishes the presence of a large scale "attraction" between  ...  The authors participate in different research projects funded by Spanish and Catalan agencies. Author Information Correspondence and requests for materials should be addressed to A.C.  ... 
arXiv:0901.2924v1 fatcat:7ojvhxdjanec5joaddxfsnixxa

Information Theory and Language

Łukasz Dębowski, Christian Bentz
2020 Entropy  
Human language is a system of communication. Communication, in turn, consists primarily of information transmission [...]  ...  Conflicts of Interest: The authors declare no conflict of interest.  ...  Acknowledgments: We express our thanks to the authors of the above contributions, the reviewers for their feedback on the manuscripts, and to the journal Entropy and MDPI for their support during this  ... 
doi:10.3390/e22040435 pmid:33286209 fatcat:me5ui7eginbsfl4663jyrprzle

Emergence of linguistic laws in human voice [article]

Ivan Gonzalez Torre, Bartolo Luque, Lucas Lacasa, Jordi Luque and Antoni Hernandez-Fernandez
2016 arXiv   pre-print
Linguistic laws constitute one of the quantitative cornerstones of modern cognitive sciences and have been routinely investigated in written corpora, or in the equivalent transcription of oral corpora.  ...  This means that inferences of statistical patterns of language in acoustics are biased by the arbitrary, language-dependent segmentation of the signal, and virtually precludes the possibility of making  ...  BL acknowledges the hospitality and support of Queen Mary University of London, where part of this research was developed, and a Salvador de Madariaga fellowship.  ... 
arXiv:1610.02736v1 fatcat:dnhxt2x4vnavrmelb33sg7dxoa

Page 228 of The Journal of Animal Ecology Vol. 76, Issue 2 [page]

2007 The Journal of Animal Ecology  
Newman, M.E.J. (2005) Power laws, Pareto distributions and Zipf’s law. Contemporary Physics, 46, 323-351. Pueyo, S. (2006) Diversity: between neutrality and structure. Oikos, 112, 392-405.  ...  . & Catalan, J. (2003) Helical Lévy walks: adjusting searching statistics to resource availability in microzooplankton. Proceedings of the National Academy of Sciences USA, 100, 12771-12775.  ... 

Visualizing Document Image Collections Using Image-Based Word Clouds [chapter]

Tomas Wilkinson, Anders Brun
2015 Lecture Notes in Computer Science  
In this paper, we introduce image-based word clouds as a novel tool for a quick and aesthetic overviews of common words in collections of digitized text manuscripts.  ...  Our new tool is not limited to any specific kind of text. We make further contributions in ways of stop-word removal, class based feature weighting and visualization.  ...  The work is done in part as a collaboration with the Swedish Museum of Natural History (Naturhistoriska riksmuseet).  ... 
doi:10.1007/978-3-319-27857-5_27 fatcat:66bxthgvbfehjhnmzhwdebjczu

The textcat Package for n-Gram Based Text Categorization in R

Kurt Hornik, Patrick Mair, Johannes Rauch, Wilhelm Geiger, Christian Buchta, Ingo Feinerer
2013 Journal of Statistical Software  
Among the wide variety of language identification methods discussed in the literature, the ones employing the Cavnar and Trenkle (1994) approach to text categorization based on character n-gram frequencies  ...  Identifying the language used will typically be the first step in most natural language processing tasks.  ...  Baayen (2008, p. 226) elaborates the problem of sample independence of Zipf's law. In fact, Ha, Hanna, Ming, and Smith (2009) propose an extension of Zipf's law for large corpora.  ... 
doi:10.18637/jss.v052.i06 fatcat:f4i2ayml3bffffhbqleyz5c6re

Page 2389 of Linguistics and Language Behavior Abstracts: LLBA Vol. 27, Issue 4 [page]

1993 Linguistics and Language Behavior Abstracts: LLBA  
’s laws, morphology role; Word Meaning Arabic Christian vocabulary, 11th-century lexicography, Ibn Sidah’s definitions comparative commentary; 9310115 Arabic early Koranic exegesis tradition, Masa’il  ...  aphasia case study; visual program therapy; 66-year-old left-handed female; 9310697 West Atlantic Languages Seereer-Siin (West Atlantic) noun classification, morphophonological analysis; 9309443 Wh Phrases Catalan  ... 

Truncated lognormal distributions and scaling in the size of naturally defined population clusters [article]

Alvaro Corral, Frederic Udina, Elsa Arcaute
2019 arXiv   pre-print
We perform a detailed study of the distributions, using state-of-the-art statistical tools. By means of scaling analysis we rule out the existence of a power-law regime in the low-population range.  ...  The logarithmic-coefficient-of-variation test allows us to establish that the power-law tail for high population, characteristic of Zipf's law, has a rather limited range of applicability.  ...  ANALYSIS AND RESULTS In order to investigate the validity of Zipf's law for the clusters of population, we consider, as in Ref.  ... 
arXiv:1910.01036v1 fatcat:2jjcztl2tbgrxgoaev4cwwmphy

On the Origin of Ambiguity in Efficient Communication

Jordi Fortuny, Bernat Corominas-Murtra
2013 Journal of Logic, Language and Information  
This leads us to a precise and general expression of the intuition behind Zipf's vocabulary balance in terms of a symmetry equation between the complexities of the coding and the decoding processes that  ...  Accordingly, the emergence of irreversible computations is required if the complexities of the coding and the decoding processes are balanced in a symmetric scenario, which means that the emergence of  ...  Acknowledgements We would like to thank the members of the Centre de Lingüística Teòrica that attended the course on ambiguity for postgraduate students we taught  ... 
doi:10.1007/s10849-013-9179-3 fatcat:ueffzsmtcvg3zh43hmqdhj764u

Can Menzerath's law be a criterion of complexity in communication?

Iván G. Torre, Łukasz Dębowski, Antoni Hernández-Fernández, Diego Raphael Amancio
2021 PLoS ONE  
In contrast, Menzerath-Altmann's law (MAL) is a precise mathematical power-law-exponential formula which expresses the expected length of the linguistic construct conditioned on the number of its constituents  ...  We show that this null model complies with Menzerath's law, revealing that Menzerath's law itself can hardly be a criterion of complexity in communication.  ...  Acknowledgments We want to dedicate this work to the memory of Professor Gabriel Altmann who humbly always encouraged us to investigate "Menzerath's law". Sit tibi terra levis, Professor Altmann.  ... 
doi:10.1371/journal.pone.0256133 pmid:34415939 pmcid:PMC8378695 fatcat:nitpmxrrqjeqnaava54ur56luy

Power law behavior associated with a Fibonacci Lucas model and generalized statistical models [article]

Aram Z. Mekjian
2009 arXiv   pre-print
This scale invariant power law parallels that seen in complex networks. The growth of the network is developed using recurrence properties of the model.  ...  numbers and also the connection of Lucas numbers to the golden mean.  ...  This work was supported by the Department of Energy under grant number DE-FG02-96ER-40987  ... 
arXiv:0910.2471v1 fatcat:eh2wvvbugraedpujcdnp2gzg4u