A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Filters
Human Activity Analysis and Prediction Using Google n-Grams
2018
International Journal of Future Computer and Communication
Google-n grams are generated from millions of books between 1500 to 2000 which can be an indicator for human specific feature and behavior. ...
In this paper human specific main activities are analyzed and the human activities in near feature are predicted via Google n-grams and the functions which are generated via using n-grams. ...
Because of google n-grams contain different language datasets; comparison of semantic similarity for different languages [7] is studied using google n-grams. ...
doi:10.18178/ijfcc.2018.7.2.516
fatcat:whsqrrfxtbbujd7xe2ax3pll2a
Exploring mega-corpora: Google Ngram Viewer and the Corpus of Historical American English
2014
EuroAmerican Journal of Applied Linguistics and Languages
EN The creation of internet-based mega-corpora such as the Corpus of Contemporary American English (COCA), the Corpus of Historical American English (COHA) (Davies, 2011a) and the Google Ngram Viewer ( ...
Once the n-grams are saved as raw data in Excel files, the n--grams can then be searched using the online viewer. ...
--The viewer is composed of n--grams, from 1--gram to 5--grams. N--grams represent how many words are in a lexical bundle: 1--gram = 1 word, while 5--grams = 5 words (Bohannon, 2011) . ...
doi:10.21283/2376905x.1.4
fatcat:7immdiacibao7ghyu7d3rld5ye
Social evidence of a changing climate: Google Ngram data points to early climate change impact on human society
2015
Weather
Figure 1 : 1 Annual relative frequency of use of the n-gram 'earthquake' in English language books 1800-2000. ...
GLS models adjusted for autocorrelation. * indicates significance at α=0.05. ☼ indicates a comparatively closer relationship between n-gram and temperature than between Paleo Index and temperature. ...
doi:10.1002/wea.2504
fatcat:gruhdhr625gdblag4ljpu52ire
Page 719 of Linguistics and Language Behavior Abstracts: LLBA Vol. 29, Issue 2
[page]
1995
Linguistics and Language Behavior Abstracts: LLBA
Objective composite test scores & case profile information for children (N = 92) with developmental language impairment were rated by speech-language pathologists (N = 27) with an average of 13 years expe ...
A survey of Swiss speech therapists (N = 30) indicates that users do not always make full use of the program. ...
Quantifying paradigm change in demography
2014
Demographic Research
METHODS The presented analysis is descriptive and is based on a series of simple measures obtained from the free online tool Google Books Ngram Viewer, which includes frequencies of word groupings (n-grams ...
The free Google Books Ngram Viewer tool (http://books.google.com/ngrams) analyses frequencies of words, and phrases of a given length of n words (called n-grams or ngrams) for n≤5, amongst all words or ...
Even though the caveats about the disparity between numbers and frequencies remain in force, the ratio of the number of n-grams with "analysis" to the ones with "theory" is clearly increasing, as indicated ...
doi:10.4054/demres.2014.30.32
fatcat:xaycvb2uyrht7iz226a5c7fhl4
Online Film Subtitles As A Corpus: An Ngram-Based Approach
2017
Zenodo
A series of quantitative analyses based of n-gram frequencies demonstrate that subtitles are not fundamentally different from other registers of English and that they represent a close approximation of ...
Namely, the language of subtitles is more emotional and dynamic, but less spontaneous, vague and narrative than that of normally occurring conversations. ...
These questions will be answered with the help of an n-gram approach. ...
doi:10.5281/zenodo.582336
fatcat:zgrrly4n5ngxpcxeukuou4pnda
Mapping and Analysis of Standard Indonesian Pronunciation Errors by Using the Bigram Method
2022
INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT (ICORAD)
Indonesian language is increasingly being ignored, even the mass media often find the use of non-standard language, so there is a uniformity in the use of words that often appear in scientific articles ...
The uniformity of Indonesian pronunciation certainly confuses the general public, for example: television news viewers and radio listeners, to distinguish between standard and non-standard forms. ...
N-Gram Language Model Conceptually, the n-gram model is an estimate of the probability of a word or character from a history of previous occurrences (Jurafsky & Martin, 2018) . ...
doi:10.47841/icorad.v1i1.16
fatcat:yoelksysgjgfnhjyhfimmmpci4
Culturomics on a Bengali Newspaper Corpus
2012
2012 International Conference on Asian Language Processing
To the best of our knowledge, this is the first time a culturomic trend analysis is being performed on an Indic language. ...
We plan to implement a Bengali n-gram viewer in the same spirit as Google n-gram viewer. To accomplish this idea, we need a comprehensive database of all Bengali n-grams used in the 132 months. ...
CONCLUSION In this paper we introduced, for the first time, a culturomic study on an Indic language (Bengali). ...
doi:10.1109/ialp.2012.68
dblp:conf/ialp/PhaniLB12
fatcat:pj47peuhyfcghmmdjbbpn7kq2i
Google Books Ngram Viewer in Socio-Cultural Research
2018
Research in Language
The objective of this paper is to verify if Google Books Ngram Viewer, a new tool working on a database of 361 billion words in English, and enabling quick recovery of data on word frequency in a diachronic ...
Michel et al. (2010) define the 1-gram as "a string of characters uninterrupted by a space" and an n-gram as "a sequence of 1-grams, such as the phrases 'stock market' (a 2gram) and 'the United States ...
The study was limited to the analysis of frequency of a given 1-gram, which might be understood as a single lexical unit, or an n-gram (a series of lexical units) over time, but occurring at least 40 times ...
doi:10.2478/rela-2018-0015
fatcat:e7mcarowbnc2pispabo6sy27xq
Evolution of the most common English words and phrases over the centuries
2012
Journal of the Royal Society Interface
Along with the steady growth of the English lexicon, this provides an empirical explanation for the ubiquity of the Zipf's law in language statistics and confirms that writing, although undoubtedly an ...
An n-gram is made up of a series of n 1-grams, and a 1-gram is a string of characters uninterrupted by a space. ...
Tables listing the top 100, top 1000 and top 10.000 n-grams for all available years since 1520 inclusive, along with their yearly usage frequencies and direct links to the Google Books Ngram Viewer, are ...
doi:10.1098/rsif.2012.0491
pmid:22832364
pmcid:PMC3481586
fatcat:lovxccsmfvczrg3an6qljfm634
Storywrangler: A massive exploratorium for sociolinguistic, cultural, socioeconomic, and political timelines using Twitter
[article]
2021
arXiv
pre-print
We make the data set available through an interactive time series viewer, and as downloadable time series and daily distributions. ...
Here, we describe Storywrangler, a natural language processing instrument designed to carry out an ongoing, day-scale curation of over 100 billion tweets containing roughly 1 trillion 1-grams from 2008 ...
Notation and Measures We write an n-gram by τ and a day's lexicon for language -the set of distinct n-grams found in all tweets (AT) for a given date t-by D t, ;n . ...
arXiv:2007.12988v4
fatcat:n3yir6h73zdvnky5l7bbdsqphe
In Tandem or Out of Sync? Academic Economics Research and Public Policy Measures
2014
Social Science Research Network
In addition, frequency analyses of these same terms are gathered from other English language based text sources (in particular, Google's Ngram Viewer) to determine if discussion of them in the wider public ...
In this instance as well, academic attention to certain policy related terms mirrors quite strongly attention to those terms in the wider English-language literature. 11 Lagging the Ngram Viewer results ...
doi:10.2139/ssrn.2516978
fatcat:zt2mdznerzdcxnxs5d5j2pj6uq
Violence Rating Prediction from Movie Scripts
2019
PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE
Violent content in movies can influence viewers' perception of the society. ...
To date, we are the first to show the language used in movie scripts is a strong indicator of violent content. This offers novel computational tools to assist in creating awareness of storytelling. ...
Our features can be divided into five categories: N-grams, Linguistic and Lexical, Sentiment, Abusive Language and Distributional Semantics. ...
doi:10.1609/aaai.v33i01.3301671
fatcat:5k3eccn5hbbgrgco3zmblkcvs4
Quantitative Analysis of Suffix Variability of Comparative Adjectives in Russian
2019
Symposium on Languages, Applications and Technologies
The authors concluded that there is no previously anticipated influence of phonetic and morphological factors on the choice of the suffix of an adjective in a bookish speech. ...
of 1-grams). ...
An example of this case is shown in figure 2 . Like in the Google Books Ngram Viewer service, a moving average with a window +-3 years (the window length is 7 years) is used. ...
doi:10.4230/oasics.slate.2019.21
dblp:conf/slate/GaleevB19
fatcat:mp66fuspnfeipf2yzi4mussr6q
Gender inequality and female body language in children's literature
2020
Digital Scholarship in the Humanities
With an exploratory case study of gendered body language in children's literature, we illustrate the relationship between quantitative and qualitative analysis. ...
The case study is focused on female body language descriptions and how the presentation of body language has changed over time. ...
Her hands on her hips in the Google N-gram Viewer.
Fig. 2 . 2 Fig. 2. Female BP s network in ChiLit (19th century).Fig. 3. Female BP s network in OCC (contemporary data). ...
doi:10.1093/llc/fqaa051
fatcat:ujxuwlsnffeaxikxgascmodpum
« Previous
Showing results 1 — 15 out of 3,091 results