Geographical Latent Variable Models for Microblog Retrieval [chapter]

Alexander Kotov, Vineeth Rakesh, Eugene Agichtein, Chandan K. Reddy
2015 Lecture Notes in Computer Science  
In particular, we experimentally compare the retrieval effectiveness of four geographical latent variable models: two geographical variants of post-hoc LDA, latent variable model without hidden topics  ...  geographical regions, little is known about their utility for information retrieval in general or microblog retrieval in particular.  ...  Geographical latent variable models: Post-hoc geographical variants of LDA. We use retrieval methods based on two post-hoc geographical variants of Latent Dirichlet Allocation (LDA) [2], a popular topic  ... 
doi:10.1007/978-3-319-16354-3_70 fatcat:lclga4ijovf45hd64jkemouq34

CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background

Lirong Qiu, Jia Yu
2018 Complexity  
In this paper, we propose Combining Latent Dirichlet Allocation (CLDA), a new topic model that can learn the potential topics of microblog short texts and long texts simultaneously.  ...  Experimental results on a real microblog data set show that CLDA outperforms many advanced models in mining user interest, and we confirm that CLDA also performs well in recommender systems  ...  retrieval tools.  ... 
doi:10.1155/2018/2503816 fatcat:ydushel7kzbpfhzmafc76w7il4

The Million Musical Tweet Dataset - What We Can Learn From Microblogs

David Hauger, Markus Schedl, Andrej Kosir, Marko Tkalcic
2013 Zenodo  
To get comparable local time (which is not directly provided by Twitter) we used GeoNames for retrieving the time zones for the geographic coordinates.  ...  Visualizing genres / latent factors for Brazil.  ...  Figure 6. Visualizing genres / latent factors for France.  ... 
doi:10.5281/zenodo.1417648 fatcat:7folsgmauvgo7nuldietyko2fi

Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey [article]

Hamed Jelodar, Yongli Wang, Chi Yuan, Xia Feng, Xiahui Jiang, Yanchao Li, Liang Zhao
2018 arXiv   pre-print
There are various methods for topic modeling, of which Latent Dirichlet Allocation (LDA) is one of the most popular.  ...  Topic modeling is one of the most powerful techniques in text mining for data mining, latent data discovery, and finding relationships among data and text documents.  ...  Acknowledgements This article has been awarded by the National Natural Science Foundation of China (61170035, 61272420, 81674099, 61502233), the Fundamental Research Fund for the Central Universities (  ... 
arXiv:1711.04305v2 fatcat:jzsx6owjyjfo3gkbohrc2ggkzq

How Is the Mobile Internet Different? Search Costs and Local Activities

Anindya Ghose, Avi Goldfarb, Sang Pil Han
2013 Information systems research  
geographically close matches is higher on mobile phones: stores located in close proximity to a user's home are much more likely to be clicked on mobile phones.  ...  Using data on user behavior at a (Twitter-like) microblogging service, we exploit exogenous variation in the ranking mechanism of posts to identify the ranking effects.  ...  Econometric Model: Our model consists of two distinct levels: (1) a post-level latent utility model and (2) a population-level  ...  Table 2. Notations and Variable Descriptions  ... 
doi:10.1287/isre.1120.0453 fatcat:chbnp3ttmzhdxblsj5ttrk77da

Query Expansion for Microblog Retrieval Focusing on an Ensemble of Features

Abu Nowshed Chy, Md Zia Ullah, Masaki Aono
2019 Journal of Information Processing  
Upon retrieving tweets by our proposed topic modeling based query expansion, we utilize the pseudo-relevance feedback and a new temporal relatedness approach to select the candidate tweets.  ...  However, finding good expansion terms for a given query is a challenging task.  ...  We estimate the MeanDecreaseGini, a measure of variable importance in random forest model.  ... 
doi:10.2197/ipsjjip.27.61 fatcat:n3q4l6tmn5fh7ppdjvqpbcmjt4

Harvesting microblogs for contextual music similarity estimation: a co-occurrence-based framework

Markus Schedl, David Hauger, Julián Urbano
2013 Multimedia Systems  
As evaluation criteria we use precision and recall in an artist retrieval task as well as rank proximity.  ...  As music plays an important role in many human lives, microblogs on music-related activities are available in abundance.  ...  This version of the tf·idf model proved particularly beneficial for modeling pre-filtered music-related microblogs [31].  ... 
doi:10.1007/s00530-013-0321-5 fatcat:dm7hmwwg5feclhryi74i4gsdlu

A Small Survey On Event Detection Using Twitter [article]

Debanjan Datta
2022 arXiv   pre-print
Bayesian Mixture Models: Latent Event Model (LEM) [51]: the authors propose a generative model with a latent variable to detect events.  ...  The latent variable is the event membership of individual documents.  ... 
arXiv:2011.05801v2 fatcat:gtrtxzgju5akzho3a2z4pq33gu

Recent Research Advances on Interactive Machine Learning [article]

Liu Jiang, Shixia Liu, Changjian Chen
2018 arXiv   pre-print
We conclude the survey with a discussion of open challenges and research opportunities that we believe are inspiring for future work in IML.  ...  years have witnessed the proliferation of IML in the field of visual analytics, most recent surveys either focus on a specific area of IML or aim to summarize a visualization field that is too generic for  ...  propose MutualRanker to interactively retrieve salient data from microblogs and address the uncertainty introduced by the ranking model.  ... 
arXiv:1811.04548v1 fatcat:4pihx2imurd2lj7hc524uiyafi

Identifying Regional Dialects in On-Line Social Media [chapter]

Jacob Eisenstein
2018 The Handbook of Dialectology  
The unprecedented scale of this data enables the application of quantitative methods to automatically discover the lexical variables that distinguish the language of geographical areas such as cities.  ...  This can be paired with the segmentation of geographical space into dialect regions, within the context of a single joint statistical model, thus simultaneously identifying coherent dialect regions and  ...  Acknowledgments Thanks to Brendan O'Connor for providing the data on which this chapter is based, and for many insightful conversations over a fun and productive long-term collaboration on this research  ... 
doi:10.1002/9781118827628.ch21 fatcat:5pm7lx53pnalxodq75nt5jyori

Visualization of Clandestine Labs from Seizure Reports: Thematic Mapping and Data Mining Research Directions [article]

William Hsu, Mohammed Abduljabbar, Ryuichi Osuga, Max Lu, Wesam Elshamy
2015 arXiv   pre-print
We describe an experimental test bed for event mapping that uses this end-to-end information retrieval system, and report preliminary results on a geoinformatics problem: tracking of methamphetamine lab  ...  We develop a static, finite topic model and examine the potential benefits and feasibility of extending this to dynamic topic modeling with a large number of topics and continuous time.  ...  This also holds for microblogs and other social media.  ... 
arXiv:1503.01549v1 fatcat:5usymj3c6zhn7jsf426s2mic6a

Online Social Networks Event Detection: A Survey [chapter]

Mário Cordeiro, João Gama
2016 Lecture Notes in Computer Science  
[61] presented a model for retrieving microblog posts that is enhanced with textual and microblog-specific quality indicators and with a dynamic query expansion model.  ...  The approaches used were based on latent variable models inspired by modeling selectional preferences, and unsupervised information extraction.  ... 
doi:10.1007/978-3-319-41706-6_1 fatcat:zdoso55jzjbypkye7w3uozcjce

When and Where?: Behavior Dominant Location Forecasting with Micro-Blog Streams

Bhaskar Gautam, Annappa Basava, Abhishek Singh, Amit Agrawal
2018 2018 IEEE International Conference on Data Mining Workshops (ICDMW)  
Our proposed algorithm is based on the dynamic formation of collective personality communities using different languages, opinions, geographical and temporal distributions for finding out optimized equivalent  ...  as a predictive classification model.  ...  These contemporary input features, along with the algorithmic model, are indexed below: 1) personality traits, weekdays, hours and distributed embedding for each variable, 2) personality traits, weekdays, hours  ... 
doi:10.1109/icdmw.2018.00169 dblp:conf/icdm/GautamBSA18 fatcat:dtbuei4b6rcu5o235b33tnqrse

What Does Twitter Say About Self-Regulated Learning? Mapping Tweets From 2011 to 2021

Mohammad Khalil, Gleb Belokrys
2022 Frontiers in Psychology  
For topic modeling, the text mining technique of Latent Dirichlet allocation (LDA) was applied and revealed insights on computationally processed topics.  ...  This work uses three main analysis methods, descriptive, topic modeling, and geocoding analysis.  ...  In addition, we are extremely grateful to the two reviewers for their constructive comments which have significantly improved this work.  ... 
doi:10.3389/fpsyg.2022.820813 pmid:35282232 pmcid:PMC8907480 fatcat:dsd32rckqbb7rpvslkgrnfadbm

Microblog Retrieval Using Ensemble of Feature Sets through Supervised Feature Selection

Abu Nowshed CHY, Md Zia ULLAH, Masaki AONO
2017 IEICE transactions on information and systems  
People usually search microblog posts for real-time information need [1], therefore recency is considered as an important temporal property for retrieving relevant tweets [5], [11]-[13].  ...  Modern and representative retrieval models, including Inverse Document Frequency (IDF), Okapi BM25, Language Model, Vector Space Model, Probability Ranking Principle (PRP), etc. also utilized by several  ...  Acknowledgments This research was supported by JSPS Grant-in-Aid for Scientific Research (B) 26280038.  ... 
doi:10.1587/transinf.2016dap0032 fatcat:syjywutdajbdhdq5x6vtwvzf34