Filters








25 Hits in 5.2 sec

MB-ToT: An Effective Model for Topic Mining in Microblogs

Shaopeng Liu, Jian Yin, Jia Ouyang, Yun Huang, Piyuan Lin
2014 Applied Mathematics & Information Sciences  
Topic mining on microblogging sites with sheer scale of instance messages and social network information, such as Twitter, is a hard and challenging problem.  ...  Firstly, we assume each topic is a mixture distribution influenced by both word co-occurrences and timestamps of microblogs. This allows MB-ToT to capture the changes of each topic over time.  ...  Sun, Large scale microblog mining using distributed MB-LDA, Proceedings of the 21st international conference companion on World Wide Web, New York: ACM Press, 1035-1042 (2012). [9] X. Meng, F.  ... 
doi:10.12785/amis/080137 fatcat:knw4rqmpozczpimrgb7oftso4a

CLDA: An Effective Topic Model for Mining User Interest Preference under Big Data Background

Lirong Qiu, Jia Yu
2018 Complexity  
LDA is an effective method of text mining, but it will not play a very good role in applying LDA directly to a large number of short texts in microblog.  ...  We mainly use a large number of user text data from microblog to study.  ...  MB-LDA: it is a generative model based on LDA for theme mining on Weibo [31, 32] .  ... 
doi:10.1155/2018/2503816 fatcat:ydushel7kzbpfhzmafc76w7il4

TM-ToT: An Effective Model for Topic Mining from the Tibetan Messages [chapter]

Chengxu Ye, Wushao Wen, Ping Yang
2014 Communications in Computer and Information Science  
The microblog platforms, such as Weibo, now accumulate a large scale of data including the Tibetan messages.  ...  In our experiments, TM-ToT outperforms Twitter-LDA by a large margin in terms of perplexity. Furthermore, the quality of the generated latent topics of TM-ToT is promising.  ...  [7] introduced a novel probabilistic generative model MicroBlog-Latent Dirichlet Allocation (MB-LDA) for large scale microblog mining.  ... 
doi:10.1007/978-3-662-45924-9_40 fatcat:d7iq4d7s2ze75nsy5yyysjaqti

Design and Implementation of Parallelized LDA Topic Model Based on MapReduce

Duan-wu YAN, Tie-jun LI, Xiong-fei YANG, Kun CHEN
2018 DEStech Transactions on Computer Science and Engineering  
In order to solve theefficiency bottlenecksof non-parallel LDA topic model while processing large-scale text datasets,a parallel LDA topic model computing framework based on MapReduce is designed and implemented  ...  Performance testing of the parallel LDA topic modelis also conducted by using the bibliographic sample data of articles and patents.Experiment shows that, the parallel LDA topic analysis process based  ...  Zhang Chenyi et al [3] proposed a microblog generation model MB-LDA which based on LDA for topic mining of microblog texts.  ... 
doi:10.12783/dtcse/ccnt2018/24712 fatcat:lr5u3o3m55dwdlaoz3f3tdkyr4

Heat Analysis of Entrepreneurial Hotspots

Lin FENG, A-dong KONG, Hai-zhuo LIN, Sheng-lan LIU
2017 DEStech Transactions on Computer Science and Engineering  
of microblog for topic mining [11] .  ...  Nowadays, LDA model has been widely used, for example, Chen-yi Zhang et al. proposed the microblog generation model MB-LDA which took into account the contact association and text association relationship  ... 
doi:10.12783/dtcse/cst2017/12543 fatcat:fdzdsahezvaolnvjmw3wduydry

Weibo clustering: A new approach utilizing users' reposting data in social networking services

Guangzhi Zhang, Yunchuan Sun, Mengling Xu, Rongfang Bie
2014 Computer Science and Information Systems  
A modified model called "MB-LDA" is proposed on topic mining in [27] , which introduces the "@" and "RT" (Retweet, Repost) into the LDA model to mine the latent relations in the conversations whose test  ...  Although the experimental results show this method works well on large-scale microblog dataset, the small length of news in microblog cannot ensure completeness of the whole event.  ... 
doi:10.2298/csis130927070z fatcat:mddm5i4drjfl3mdlm43ah4wucq

A Prerecognition Model for Hot Topic Discovery Based on Microblogging Data

Tongyu Zhu, Jianjun Yu
2014 The Scientific World Journal  
When adopting topic detection and tracking techniques to find hot topics with streamed microblogging data, it will meet obstacles like streamed microblogging data clustering, topic hotness definition,  ...  The microblogging is prevailing since its easy and anonymous information sharing at Internet, which also brings the issue of dispersing negative topics, or even rumors.  ...  And then a parallel clustering algorithm should be provided to process large scale of posts.  ... 
doi:10.1155/2014/360934 pmid:25254235 pmcid:PMC4164368 fatcat:w723pmih3jhxzi5gmrtdhf4ihq

Extracting Actionable Knowledge from Domestic Violence Discourses on Social Media [article]

Sudha Subramani, Manjula O'Connor
2018 arXiv   pre-print
But, it is difficult to mine the actionable knowledge from large conversational datasets from social media due to the characteristics of high dimensions, short, noisy, huge volume, high velocity, and so  ...  But, it is challenging for the information retrieval from the large streams of microblogs of its following characteristics: • Immense scale of volume, fast data arriving rate and their unique characteristics  ...  On variation of LDA, online-LDA [41] , Dynamic topic models [40] , labelled LDA [42] used probabilistic topic models as a baseline model to detect events in twitter streams.  ... 
arXiv:1807.02391v1 fatcat:sg7uaofqdzddjd3derap23lyie

Extracting Actionable Knowledge from Domestic Violence Discourses on Social Media

Sudha Subramani, Manjula O'Connor
2018 EAI Endorsed Transactions on Scalable Information Systems  
But, it is difficult to mine the actionable knowledge from large conversational datasets from social media due to the characteristics of high dimensions, short, noisy, huge volume, high velocity, and so  ...  But, it is challenging for the information retrieval from the large streams of microblogs of its following characteristics: • Immense scale of volume, fast data arriving rate and their unique characteristics  ...  On variation of LDA, online-LDA [41] , Dynamic topic models [40] , labelled LDA [42] used probabilistic topic models as a baseline model to detect events in twitter streams.  ... 
doi:10.4108/eai.29-5-2018.154807 fatcat:kvgaqlzvxnc7xou4upc5tmrehq

Constructing Topic Models of Internet of Things for Information Processing

Jie Xin, Zhiming Cui, Shukui Zhang, Tianxu He, Chunhua Li, Haojing Huang
2014 The Scientific World Journal  
Therefore, constructing high quality topic hierarchies that can capture the term distribution of each product record enables us to better understand users' search intent and benefits tasks such as taxonomy  ...  MB-LDA [12] is designed for topic mining of microblogs or tweets, which takes both contact relation and document relation into consideration.  ...  We make use of two large-scale datasets for the evaluation of our model, both of which are extracted from real-world data sources.  ... 
doi:10.1155/2014/675234 pmid:25110737 pmcid:PMC4119721 fatcat:4hxligtscnhdtjc7uibqyqdqom

A big data methodology for categorising technical support requests using Hadoop and Mahout

Arantxa Duque Barrachina, Aisling O'Driscoll
2014 Journal of Big Data  
proposes a Proof of Concept (PoC) end to end solution that utilises the Hadoop programming model, extended ecosystem and the Mahout Big Data Analytics library for categorising similar support calls for large  ...  While research associated with machine learning algorithms is well established, research on big data analytics and large scale distributed machine learning is very much in its infancy with libraries such  ...  Research design and methodology The proposed solution provides an end to end solution for conducting large scale analysis of technical support data using the open source Hadoop platform, components of  ... 
doi:10.1186/2196-1115-1-1 fatcat:2xhb6n5a5fcghe5egy3baow3py

Bursty event detection from microblog: a distributed and incremental approach

Jianxin Li, Jianfeng Wen, Zhenying Tai, Richong Zhang, Weiren Yu
2015 Concurrency and Computation  
In this paper, we propose a distributed and incremental temporal topic model for microblogs called Bursty Event dEtection (BEE+).  ...  With the rise in popularity and size of microblogs, there is a need for distributed approaches that can detect bursty event with low latency from the short-text data stream.  ...  INTRODUCTION Bursty Event dEtection (BEE+) from microblogs is now a hot area in the field of data mining and knowledge discovery, because the information posted in microblogs is real time and often event  ... 
doi:10.1002/cpe.3657 fatcat:uhwzwh6iqngc5pgrqr23bc623y

Open challenges for data stream mining research

Georg Krempl, Myra Spiliopoulou, Jerzy Stefanowski, Indre Žliobaite, Dariusz Brzeziński, Eyke Hüllermeier, Mark Last, Vincent Lemaire, Tino Noack, Ammar Shaker, Sonja Sievi
2014 SIGKDD Explorations  
This article presents a discussion on eight open challenges for data stream mining.  ...  The resulting analysis is illustrated by practical applications and provides general suggestions concerning lines of future research in data stream mining.  ...  to thank the participants of the RealStream2013 workshop at ECMLPKDD2013 in Prague, and in particular Bernhard Pfahringer and George Forman, for suggestions and discussions on the challenges in stream mining  ... 
doi:10.1145/2674026.2674028 fatcat:y3bozzeohveibgxb5wmiwfcogm

Topic-Level Opinion Influence Model(TOIM): An Investigation Using Tencent Micro-Blogging [article]

Daifeng Li, Ying Ding, Xin Shuai, Golden Guo-zheng Sun, Jie Tang, Zhipeng Luo, Jingwei Zhang, Guo Zhang
2012 arXiv   pre-print
To evaluate and test this proposed model, an experiment was designed and a sub-dataset from Tencent Micro-Blogging was used.  ...  Mining user opinion from Micro-Blogging has been extensively studied on the most popular social networking sites such as Twitter and Facebook in the U.S., but few studies have been done on Micro-Blogging  ...  Topic Model based Sentiment Analysis and Opinion Detection Since the introduction of LDA model [23] , various extended LDA models have been proposed for topic extraction from large-scale corpora.  ... 
arXiv:1210.6497v1 fatcat:rjp3g6jxwvemtejwgiox2zuu5m

Entropy-Enhanced Attention Model for Explanation Recommendation

Yongjie Yan, Guang Yu, Xiangbin Yan
2022 Entropy  
The loss function of the model is used to realize the interpretability of recommendation systems.  ...  Most of the existing recommendation systems using deep learning are based on the method of RNN (Recurrent Neural Network).  ...  Although it improves the effect of recommendation, the problem of data sparsity will gradually appear when the data scale is large enough.  ... 
doi:10.3390/e24040535 pmid:35455199 pmcid:PMC9028415 fatcat:meao2xfwzjcudf3w6ocbug2xs4
« Previous Showing results 1 — 15 out of 25 results