13 Hits in 5.5 sec

Graph visualization tool for Twittersphere users based on a high-scalable extract, transform and load system

Pablo Aragón, Íñigo García, Antonio García
2011 Proceedings of the International Conference on Web Intelligence, Mining and Semantics - WIMS '11  
Based on Nutch 1.  ...  INTRODUCTION DISTRIBUTED COMPUTATION PIPELINE DESIGN RESULTS CRAWLING MODULE METADATA EXTRACTION MODULE INDEXING MODULE GRAPH VISUALIZATION MODULE The Graph Visualization module transforms the  ... 
doi:10.1145/1988688.1988743 dblp:conf/wims/AragonGG11 fatcat:awps35akpvf4fio2tzldd3yhsu

Open challenges for data stream mining research

Georg Krempl, Myra Spiliopoulou, Jerzy Stefanowski, Indre Žliobaite, Dariusz Brzeziński, Eyke Hüllermeier, Mark Last, Vincent Lemaire, Tino Noack, Ammar Shaker, Sonja Sievi
2014 SIGKDD Explorations  
Existing solutions are based on "best practices", i.e., the systems' decisions are knowledge-driven and/or data-driven.  ...  This article presents a discussion on eight open challenges for data stream mining.  ...  on the challenges in stream mining.  ... 
doi:10.1145/2674026.2674028 fatcat:y3bozzeohveibgxb5wmiwfcogm

Tracking Dengue on Twitter Using Hybrid Filtration-Polarity and Apache Flume

Norjihan Binti Abdul Ghani, Suraya Hamid, Muneer Ahmad, Younes Saadi, N.Z. Jhanjhi, Mohammed A. Alzain, Mehedi Masud
2022 Computer systems science and engineering  
This research study proposes a sentiment analysis polarity approach for collecting data and extracting relevant information about dengue via Apache Hadoop.  ...  The method consists of two main parts: the first part collects data from social media using Apache Flume, while the second part focuses on querying and extracting relevant information via the hybrid filtration-polarity  ...  The algorithm is introduced based on the concept of data conditioning introduced by [37], which is a process for transforming any noisy and raw social media content into high-quality data based on the  ... 
doi:10.32604/csse.2022.018467 fatcat:ekjbcu5h5bbttpxvaycgzva5we

Using tweets to support disaster planning, warning and response

Peter M. Landwehr, Wei Wei, Michael Kowalchuck, Kathleen M. Carley
2016 Safety Science  
Additional support was provided by the center for Computational Analysis of Social and Organizational Systems (CASOS) at CMU.  ...  for CMU social media analytics.  ...  the transformation extracts a number of network relationships from the data.  ... 
doi:10.1016/j.ssci.2016.04.012 fatcat:6knyipdpqzb73f2t4soy5dmz5y

What do people study when they study Twitter? Classifying Twitter related academic papers

Shirley A. Williams, Melissa M. Terras, Claire Warwick
2013 Journal of Documentation  
Findings The majority of published work relating to Twitter concentrates on aspects of the messages sent and details of the users.  ...  Structured Abstract Purpose Since its introduction in 2006, messages posted to the microblogging system Twitter have provided a rich dataset for researchers, leading to the publication of over a thousand  ...  Graph visualization tool for Twittersphere users based on a high-scalable extract, transform and load system 2011 ACM International Conference Proceeding Series 13 Arakawa Y., Tagashira S., Fukuda  ... 
doi:10.1108/jd-03-2012-0027 fatcat:3rfuptki5bgxvfxumtf6ea6vxi

Program

2022 2022 International Conference on Decision Aid Sciences and Applications (DASA)  
Ten different features are extracted from these IMFs. Only Hjorth parameters (activity, mobility, complexity) are selected using the Kruskal-Wallis test.  ...  This process is time-consuming, biased, and subject-specific.  ...  Even before running Extract, Transform, Load (ETL) on parallel architectures, extensive querying and performance in a way challenges are required.  ... 
doi:10.1109/dasa54658.2022.9765271 fatcat:ttqppf4j3navnaxe653mrzmezi

D4.3 - Computational analysis report

Elena Garcia, Sara Poveda, Gemma Molero, Ashwani Malviya, Francisco Santarremigia, Maria Chiara Leva, Mary Kinahan, Luca D'Alonzo, Francesco Fabbri, Guillermo Cid, Pablo Aragón, David Laniado (+5 others)
2022 Zenodo  
on service provisions (Use cases I and III) and employment conditions (Use case IV), exploring and assessing differences in user satisfaction according to their socio-demographic characteristics, by means  ...  Analysis of social media data collected from Twitter, to get insights on the concerns of men and women regarding topics related to Use case I, II and III (Section 5) Analysis of user satisfaction surveys  ...  Based on the user name, short bio and picture, the tool returns the estimated probability of a user to be man or woman.  ... 
doi:10.5281/zenodo.6372486 fatcat:ytk75oel7va5bccibqzaxvdanm

Bot accounts Ensemble Classification on Twitter

Κωνσταντίνος Ανδρέα Γεωργίου
2021
These accounts are known as bots and are a great threat for social media.  ...  has been given in detecting multiple bot types, in a multiclass bot detection system.  ...  Graph based methodologies present an alternative view of the problem, being considerably more easily interpretable by a human and modelling the associations between accounts in a visually engaging way  ... 
doi:10.26262/heal.auth.ir.329750 fatcat:7lfs34td4zf6va7lsycs6jb4oe

Inferring social behavior and interaction on twitter by combining metadata about users & messages

Marc Cheong
2017
First, I introduce a new framework for the large-scale gathering and collation of Twitter user and message metadata.  ...  Most extant literature treats the message and user domains on Twitter independently of one another. Current research focuses only on a single domain, but rarely on both.  ...  Based on the aforementioned observations, in essence, the GenderFromName algorithm is a highly-scalable and efficient method for determining the genders of Twitter users.  ... 
doi:10.4225/03/58b5009d3726a fatcat:x7xi2espbbd73jlg4s45hifclu

Analyzing and improving diversification, privacy, and information management on the web [article]

Kaweh Djafari Naini, Leibniz Universität Hannover
2019
First, we present an efficient and scalable algorithm for web search results diversification for large-scale retrieval systems.  ...  Web search queries often contain only a few terms, and can be ambiguous, which is a core issue for retrieval systems.  ...  In [SO13] the authors extract features based on the usage and network properties of the users for predicting the users motives for using Facebook.  ... 
doi:10.15488/4313 fatcat:olobgfrs2ra4hp65xmdga7376e

TWITTER AND SOCIETY PETER LANG Library of Congress Cataloging-in-Publication Data

Axel Weller, Jean Burgess, & Merja, Puschmann, Steve Jones, New York, Washington, D Baltimore, Bern, Frankfurt Berlin, Brussels, Vienna (+14 others)
2014 unpublished
Die Deutsche Nationalbibliothek lists this publication in the "Deutsche Nationalbibliografie"; detailed bibliographic data is available on the Internet at http://dnb.d-nb.de/.  ...  There are many tools and methods for extracting information from text, using both rule-based and statistical techniques.  ...  . 140kIt 140kit 2 is a Web-based tool for the analysis of Twitter data.  ... 
fatcat:qmtuv4tuffb45iv5csnb6p6xw4

Architecture in the Data-driven City

Frederick Peter Ortner
2019
Pentland writes: "The key to citizen involvement in management of a data-rich city is visualizing the data.."  ...  Social Physics. 2014. p. 141. 12 Mattern, Instrumental City p.10/27 13 Picon, Smart Cities p.105 14 The connection between neo-malthusian predictions of ecological doom and systems thinking have an intriguing  ...  It should, however, demonstrate that the designer's analytic work solving a design problem and advocating for a design often produces intellectual fruit which have value above and beyond the creation of  ... 
doi:10.5075/epfl-thesis-9554 fatcat:mpq7n242ijgw5o3aq6hn5yf7he

Scanning the Science-Society Horizon [article]

Brenda Moon, The Australian National University
2016
An open source data gathering tool for Twitter data was developed and used to collect a dataset from Twitter with the keyword 'science' during 2011.  ...  The number of users per day followed a similar pattern, and most of these users did not use the word 'science' often on Twitter.  ...  Termite is an active research project by Jason Chuang and Ashley Jin, a 'visual analysis tool for inspecting the output of statistical topic models', which is open source and available on GitHub 21 (Chuang  ... 
doi:10.25911/5d6664e8354b8 fatcat:jmgtblj2n5e6xosu7ue3sokami