
Automatic geotagging of Russian web sites

Alexei Pyalling, Michael Maslov, Pavel Braslavski
2006 Proceedings of the 15th international conference on World Wide Web - WWW '06  
The poster describes a fast, simple, yet accurate method to associate large amounts of web resources stored in a search engine database with geographic locations.  ...  Experiments ran on Yandex search engine index; evaluation has proved the efficacy of the approach.  ...  The authors would like to thank the editors of the Yandex directory who took the trouble to manually assess the results of the automatic classification.  ... 
doi:10.1145/1135777.1135967 dblp:conf/www/PyallingMB06 fatcat:p3tm74jtwfcorlgaldwvzvqlji

Digital Imaginations Of National Parks In Different Social Media: A Data Exploration

Vuokko Heikinheimo, Henrikki Tenkanen, Tuomo Hiippala, Tuuli Toivonen
2018 Zenodo  
Noteworthy, almost 50% of Twitter users had shared their geotagged national park post to Twitter via Instagram.  ...  Language identification from text content and content analysis of images provide basis for further exploration of the digital representations of national parks and place-related experiences of visitors  ...  It is also important to acknowledge that only a small percentage of all social media content is geotagged (it is estimated that 1% of all tweets are geotagged).  ... 
doi:10.5281/zenodo.1472744 fatcat:nyusvrb7w5feve6nizuqghxcgm

Mapping the languages of Twitter in Finland:

Tuomo Hiippala, Tuomas Väisänen, Tuuli Toivonen, Olle Järv
2020 Neuphilologische Mitteilungen  
To cope with the high volume of social media data, we use automatic language identification and place of residence detection.  ...  We collect a corpus of 38 million Twitter messages with two million geographical coordinates to map the languages used across Finland at the level of regions and municipalities.  ...  Helsinki's status as the largest city in Finland is reflected in the high number of observations compared to other sites (n = 15 602).  ... 
doi:10.51814/nm.99996 fatcat:bhpsyaa5gng6zm7awi57ijqrhy

Social media data for conservation science: A methodological overview

Tuuli Toivonen, Vuokko Heikinheimo, Christoph Fink, Anna Hausmann, Tuomo Hiippala, Olle Järv, Henrikki Tenkanen, Enrico Di Minin
2019 Biological Conservation  
analyses, (iii) exemplify the potential of these approaches for real-world conservation challenges, and (iv) discuss the limitations of social media data analysis in conservation science.  ...  Specifically, we (i) describe what kind of relevant information can be retrieved from social media platforms, (ii) provide a detailed overview of advanced methods for spatio-temporal, content and network  ...  Web scraping, or web crawling, is an approach for downloading and extracting data from web pages using an automated script.  ... 
doi:10.1016/j.biocon.2019.01.023 fatcat:nxs2oh6cfbaeln2jiyg4zg5uby


2010 Issues in Information Systems  
Perhaps one of the most potentially dangerous activities on the Internet is keeping in touch with others via social media, or social networking sites.  ...  This exploratory study examined personally identifiable information (PII), of photos uploaded to TwitPic. A total of 417,056 images were downloaded and processed for the study.  ...  Social network sites "not only allow users to create personal information spaces which are easily accessible from anywhere on the Web, but also gives them the tools to share their personal artifacts with  ... 
doi:10.48009/1_iis_2010_432-438 fatcat:ivxhcfpiifh2nect4cx7xcu2xi

A Python Library for Exploratory Data Analysis on Twitter Data based on Tokens and Aggregated Origin-Destination Information [article]

Mario Graff and Daniela Moctezuma and Sabino Miranda-Jiménez and Eric S. Tellez
2021 arXiv pre-print
, and Russian languages.  ...  In summary, the Python library presented is applied to different domains and retrieves a plethora of information in terms of frequencies by day of words and bi-grams of words for Arabic, English, Spanish  ...  The idea of automatically finding the groups is explored using the well-known k-means algorithm.  ... 
arXiv:2009.01826v3 fatcat:awfqvsmbszcdhbigq5ng3jjxii

Determining the spatial reader scopes of news sources using local lexicons

Gianluca Quercini, Hanan Samet, Jagan Sankaranarayanan, Michael D. Lieberman
2010 Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems - GIS '10  
Techniques to automatically generate the local lexicon of a location by using the link structure of Wikipedia are described and evaluated.  ...  ., Web versions of newspapers) usually have an implicit spatial reader scope, which is the geographical location for which the content has been primarily produced.  ...  For example, a person living in Washington, DC often turns to the web site of the Washington Post, as it is the most prominent news source in the area.  ... 
doi:10.1145/1869790.1869800 dblp:conf/gis/QuerciniSSL10 fatcat:342d4vh2afcmrn6tfxkmouvufm

Development of Organizing and Economic Measures for Monitoring the Cost of Construction Resources

Dmitriy Silka, V. Murgul
2017 MATEC Web of Conferences  
Market pricing depends on a huge list of factors and in each case is carried out in a wide range of initial values with high probability assumptions.  ...  This approach is based on a set of organizational and economic mechanisms allowing tracking not potential, but real purchase and sale of construction resources.  ...  delivery; -Geotag of a supplier's location (geographic coordinates in the form corresponding to the format of the international standards). 6.3.  ... 
doi:10.1051/matecconf/201710608042 fatcat:k376fvv7bzai3cwsrkblwnql2u

Spatiotemporal Filtering Pipeline for Efficient Social Networks Data Processing Algorithms [chapter]

Ksenia Mukhina, Alexander Visheratin, Denis Nasonov
2020 Lecture Notes in Computer Science  
One of the areas that gathers momentum is the investigation of location-based social networks (LBSNs) because the understanding of citizens' behavior on various scales can help to improve quality of living  ...  The main aim of this paper is helping LBSN researchers to perform a preliminary step of data preprocessing and thus increase the efficiency of their algorithms.  ...  This research is financially supported by The Russian Science Foundation, Agreement #18-71-00149.  ... 
doi:10.1007/978-3-030-50433-5_7 fatcat:t6rf5be54febjiwia2qsc5fzae

A Pragmatic Guide to Geoparsing Evaluation [article]

Milan Gritta, Mohammad Taher Pilehvar, Nigel Collier
2019 arXiv pre-print
In addition to fine-grained Geotagging and Toponym Resolution (Geocoding), this dataset is also suitable for prototyping and evaluating machine learning NLP models.  ...  Evaluation is further made inconsistent, even unrepresentative of real-world usage by the lack of distinction between the different types of toponyms, which necessitates new guidelines, a consolidation  ...  GeoWeb-News is a web-scraped corpus hence a few articles may contain duplicate paragraphs or some missing words from improperly parsed web links, which is typical of what might be encountered in practical  ... 
arXiv:1810.12368v5 fatcat:omtwa7xnvrgxvgipn6pddc6l44

Placing Photos with a Multimodal Probability Density Function

Jonathon Hare, Jamie Davies, Sina Samangooei, Paul Lewis
2014 Proceedings of International Conference on Multimedia Retrieval - ICMR '14  
However there is still a mass of content that is not geotagged, meaning that algorithms for efficient and accurate geographical estimation of an image are needed.  ...  This paper presents a general model for effectively using both textual metadata and visual features of photos to automatically place them on a world map with state-of-the-art performance.  ...  INTRODUCTION Over recent years there has been a steady increase in the amount of geotagged imagery on the web.  ... 
doi:10.1145/2578726.2578768 dblp:conf/mir/HareDSL14 fatcat:rttnqkz6yvhslm42xclcmozvdq

On How Crowdsourced Data and Landscape Organisation Metrics Can Facilitate the Mapping of Cultural Ecosystem Services: An Estonian Case Study

Oleksandr Karasov, Stien Heremans, Mart Külvik, Artem Domnich, Igor Chervanyov
2020 Land  
Therefore, recent studies are increasingly recognising passively crowdsourced geotagged photographs retrieved from location-based social media as suitable data for quantitative mapping and assessment of  ...  The suggested methodology can significantly improve the state-of-the-art with regard to CES mapping from geotagged photographs, and it is therefore particularly relevant for monitoring landscape sustainability  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/land9050158 fatcat:67l3h6u7bzcgrlzpo55y5g7m5e

An introduction to the Europe Media Monitor family of applications [article]

Ralf Steinberger, Bruno Pouliquen, Erik van der Goot
2013 arXiv pre-print
We give an overview of their functionality and discuss some of the implications of the fact that they cover quite so many languages.  ...  Part of this media monitoring work can be automated.  ...  EMM visits the news web sites up to every five minutes to search for the latest articles.  ... 
arXiv:1309.5290v1 fatcat:wpcgvkswkne7vbknmm23yhj3w4

Travel Guides for Creative Tourists, Powered by Geotagged Social Media [article]

Dan Tasse, Jason I. Hong
2021 arXiv pre-print
However, public geotagged social media data, like tweets and photos, have the potential to fill this gap, showing users an interesting and unique side of a place.  ...  Through three studies surrounding the design and construction of a social-media-powered Neighborhood Guides website, we show recommendations for building such a site.  ...  Geotagged social media can help fill this gap and help them understand the neighborhoods in an unfamiliar city, and we have provided an example and guidelines to build a better site.  ... 
arXiv:2112.12009v1 fatcat:uizyl2yl6nh5jjjfxzbvrvjita

Characterizing the 2016 Russian IRA Influence Campaign [article]

Adam Badawy, Aseel Addawood, Kristina Lerman, Emilio Ferrara
2018 arXiv pre-print
Congress investigation of Russian interference in the 2016 U.S. election campaign, with Russia accused of, among other things, using trolls (malicious accounts created for the purpose of manipulation)  ...  Conservative users who retweet Russian trolls produced significantly more tweets than liberal ones, about 8 times as many in terms of tweets.  ...  The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of AFOSR  ... 
arXiv:1812.01997v1 fatcat:4cvg4utmiffkveqkvqln7nhxxu