Filters








2,924 Hits in 9.4 sec

Event Detection in Wikipedia Edit History Improved by Documents Web Based Automatic Assessment

Marco Fisichella, Andrea Ceroni
2021 Big Data and Cognitive Computing  
The evolution of an event is captured successfully primarily based on analyzing the user edits records in Wikipedia.  ...  As the extra document source for event validation, we chose the Web due to its ease of accessibility and wide event coverage.  ...  )) by analyzing the events described manually by Wikipedia users versus the events detected by our model.  ... 
doi:10.3390/bdcc5030034 fatcat:nbzrpcdg2fgofp3gz44wdvhrby

An Information Nutritional Label for Online Documents

Norbert Fuhr, Wolfgang Nejdl, Isabella Peters, Benno Stein, Anastasia Giachanou, Gregory Grefenstette, Iryna Gurevych, Andreas Hanselowski, Kalervo Jarvelin, Rosie Jones, YiquN Liu, Josiane Mothe
2018 SIGIR Forum  
The beauty of the web is its openness, but this openness has lead to a proliferation of false and unreliable information, whose presentation makes it difficult to detect.  ...  Here we propose creating an "information nutrition label" that we can automatically generated for any online text.  ...  Acknowledgements This proposition is the result of a workshop held during the Dagstuhl Seminar number 17301 on User-Generated Content in Social Media, July 23 28, 2017.  ... 
doi:10.1145/3190580.3190588 fatcat:n5avwqt5pzgktgwtjcealsvoau

Community-Contributed Media Collections: Knowledge at Our Fingertips [chapter]

Tania Cerquitelli, Alessandro Fiori, Alberto Grand
2011 Community-Built Databases  
The works described in this chapter aim to (a) improve the automatic understanding of this multimedia data and (b) enhance the document classification task and the user searching activity on media collections  ...  This chapter reviews different collections of user-contributed media, such as YouTube, Flickr, and Wikipedia, by presenting the main features of their online social networking sites.  ...  Since many Web users may not be proficient at creating and managing Web content, the editing model of Wikipedia is based on "wiki".  ... 
doi:10.1007/978-3-642-19047-6_2 fatcat:pijcr2c3j5e5tjs73zbtuue6qa

Barbara Made the News

Flavio Martins, João Magalhes, Jamie Callan
2016 Proceedings of the Ninth ACM International Conference on Web Search and Data Mining - WSDM '16  
Our hypothesis stems from the fact that when a real-world event occurs it usually has peak times on the Web: a higher volume of tweets, new visits and edits to related Wikipedia articles, and news published  ...  In Twitter, and other microblogging services, the generation of new content by the crowd is often biased towards immediacy: what is happening now.  ...  The best improvement in MAP is obtained with the Twitter Feedback feature and the best improvement in P30 is obtained by the Wikipedia Edits feature.  ... 
doi:10.1145/2835776.2835825 dblp:conf/wsdm/MartinsMC16 fatcat:slaftrg6ezap5ptmnonfxjvwa4

Extracting Event-Related Information from Article Updates in Wikipedia [chapter]

Mihai Georgescu, Nattiya Kanhabua, Daniel Krause, Wolfgang Nejdl, Stefan Siersdorfer
2013 Lecture Notes in Computer Science  
In this paper, we conduct an in-depth analysis of event-related updates in Wikipedia by examining different indicators for events including language, meta annotations, and update bursts.  ...  We then study how these indicators can be employed for automatically detecting eventrelated updates.  ...  Acknowledgments This work was partially funded by the European Commission FP7 under grant agreements No. 287704 and No. 600826 for the CUBRIK and ForgetIT projects respectively.  ... 
doi:10.1007/978-3-642-36973-5_22 fatcat:j6lcsjzwzjfnlhndwl2xhg7fna

Excavating the mother lode of human-generated text: A systematic review of research that uses the wikipedia corpus

Mohamad Mehdi, Chitu Okoli, Mostafa Mesgari, Finn Årup Nielsen, Arto Lanamäki
2017 Information Processing & Management  
files that contain all links between 5, 716, 808 Wikipedia pages. • Wikipedia3: a monthly updated conversion of the English Wikipedia into RDF. • Wikipedia edit history: complete Wikipedia edit history  ...  Several studies used Wikipedia knowledge base to enhance the text classification task. Wang and Domeniconi [128] used Wikipedia to improve document classification by defining concept-based kernels.  ... 
doi:10.1016/j.ipm.2016.07.003 fatcat:qgjeatizfzbyjkbo4rsuxea76y

Generating Full Length Wikipedia Biographies: The Impact of Gender Bias on the Retrieval-Based Generation of Women Biographies [article]

Angela Fan, Claire Gardent
2022 arXiv   pre-print
We address these by developing a model for English text that uses a retrieval mechanism to identify relevant supporting information on the web and a cache-based pre-trained encoder-decoder to generate  ...  To assess the impact of available web evidence on the output text, we compare the performance of our approach when generating biographies about women (for which less information is available on the web  ...  However, Wikipedia articles remain painstakingly written and edited primarily by a network of human contributors.  ... 
arXiv:2204.05879v1 fatcat:mjfgnvmxibgpfn3xnvteosrsai

Computing controversy: Formal model and algorithms for detecting controversy on Wikipedia and in search queries

Kazimierz Zielinski, Radoslaw Nielek, Adam Wierzbicki, Adam Jatowt
2018 Information Processing & Management  
Our approach can be also applied in Wikipedia or other knowledge bases for supporting the detection of controversy and content maintenance.  ...  Finally, we believe that our results could be useful for social science researchers for understanding the complex nature of controversy and in fostering their studies.  ...  Acknowledgment This work is supported by Polish National Science Centre grant 2015/19/B/ST6/03179.  ... 
doi:10.1016/j.ipm.2017.08.005 fatcat:iu5qkr2w6fdzxne5eavjdvp3km

User-Generated Content in Social Media (Dagstuhl Seminar 17301)

Tat-Seng Chua, Norbert Fuhr, Gregory Grefenstette, Kalervo Järvelin, Jaakko Paltonen, Marc Herbstritt
2018 Dagstuhl Reports  
WG1 invented an "Information Nutrition Label" that characterizes a document by different features such as e.g. emotion, opinion, controversy, and topicality; For computing these feature values, available  ...  This report documents the program and the outcomes of Dagstuhl Seminar 17301 "User-Generated Content in Social Media". Social media have a profound impact on individuals, businesses, and society.  ...  For topics that are covered by Wikipedia, determine the portion of reverts (after article editing), the so-called "edit wars" in Wikipedia. See the coverage measure (essay articles) below.  ... 
doi:10.4230/dagrep.7.7.110 dblp:journals/dagstuhl-reports/ChuaFGJP17 fatcat:bman5u6q5zdg7a6csnzwpba7sm

When time meets information retrieval: Past proposals, current plans and future trends

Bilel Moulahi, Lynda Tamine, Sadok Ben Yahia
2016 Journal of information science  
In this respect, the time dimension has been extensively exploited as a highly important relevance criterion to improve the retrieval effectiveness of document ranking models.  ...  With the advent of Web search and the large amount of data published on the Web sphere, a tremendous amount of documents become strongly time-dependent.  ...  Acknowledgement We thank the anonymous reviewers, whose comments have contributed to important improvements of the paper. Notes  ... 
doi:10.1177/0165551515607277 fatcat:c7zu437drreq5dqlbka646qnby

Multilingual Ranking of Wikipedia Articles with Quality and Popularity Assessment in Different Topics

Włodzimierz Lewoniewski, Krzysztof Węcel, Witold Abramowicz
2019 Computers  
The goal of this study is to show what topics are best represented in different language versions of Wikipedia using results of quality assessment for over 39 million articles in 55 languages.  ...  On Wikipedia, articles about various topics can be created and edited independently in each language version. Therefore, the quality of information about the same topic depends on the language.  ...  Automatic quality assessment of Wikipedia articles is a known challenge in the scientific community.  ... 
doi:10.3390/computers8030060 fatcat:zwjjdyhhkreanjstqcs7zaaw2e

Integration of multiple network views in Wikipedia

Guangyu Wu, Pádraig Cunningham
2014 Knowledge and Information Systems  
We present an example of such a scenario in the analysis of edit networks in Wikipedia -the networks of editors interacting on Wikipedia pages.  ...  This is particularly an issue when the network is dynamic and is defined by events that occur over time.  ...  Acknowledgments This work is supported by Science Foundation Ireland Grant No. 08/SRC/I140 (Clique: Graph & Network Analysis Cluster).  ... 
doi:10.1007/s10115-014-0802-7 fatcat:e7jg5qpt7fdsxmwmfcdckbrhia

Wikipedia Research and Tools: Review and Comments

Finn Årup Nielsen
2012 Social Science Research Network  
I here give an overview of Wikipedia and wiki research and tools. Well over 1,000 reports have been published in the field and there exist dedicated scientific meetings for Wikipedia research.  ...  Claudia Koltzenburg and James Heilman for pointing to references and tools Thanks also to Chitu Okoli, Mohamad Mehdi, Mostafa Mesgari and Arto Lanamäki with whom I am writing systematic reviews about Wikipedia  ...  Automatic vandalism detection in Wikipedia: Towards a machine learning approach.  ... 
doi:10.2139/ssrn.2129874 fatcat:h4znerp2efhn5j5bdi5rl7addi

Mining meaning from Wikipedia

Olena Medelyan, David Milne, Catherine Legg, Ian H. Witten
2009 International Journal of Human-Computer Studies  
It focuses on research that extracts and makes use of the concepts, relations, facts and descriptions found in Wikipedia, and organizes the work into four broad categories: applying Wikipedia to natural  ...  The article addresses how Wikipedia is being used as is, how it is being improved and adapted, and how it is being combined with other structures to create entirely new resources.  ...  Medelyan is supported by a scholarship from Google, Milne by the New Zealand Tertiary Education Commission.  ... 
doi:10.1016/j.ijhcs.2009.05.004 fatcat:mzxszf4jlfcizbgxuemgdwzdiy

Trust in collaborative web applications

Andrew G. West, Jian Chang, Krishna K. Venkatasubramanian, Insup Lee
2012 Future generations computer systems  
Abstract Collaborative functionality is increasingly prevalent in web applications.  ...  Collaborative functionality is increasingly prevalent in web applications. Such functionality permits individuals to add -and sometimes modify -web content, often with minimal barriers to entry.  ...  Acknowledgements: This research was supported in part by ONR MURI N00014-07-1-0907 and NSF CNS-0931239. A preliminary version was published as University of Pennsylvania Technical Report MS-CIS-10-33.  ... 
doi:10.1016/j.future.2011.02.007 fatcat:siwvu6fke5gynafyn6vxketlau
« Previous Showing results 1 — 15 out of 2,924 results