2,351 Hits in 2.7 sec

Large-scale linked data integration using probabilistic reasoning and crowdsourcing

Gianluca Demartini, Djellel Eddine Difallah, Philippe Cudré-Mauroux
2013 The VLDB journal  
We tackle the problems of semiautomatically matching linked data sets and of linking large collections of Web pages to linked data.  ...  We integrate all results from the inverted indices, from the graph database and from the crowd using a probabilistic framework in order to make sensible decisions about candidate matches and to identify  ...  For this reason, only a very limited set of candidate pairs should be crowdsourced when matching large data sets.  ... 
doi:10.1007/s00778-013-0324-z fatcat:s6g6emp2qjeflkoasym37ys5fm


Gianluca Demartini, Djellel Eddine Difallah, Philippe Cudré-Mauroux
2012 Proceedings of the 21st international conference on World Wide Web - WWW '12  
We evaluate ZenCrowd in a real deployment and show how a combination of both probabilistic reasoning and crowdsourcing techniques can significantly improve the quality of the links, while limiting the  ...  We tackle the problem of entity linking for large collections of online pages; Our system, ZenCrowd, identifies entities from natural language text using state of the art techniques and automatically connects  ...  We describe our formal model to combine both algorithmic and crowdsourcing results using probabilistic reasoning in Section 4.  ... 
doi:10.1145/2187836.2187900 dblp:conf/www/DemartiniDC12 fatcat:ywleux4n2bdlzmavh4x5jrx5xy

Disaster management, crowdsourced R&D and probabilistic innovation theory: Toward real time disaster response capability

Christian William Callaghan
2016 International Journal of Disaster Risk Reduction  
Global collaborative innovation platforms and large-scale investments in emerging crowdsourced R&D and social media technologies together with synthesis of appropriate theory may contribute to improved  ...  inputs and analysis), and its methodologies, such as those drawing from crowdsourced R&D and social media, may offer useful insights into enabling real time research capabilities, with important implications  ...  to provide large-scale quality data in crises but also how they can contribute to analysis of data and problem solving in real time.  ... 
doi:10.1016/j.ijdrr.2016.05.004 pmid:32289010 pmcid:PMC7104335 fatcat:khoixzktgrdttavwhyn6xbjqhu

TopCrowd [chapter]

Christian Nieke, Ulrich Güntzer, Wolf-Tilo Balke
2014 Lecture Notes in Computer Science  
Building databases and information systems over data extracted from heterogeneous sources like the Web poses a severe challenge: most data is incomplete and thus difficult to process in structured queries  ...  The intelligent combination of efficient data processing algorithms with crowdsourced database operators promises to alleviate the situation.  ...  All attribute values were scaled to an interval [0,1] and used as attribute scores.  ... 
doi:10.1007/978-3-319-12206-9_10 fatcat:i6clz5fhejahtfhosmyvvtgh4e

Contemporary HIV/AIDS research: Insights from knowledge management theory

Chris William Callaghan
2017 SAHARA-J  
Expert and non-expert crowdsourced inputs can enable problem-solving through exponentially increasing problem-solving inputs, using the 'crowd,' thereby increasing collaborations dramatically.  ...  To date, however, these challenges remain with us, and theoretical contributions that can complement natural science efforts to eradicate these problems are needed.  ...  In large-scale crowdsourced data collection and analysis seeking to facilitate real time research, crowdsourced R&D inputs would therefore need to be differentiated according to their relative value to  ... 
doi:10.1080/17290376.2017.1375426 pmid:28922967 pmcid:PMC5639607 fatcat:lz7mljxdhnak5fi4t2epy4vk2a

Crowdsourcing Tasks within Linked Data Management

Elena Simperl, Barry Norton, Denny Vrandecic
2011 International Semantic Web Conference  
Many aspects of Linked Data management -including exposing legacy data and applications to semantic formats, designing vocabularies to describe RDF data, identifying links between entities, query processing  ...  In this paper we build upon and extend these ideas to propose a framework by which human and computational intelligence can co-exist by augmenting existing Linked Data and Linked Service technology with  ...  , in data discovery and data integration.  ... 
dblp:conf/semweb/SimperlNV11 fatcat:h3dl7equmjbltebxduai44n3wi

Developing the Transdisciplinary Aging Research Agenda: New Developments in Big Data

Christian W. Callaghan
2018 Current Aging Science  
comprehensive data coverage and inductive data-driven modes of enquiry versus theory-driven deductive modes, this critical review seeks to offer useful perspectives of big data analytics and to derive  ...  Method: This work offers a critical review of theory and literature relating big data to aging research.  ...  Such large scale data creates opportunities for big data scientists to develop theory and new models based on radically increased volumes of data collection, integration, and analysis [49] .  ... 
doi:10.2174/1874609810666170719100122 pmid:28721807 pmcid:PMC6110041 fatcat:yzao236l6rejrccclkdb45af5y

When less is more: innovations for tracking progress toward global targets

Todd S Rosenstock, Christine Lamanna, Sabrina Chesterman, James Hammond, Suneetha Kadiyala, Eike Luedeling, Keith Shepherd, Brian DeRenzi, Mark T van Wijk
2017 Current Opinion in Environmental Sustainability  
on cost, accuracy and scale of data collection.  ...  Despite advances in the ability to acquire and handle large amounts of data, budgets as well as human and institutional capacity are typically insufficient to deal with the scale and complexity of even  ... 
doi:10.1016/j.cosust.2017.02.010 fatcat:zveknmwvvvdbzl73a2un5trahu

Knowledge Management and Problem Solving in Real Time: The Role of Swarm Intelligence

Chris W Callaghan
2016 Interdisciplinary Journal of Information, Knowledge, and Management  
This synthesis seeks to offer useful insights into the research process, by offering a perspective of what maximized collaboration, as a system, implies for real-time problem solving.  ...  Knowledge management research applied to the development of real-time research capability, or capability to solve societal problems in hours and days instead of years and decades, is perhaps increasingly  ...  The probabilistic potential of crowdsourcing and social media use for large data collection, synthesis, and analysis is particularly useful in real-time disaster contexts where problem-solving processes  ... 
doi:10.28945/3528 fatcat:q2ymtymjnfdrlhptfcx54ja2ci

Identifying and Accounting for Task-Dependent Bias in Crowdsourcing

Ece Kamar, Ashish Kapoor, Eric Horvitz
2015 AAAI Conference on Human Computation & Crowdsourcing  
First, we show how to build and use probabilistic graphical models for jointly modeling task features, workers' biases, worker contributions and ground truth answers of tasks so that task-dependent bias  ...  We evaluate the models with varying complexity on a large data set collected from a citizen science project and show that the models are effective at correcting the task-dependent worker bias.  ...  Acknowledgments We thank the Galaxy Zoo team for data access, John Guiver for assistance with using Infer.Net and the anonymous reviewers for feedback.  ... 
dblp:conf/hcomp/KamarKH15 fatcat:2g3ujgltkjdn5expjprgtiej4q

A Survey on State-of-the-art Techniques for Knowledge Graphs Construction and Challenges ahead [article]

Ali Hur, Naeem Janjua, Mohiuddin Ahmed
2021 arXiv   pre-print
The knowledge graph is an emerging technology that allows logical reasoning and uncovers new insights using content along with the context.  ...  Thereby, it provides necessary syntax and reasoning semantics that enable machines to solve complex healthcare, security, financial institutions, economics, and business problems.  ...  Mao, and W. Zhao, "Knowledge graphs completion via [2] X. Dong et al., "Knowledge vault: A web-scale approach to probabilistic reasoning," Inf. Sci.  ... 
arXiv:2110.08012v2 fatcat:q6utzgjahfehpftol3dttgolui

AuthCrowd: Author Name Disambiguation and Entity Matching using Crowdsourcing

Antonio Correia, Diogo Guimaraes, Dennis Paulino, Shoaib Jameel, Daniel Schneider, Benjamim Fonseca, Hugo Paredes
2021 2021 IEEE 24th International Conference on Computer Supported Cooperative Work in Design (CSCWD)  
As these concerns tend to be of large-scale, both the general consistency and the quality of electronic data are largely affected.  ...  This paper presents an approach to handle these name ambiguity problems through the use of crowdsourcing as a complementary means to traditional unsupervised approaches.  ...  in [4] does not consider the use of human-involved methods and techniques at a large scale (i.e., crowdsourcing) despite some recent advances in this direction [6] .  ... 
doi:10.1109/cscwd49262.2021.9437769 fatcat:wuc2n5a2bjenpkyzeasmcqdsqi


Rebecca Sharp, Adarsh Pyarelal, Benjamin Gyori, Keith Alcock, Egoitz Laparra, Marco A. Valenzuela-Escárcega, Ajay Nagesh, Vikas Yadav, John Bachman, Zheng Tang, Heather Lent, Fan Luo (+5 others)
2019 Proceedings of the 2019 Conference of the North  
Delphi is a modeling framework that assembles quantified causal fragments and their contexts into executable probabilistic models that respect the semantics of the original text and can be used to support  ...  In this paper, we introduce an approach that builds executable probabilistic models from raw, free text. The proposed approach is implemented through three systems: Eidos 1 , IN-DRA 2 and Delphi 3 .  ...  Acknowledgments: This work was supported by the Defense Advanced Research Projects Agency (DARPA) under the World Modelers program, grant W911NF1810014 and by the Bill and Melinda Gates Foundation HBGDki  ... 
doi:10.18653/v1/n19-4008 dblp:conf/naacl/SharpPGALVNYBTL19 fatcat:tov4e4a2nvc37bbc6v4fpffn6q

Intelligent systems for geosciences

Yolanda Gil, Mary Hill, John Horel, Leslie Hsu, Jim Kinter, Craig Knoblock, David Krum, Vipin Kumar, Pierre Lermusiaux, Yan Liu, Chris North, Suzanne A. Pierce (+16 others)
2018 Communications of the ACM  
Geoscience data is challenging because it tends to be uncertain, intermittent, sparse, multiresolution, and multiscale.  ...  Although there have been significant and beneficial interactions between the intelligent systems and geosciences communities, 4,12  ...  , and Maria Zemankova for suggestions and feedback.  ... 
doi:10.1145/3192335 fatcat:67tjss5hojdorb2thpdlxoa6ye

A crowdsourceable QoE evaluation framework for multimedia content

Kuan-Ta Chen, Chen-Chi Wu, Yu-Chun Chang, Chin-Laung Lei
2009 Proceedings of the seventeen ACM international conference on Multimedia - MM '09  
that of MOS, so there is less burden on participants; and 3) it derives interval-scale scores that enable subsequent quantitative analysis and QoE provisioning.  ...  Since such a crowd can be quite large, crowdsourcing enables researchers to conduct experiments with a more diverse set of participants at a lower economic cost than would be possible under laboratory  ...  This work was supported in part by the Taiwan E-learning and Digital Archives Program (TELDAP), sponsored by the National Science Council of Taiwan under grants NSC98-2631-001-011 and NSC98-2631-001-013  ... 
doi:10.1145/1631272.1631339 dblp:conf/mm/ChenWCL09 fatcat:byqiddgf5jfpbfzeq2yad4fzam
« Previous Showing results 1 — 15 out of 2,351 results