156,996 Hits in 3.6 sec

Scalable integration and processing of linked data

Andreas Harth, Aidan Hogan, Spyros Kotoulas, Jacopo Urbani
2011 Proceedings of the 20th international conference companion on World wide web - WWW '11  
As such, the tutorial will focus on Linked Data publishing and related Semantic Web technologies, introducing scalable techniques for crawling, indexing and automatically integrating structured heterogeneous  ...  The tutorial will show how Linked Data enables uniform access, parsing and interpretation of data, and how this novel wealth of structured data can potentially be exploited for creating new applications  ...  and for integrating heterogeneous data from a large number of diverse sources.  ... 
doi:10.1145/1963192.1963318 dblp:conf/www/HarthHKU11 fatcat:63pzk35ga5b4hegnyf3dcmd2ey

Developing the Scalable Data Integration for Disease Surveillance (SDIDS) Platform

David Buckeridge, Maxime Lavigne, Kate Zinszer, Anya Okhmatovskaia, Samson Tu, Csongor Nyalus, Mark Musen, Wilson Lau, Lauren Carroll, Neil Abernethy
2016 Online Journal of Public Health Informatics  
Objective To develop a scalable software platform for integrating existing global health surveillance data and to implement the platform for malaria surveillance in Uganda.  ...  Acknowledgments Research supported by the Bill and Melinda Gates Foundation.  ...  Retrieval and analysis of data: External applications can connect directly to SDIDS via an API to request data for further processing or to request the results of analyses applied to the integrated data  ... 
doi:10.5210/ojphi.v8i1.6414 fatcat:vp36ffdv7nfvjfl3nolcjetyii

Leveraging Fog Computing for Scalable IoT Datacenter Using Spine-Leaf Network Topology

K. C. Okafor, Ifeyinwa E. Achumba, Gloria A. Chukwudebe, Gordon C. Ononiwu
2017 Journal of Electrical and Computer Engineering  
Besides, the scalability requirements found in the current IoT data processing (in the cloud) can hardly be used for applications such as assisted living systems, Big Data analytic solutions, and smart  ...  With the Internet of Everything (IoE) paradigm that gathers almost every object online, huge traffic workload, bandwidth, security, and latency issues remain a concern for IoT users in today's world.  ...  Acknowledgments This research was carried out as an extended work on Distributed Cloud Computing Network for SGEMS/EETACP project commissioned by the Department of Electronic Engineering, University of  ... 
doi:10.1155/2017/2363240 fatcat:7x7xvrmuf5dm7ixlu4m33ryxta

The EDRN knowledge environment: an open source, scalable informatics platform for biological sciences research

Daniel Crichton, Ashish Mahabal, Kristen Anton, Luca Cinquini, Maureen Colbert, S. George Djorgovski, Heather Kincaid, Sean Kelly, David Liu, Thomas George, Achyut K. Dutta, M. Saif Islam
2017 Micro- and Nanotechnology Sensors, Systems, and Applications IX  
It has accumulated data on hundreds of thousands of biospecemens and serves over 1300 registered users across the National Cancer Institute (NCI).  ...  It uses tools like Apache OODT, Plone, and Solr, and borrows heavily from JPL's Planetary Data System's ontological infrastructure.  ...  AAM and SGD also acknowledge support from the Center for Data-Driven Discovery at Caltech, and from the Ajax Foundation.  ... 
doi:10.1117/12.2263842 fatcat:vosjphnqxbh67gdbomsuuvwwmq

Storing and Provisioning Linked Data as a Service [chapter]

Johannes Lorey
2013 Lecture Notes in Computer Science  
We compare our work to state-of-the-art approaches for discovering, integrating, and consuming Linked Data.  ...  Linked Data offers novel opportunities for aggregating information about a wide range of topics and for a multitude of applications.  ...  A core concept of our proposed framework combining both centralized data storage and distributed query processing is data integration.  ... 
doi:10.1007/978-3-642-38288-8_48 fatcat:mtkwfovbmbfupgd5f4t3bx4x3q

BigDataGrapes D3.2 - Data Ingestion & Integration Components

Panagiotis Zervas, Sotiris Konstantinidis, Antonis Koukourikos
2018 Zenodo  
Afterwards, the document describes the different nature of data, and which technologies can be used to facilitate the ingestion process.  ...  Also, the document describes the tools that will be used for data integration across the different BigDataGrapes platform layers, as well as for long-term storage and preservation of data.  ...  The distributed execution paradigm serves as the basis for efficiently solving the equally urgent challenge of Heterogeneity and Scalability in the context of Big Data processing.  ... 
doi:10.5281/zenodo.1482750 fatcat:p3i7hezygvbhhfod55rb34ss2y

Blending OLAP Processing with Real-Time Data Streams [chapter]

João Costa, José Cecílio, Pedro Martins, Pedro Furtado
2011 Lecture Notes in Computer Science  
This blending allows the processing of queries that need such capabilities with top efficiency and scalability features.  ...  CEP and Databases share some characteristics but traditionally are treated as two separate worlds, one oriented towards real-time event processing and the later oriented towards long-term data management  ...  StreamNetFlux ease of use, real-time processing, scalability and efficiency was evaluated with a massive volume of high-rate data produced by an energy power grid infrastructure.  ... 
doi:10.1007/978-3-642-20152-3_36 fatcat:6gnrr6bjefhmlgrvskjmhclbiq

Scalable modeling of cloud-based IoT services for smart cities

Amir Taherkordi, Frank Eliassen
2016 2016 IEEE International Conference on Pervasive Computing and Communication Workshops (PerCom Workshops)  
The existing work in this area does not sufficiently address this design issue due to the adoption of a uniform and flat view to the structure of IoT services and their data in the Cloud.  ...  With a huge number of IoT services accessible through clouds, it is very important to model and expose cloud-based IoT services in a scalable manner, promising easy and realtime delivery of smart city  ...  In [25] , an approach is proposed to publish sensor data as linked data to enable dynamic discovery, integration, and querying of heterogeneous sensor data sources.  ... 
doi:10.1109/percomw.2016.7457098 dblp:conf/percom/TaherkordiE16 fatcat:7luot2jki5cbhprgikxqy2hexy

A Scalable Framework for Quality Assessment of RDF Datasets [article]

Gezim Sejdiu, Anisa Rula, Jens Lehmann, Hajira Jabeen
2020 arXiv   pre-print
Nevertheless, many applications, such as data integration, search, and interlinking, cannot take full advantage of Linked Data if it is of low quality.  ...  There exist a few approaches for the quality assessment of Linked Data, but their performance degrades with the increase in data size and quickly grows beyond the capabilities of a single machine.  ...  Acknowledgment This work was partly supported by the EU Horizon2020 projects BigDataOcean (GA no. 732310), Boost4.0 (GA no. 780732), QROWD (GA no. 723088) and CLEOPATRA (GA no. 812997).  ... 
arXiv:2001.11100v1 fatcat:azwjqvmwu5bzvlgcqrjaoaik54

Resource Optimisation in IoT Cloud Systems by Using Matchmaking and Self-management Principles [chapter]

Martin Serrano, Danh Le-Phuoc, Maciej Zaremba, Alex Galis, Sami Bhiri, Manfred Hauswirth
2013 Lecture Notes in Computer Science  
IoT Cloud systems provide scalable capacity and dynamic behaviour control of virtual infrastructures for running applications, services and processes.  ...  For our IoT Cloud data management solution we utilize performance metrics expressed with linked data in order to integrate monitored performance data and end user profile information (via linked data relations  ...  In these settings managing Cloud services lifecycle by enabling scalable applications and using distributed information systems and linked data processing in a securely is crucial.  ... 
doi:10.1007/978-3-642-38082-2_11 fatcat:cgqiysdweffefcsqexqm7roiwq

A Scalable Framework for Quality Assessment of RDF Datasets

Gezim Sejdiu, Anisa Rula, Jens Lehmann, Hajira Jabeen
2019 Zenodo  
, and interlink- take full advantage of Linked Data if it is of low quality.  ...  Today, we than 10,000 datasets being available online following Linked Data These standards allow data to be machine readable and inter-operable. ss, many applications, such as data integration, search  ...  Acknowledgment This work was partly supported by the EU Horizon2020 projects BigDataOcean (GA no. 732310), Boost4.0 (GA no. 780732), QROWD (GA no. 723088) and CLEOPATRA (GA no. 812997).  ... 
doi:10.5281/zenodo.3567905 fatcat:yoqhftwtkjabvc4rubiyrrcf4u

Scalable Hpc Workflow Infrastructure For Steering Scientific Instruments And Streaming Applications

Geoffrey Fox, Shantenu Jha, Lavanya Ramakrishnan
2015 Zenodo  
Linking scientific instruments to exascale machines and analyzing the large volumes of data produced by the instruments requires workflow infrastructure that scales along many dimensions.  ...  The requirements of distributed computing problems which couple HPC and streaming data, are distinct from those familiar from large­scale parallel simulations, grid computing, data repositories and orchestration  ...  Existing approaches such as Apache Storm are elegant but for example, lack support for quality of service and HPC processing of events. • Novel programming and software paradigms: There is a need to integrate  ... 
doi:10.5281/zenodo.19149 fatcat:jcdomemp5rgqxn3ioy7p4clova

InterDataNet Naming System: A Scalable Architecture for Managing URIs of Heterogeneous and Distributed Data with Rich Semantics [chapter]

Davide Chini, Franco Pirri, Maria Chiara Pettenati, Samuele Innocenti, Lucia Ciofi
2010 Lecture Notes in Computer Science  
Establishing equivalence links between (semantic) resources, as it is the case in the Linked Data approach, implies permanent search, analysis and alignment of new (semantic) data in a rapidly changing  ...  The core of the IDN architecture is the Naming System aimed at providing a scalable and open service to support consistent reuse of entities and their identifiers, enabling a global reference and addressing  ...  Acknowledgments We would like to acknowledge the valuable support of Prof. Dino Giuli for the material and scientific support to this research activity.  ... 
doi:10.1007/978-3-642-14956-6_4 fatcat:khdggfifdfgdjkoackp7xtun74

Piveau: A Large-scale Open Data Management Platform based on Semantic Web Technologies [article]

Fabian Kirstein, Kyriakos Stefanidis, Benjamin Dittwald, Simon Dutkowski, Sebastian Urbanek, Manfred Hauswirth
2020 arXiv   pre-print
We give a detailed description of the underlying, highly scalable, service-oriented architecture, how we integrated the aforementioned standards, and used a triplestore as our primary database.  ...  It harnesses a variety of standards, like RDF, DCAT, DQV, and SKOS, to overcome the barriers in Open Data publication. The solution puts a strong focus on assuring data quality and scalability.  ...  The above data warehouses offer highly scalable ETL functionality but do not support Linked Data and DCAT.  ... 
arXiv:2005.02614v1 fatcat:sx4bz6pyu5grxifsvnphnvbafi

TreeVector: Scalable, Interactive, Phylogenetic Trees for the Web

Ralph Pethica, Gary Barker, Tim Kovacs, Julian Gough, I. King Jordan
2010 PLoS ONE  
There are now many bioinformatics servers and databases with a range of dynamic processes and updates to cope with the increasing volume of data.  ...  Traditional techniques of plotting phylogenetic trees focus on rendering a single static image, but increases in the production of biological data and large-scale analyses demand scalable, browsable, and  ...  Martin Madera for helpful discussion on production of the SUPERFAMILY architectures tree. Author Contributions Conceived and designed the experiments: RP GB JG. Performed the experiments: RP.  ... 
doi:10.1371/journal.pone.0008934 pmid:20126613 pmcid:PMC2812488 fatcat:etzj3rlwo5eldil27t33rp5qai
« Previous Showing results 1 — 15 out of 156,996 results