15 Hits in 2.3 sec

Evaluation Criteria for RDF Triplestores with an Application to Allegrograph

Khadija Alaoui, Mohamed Bahaj
2020 International Journal of Advanced Computer Science and Applications  
This has led to the appearance of a variety of data systems to store and process RDF data.  ...  Since its launching as the standard language of the semantic web, the Resource Description Framework RDF has gained an enormous importance in many fields.  ...  , dynamicity, scalability, reasoning, data integration, data exchange, data portability, scalability, visualization and support of analytical functionalities.  ... 
doi:10.14569/ijacsa.2020.0110653 fatcat:4nm5caxih5b2rdgbgcr52qbrxa

A survey and experimental comparison of distributed SPARQL engines for very large RDF data

Ibrahim Abdelaziz, Razen Harbi, Zuhair Khayyat, Panos Kalnis
2017 Proceedings of the VLDB Endowment  
Then, we select 12 representative systems and perform extensive experimental evaluation with respect to preprocessing cost, query performance, scalability and workload adaptability, using a variety of  ...  In this paper, we present a survey of 22 state-of-the-art systems that cover the entire spectrum of distributed RDF data processing and categorize them by several characteristics.  ...  , incurred replication, query performance, and scalability.  ... 
doi:10.14778/3151106.3151109 fatcat:6m7iotec65cufebmm5jbali74q

Towards Making Distributed RDF Processing FLINKer

Amr Azzam, Sabrina Kirrane, Axel Polleres
2018 2018 4th International Conference on Big Data Innovations and Applications (Innovate-Data)  
In this position paper, based on an indepth analysis of the state of the art, we propose to manage large RDF datasets in Flink, a well-known scalable distributed Big Data processing framework.  ...  This scenario has introduced severe big semantic data challenges when it comes to managing and querying RDF data at Web scale.  ...  a distributed and scalable manner.  ... 
doi:10.1109/innovate-data.2018.00009 dblp:conf/obd/AzzamKP18 fatcat:s7la6h4c7fgvrf6e4qz37xz34e

Big Data Semantics

Paolo Ceravolo, Antonia Azzini, Marco Angelini, Tiziana Catarci, Philippe Cudré-Mauroux, Ernesto Damiani, Alexandra Mazak, Maurice Van Keulen, Mustafa Jarrar, Giuseppe Santucci, Kai-Uwe Sattler, Monica Scannapieco (+3 others)
2018 Journal on Data Semantics  
Indeed, multiple components and procedures must be coordinated to ensure a high level of data quality and accessibility for the application layers, e.g., data analytics and reporting.  ...  In this paper, the third of its kind co-authored by members of IFIP WG 2.6 on Data Semantics, we propose a review of the literature addressing these topics and discuss relevant challenges for future research  ...  Data Semantics Dimensions Achieving the full potential of Big Data analytics requires realizing a reconciliation between data distribution and data modeling principles [14, 21] .  ... 
doi:10.1007/s13740-018-0086-2 fatcat:bhbeyntbtzdkvf5t3dcko42jpy

Semantic Oriented Data Modeling for Enterprise Application Engineering Using Semantic Web Languages

Khadija Alaoui
2020 International Journal of Advanced Trends in Computer Science and Engineering  
RDF and OWL ware introduced to guaranty a semantic web to permit a web of interconnected data instead of a static web so that computer can explore web contents.  ...  Here beyond publishing data for the semantic web, we give motivations for the necessity of a RDF/OWL semantic data modeling, as well as an approach in form of best practices for this modeling.  ...  Scalability and multi-modeling The data modeling should be scalable with respect to big data volumes and data processing. A.  ... 
doi:10.30534/ijatcse/2020/116932020 fatcat:zxshsr2wbbbwra3vht5th7leee

Categorization of RDF Data Management Systems

Khadija Alaoui, Mohamed Bahaj
2021 Advances in Science, Technology and Engineering Systems  
Furthermore, the categorization considers various aspects that specifically deal with RDF data modeling, organization of RDF data, the processing of SPARQL queries, scalability, as well as aspects related  ...  The wide acceptance of the semantic web language RDF for ontologies creation in various application fields has led to the emergence of numerous RDF data processing solutions, the so-called triplestores  ...  and for data analytics purposes.  ... 
doi:10.25046/aj060225 fatcat:sbltbipjrbexnoysynmkib4jsa

Efficient Subgraph Matching on Large RDF Graphs Using MapReduce

Xin Wang, Lele Chai, Qiang Xu, Yajun Yang, Jianxin Li, Junhu Wang, Yunpeng Chai
2019 Data Science and Engineering  
In our method, query graphs are decomposed into a set of stars that utilize the semantic and structural information embedded RDF graphs as heuristics.  ...  One algorithm, called RDF property filtering, filters out invalid input data to reduce intermediate results; the other is to improve the query performance by postponing the Cartesian product operations  ...  YARS2 is a distributed semantic web search engine, which integrates data retrieving, collecting, indexing, and browsing together.  ... 
doi:10.1007/s41019-019-0090-z fatcat:zfrn7vmxzvch7hca4tsmbnfvne

RDF in the clouds: a survey

Zoi Kaoudi, Ioana Manolescu
2014 The VLDB journal  
The Resource Description Framework (RDF) pioneered by the W3C is increasingly being adopted to model data in a variety of scenarios, in particular data to be published or exchanged on the Web.  ...  In this article, we survey RDF data management architectures and systems designed for a cloud environment, and more generally, those large-scale RDF data management systems that can be easily deployed  ...  Although this approach may be efficient for very selective queries with few intermediate results, it is not scalable for analytical-style queries which need to access big portions of RDF data.  ... 
doi:10.1007/s00778-014-0364-z fatcat:qyp6euinnvexxlliqe2wf45ona

Sempala: Interactive SPARQL Query Processing on Hadoop [chapter]

Alexander Schätzle, Martin Przyjaciel-Zablocki, Antony Neu, Georg Lausen
2014 Lecture Notes in Computer Science  
For Hadoop-based applications, a common data pool (HDFS) provides many synergy benefits, making it very attractive to use these infrastructures for semantic data processing as well.  ...  Driven by initiatives like, the amount of semantically annotated data is expected to grow steadily towards massive scale, requiring cluster-based solutions to query it.  ...  However, they do not provide a distributed query engine, thus scalability and query performance for large RDF data is still an issue.  ... 
doi:10.1007/978-3-319-11964-9_11 fatcat:nbt5xlop2jahlapxvfikcqkn5y

Combining Vertex-Centric Graph Processing with SPARQL for Large-Scale RDF Data Analytics

Ibrahim Abdelaziz, Razen Harbi, Semih Salihoglu, Panos Kalnis
2017 IEEE Transactions on Parallel and Distributed Systems  
We present various scenarios where our framework simplifies significantly the implementation of complex RDF data analytics programs.  ...  We bridge the gap by introducing Spartex, a versatile framework for complex RDF analytics.  ...  Fig. 7 . 7 Data Scalability using LUBM dataset. Fig. 8 . 8 Machine Scalability (LUBM-10240). Fig. 10 . 10 Use case 3: SamplD analytics pipeline.  ... 
doi:10.1109/tpds.2017.2720174 fatcat:yolgu6hg75fixgtffmmew57kle

Document-based RDF storage method for parallel evaluation of basic graph pattern queries

Eleftherios Kalogeros, Manolis Gergatsoulis, Matthew Damigos
2020 International Journal of Metadata, Semantics and Ontologies  
We propose an effective data model for storing RDF data in a document database using maximum replication factor of 2 (i.e., in the worst case scenario, the data graph will be doubled in storage size).  ...  In this paper, we investigate the problem of efficiently evaluating (Basic Graph Pattern) BGP SPARQL queries over a large amount of RDF data.  ...  He is currently working as BI and Analytics lead in Anixe Technologies, and he has a long experience in data warehousing, data integration and data analytics as member of multiple technology companies.  ... 
doi:10.1504/ijmso.2020.107798 fatcat:k4kzrnecl5asdegmqys6s3hozu

Big Data: from collection to visualization

Mohammed Ghesmoune, Hanene Azzag, Salima Benbernou, Mustapha Lebbah, Tarn Duong, Mourad Ouziri
2017 Machine Learning  
For this, we present a complete work flow on (a) how to represent the heterogeneous collected data using the high performance RDF language, how to perform the fusion of the Big Data in RDF by resolving  ...  version of the growing neural gas approach, which is capable of clustering data streams with a single pass over the data.  ...  Acknowledgements This research has been supported by the French Foundation FSN, PIA Grant Big data-Investissements d'Avenir.  ... 
doi:10.1007/s10994-016-5622-4 fatcat:qzfmkhe2ffag3pjx7noqdk7f24

Scalable Discovery and Analytics on Web Linked Data

Ibrahim Abdelaziz
To address the scalability limitation of federated RDF engines, we propose Lusail; a scalable system for querying geo-distributed RDF graphs.  ...  It identifies a set of research problems for improving the state-of-the-art, including: supporting the emerging RDF analytics required by many modern applications, querying linked data at scale, and enabling  ...  However, it lacks a scalability study to assess how scalable are existing federated engines as the number of endpoints and accordingly the data size increases.  ... 
doi:10.25781/kaust-pg966 fatcat:r6mjiift6fgj5msmuntzfmcnnq

Accelerating SPARQL Queries and Analytics on RDF Data Dissertation by EXAMINATION COMMITTEE

Razen Al-Harbi, Razen Al-Harbi
2016 unpublished
This dissertation tackles the problem of accelerating SPARQL queries and RDF analytics on distributed shared-nothing RDF systems. First, a distributed RDF engine , coined AdPart, is introduced.  ...  Finally, (iii) it provides a unified in-memory data store that allows the persistence of intermediate results.  ...  H2RDF+ [30] is a highly scalable distributed RDF engine based on MapReduce [47] framework and Apache HBase [53].  ... 

Efficient Query Processing Over Web-Scale RDF Data

Amgad M. Madkour
We present a set of encoding techniques and demonstrate how to use semantic filters to reduce irrelevant data in a distributed setting.  ...  RDF is the defacto standard for semantic data where it provides a flexible semi-structured model for describing concepts and relationships.  ...  This meta-data assists the query engine in determining if a data block can be skipped. The SDATA representation in a SemVP allows a query engine to skip data blocks.  ... 
doi:10.25394/pgs.7413308.v1 fatcat:qc6c7yzz45a3ve7q3b7bsdkelq