Filters








20 Hits in 1.6 sec

ABSTAT: Ontology-Driven Linked Data Summaries with Pattern Minimalization [chapter]

Blerina Spahiu, Riccardo Porrini, Matteo Palmonari, Anisa Rula, Andrea Maurino
2016 Lecture Notes in Computer Science  
In this paper we discuss how an ontology-driven data abstraction model supports the extraction and the representation of summaries of linked data sets.  ...  The proposed summarization model is the backbone of the ABSTAT framework, that aims at helping users understanding big and complex linked data sets.  ...  Conclusion and Future Work Getting an understanding of the shape and nature of the data from large Linked Data sets is a complex and a challenging task.  ... 
doi:10.1007/978-3-319-47602-5_51 fatcat:duhtfgzxhrh3xmhlxmc2hawb3m

ABSTAT 1.0: Compute, Manage and Share Semantic Profiles of RDF Knowledge Graphs [chapter]

Renzo Arturo Alva Principe, Blerina Spahiu, Matteo Palmonari, Anisa Rula, Flavio De Paoli, Andrea Maurino
2018 Lecture Notes in Computer Science  
ABSTAT is an online profiling tool which helps data consumers in better understanding the data by extracting ontology-driven patterns and statistics about the data.  ...  As Linked Data available on the Web continue to grow, understanding their structure and content remains a challenging task making such the bottleneck for their reuse.  ...  Acknowledgements This research has been supported in part by EU H2020 projects EW-Shopp -Grant n. 732590, and EuBusinessGraph -Grant n. 732003.  ... 
doi:10.1007/978-3-319-98192-5_32 fatcat:saxp6uaqrfgtzfukjgl6nmcyxu

ABSTAT-HD: a scalable tool for profiling very large knowledge graphs

Renzo Arturo Alva Principe, Andrea Maurino, Matteo Palmonari, Michele Ciavotta, Blerina Spahiu
2021 The VLDB journal  
We demonstrate the impact of the new architecture of ABSTAT-HD by presenting a set of experiments that show its scalability with respect to three dimensions of the data to be processed: size, complexity  ...  However, constructing profiles and calculating several statistics such as cardinality descriptors or inferences are resource expensive.  ...  The semantic profile consists of a summary, which provides an abstract, but complete description of the dataset content, and some statistics.  ... 
doi:10.1007/s00778-021-00704-2 fatcat:eue2ppldrnfr5aip3xrv7ttnpy

Profiling Linguistic Knowledge Graphs

Blerina Spahiu, Renzo Arturo Alva Principe, Andrea Maurino
2022 Zenodo  
Such metrics are evaluated on linguistic data and our findings provide a basis for a more efficient understanding of linguistic data.  ...  Recently the number of approaches that model and interconnect linguistic data as knowledge graphs has experienced outstanding growth.  ...  These statistics describe the data set and its schema and include statistics about number of triples, triples with blank nodes, labeled subjects, number of owl:sameAs links, class and property usage, class  ... 
doi:10.5281/zenodo.6827644 fatcat:5huv5456gfh2rlejna2gouyaaq

Towards Improving the Quality of Knowledge Graphs with Data-driven Ontology Patterns and SHACL

Blerina Spahiu, Andrea Maurino, Matteo Palmonari
2018 International Semantic Web Conference  
ABSTAT is an online semantic profiling tool which helps data consumers in better understanding of the data by extracting data-driven ontology patterns and statistics about the data.  ...  As Linked Data available on the Web continue to grow, understanding their structure and assessing their quality remains a challenging task making such the bottleneck for their reuse.  ...  Acknowledgements This research has been supported in part by EU H2020 projects EW-Shopp -Grant n. 732590, and EuBusinessGraph -Grant n. 732003.  ... 
dblp:conf/semweb/SpahiuMP18 fatcat:msvptafggzgl7cfsyjb4bt56iq

Effect of heuristic post-processing on knowledge graph profile patterns: cross-domain study

Gollam Rabby, Farhana Keya, Vojtēc Svátek, Renzo Arturo Alva Principe
2022 Zenodo  
and even XML data types.  ...  We experimented with post-processing the patterns returned by ABSTAT with regard to reducing the quantity of patterns and re-ranking the patterns appearing in the first positions of the frequency-ordered  ...  Given a KG in the form of a dataset and an ontology (optional), ABSTAT computes a profile which consists of a summary about the dataset content and statistics.  ... 
doi:10.5281/zenodo.6827777 fatcat:prucyhpmhbenpnmdqxlgkrq56u

TTProfiler: Computing Types and Terms Profiles of Assertional Knowledge Graphs

Lamine Diop, Arnaud Giacometti, Béatrice Markhoff, Arnaud Soulet
2021 International Joint Workshop on Semantic Web and Ontology Design for Cultural Heritage  
As more and more knowledge graphs (KG) are published in the Web, there is a need of tools for abstracting their content for their producers to verify their result, and for their consumers to use it.  ...  This implies showing the schema-level patterns instantiated in the graph, with the frequency with which they are instantiated. A profile represents this information.  ...  The closest to our proposal is ABSTAT [6] dealing with both data and schemas, with a set of abstract patterns as output, usable online.  ... 
dblp:conf/swodch/DiopGMS21 fatcat:xib2bmpjajgypaegagv7ky2qz4

Loupe - An Online Tool for Inspecting Datasets in the Linked Data Cloud

Nandana Mihindukulasooriya, María Poveda-Villalón, Raúl García-Castro, Asunción Gómez-Pérez
2015 International Semantic Web Conference  
The Linked Data initiative continues to grow making more datasets available; however, discovering the type of data contained in a dataset, its structure, and the vocabularies used still remains a challenge  ...  hindering the querying and reuse.  ...  ABSTAT [3] provides a summary of the most commonly used abstract knowledge patterns similar to the triple-patterns shown by Loupe.  ... 
dblp:conf/semweb/Mihindukulasooriya15 fatcat:jaezlbed4zaknb7ewt6qjuvfd4

Data wrangling at scale

Nikolay Nikolov, Michele Ciavotta, Flavio De Paoli
2018 Proceedings of the 12th European Conference on Software Architecture Companion Proceedings - ECSA '18  
This paper presents a subsystem of a comprehensive platform dedicated to data transformation, linking and extension of large data sets.  ...  In particular, the platform supports both design and run time aspects of the data transformation process, which is reflected in the architecture.  ...  The profiles extracted by ABSTAT describe the content of knowledge graphs, using abstraction (schema-level patterns) and statistics.  ... 
doi:10.1145/3241403.3241437 dblp:conf/ecsa/NikolovCP18 fatcat:agloraiom5eizj5islfjsxxxmy

FLUID: A Common Model for Semantic Structural Graph Summaries Based on Equivalence Relations [article]

Till Blume, Ansgar Scherp
2020 arXiv   pre-print
We abstract from these patterns and provide for the first time a formally defined common model FLUID (FLexible graph sUmmarIes for Data graphs) that allows to flexibly define structural graph summaries  ...  We show that graph summaries defined with FLUID can be computed in the worst case in 𝒪(n^2) w.r.t. n the number of edges in the data graph.  ...  For example, SchemEX [9] , ABSTAT [8] , LODeX [33, 14] , and Loupe [15] summarize RDF instances based on a common type set and common properties linking to RDF instances with the same type sets.  ... 
arXiv:1908.01528v2 fatcat:i75lscq2crd4fiuyqotqciwuse

DistLODStats: Distributed Computation of RDF Dataset Statistics

Gezim Sejdiu, Ivan Ermilov, Jens Lehmann, Mohamed Nadjib Mami
2018 Zenodo  
Nevertheless, many applications, such as data integration, nd interlinking, may not take the full advantage of the data without hav- ori statistical information about its internal structure and coverage  ...  In e are already a number of tools, which offer such statistics, providing ormation about RDF datasets and vocabularies.  ...  Acknowledgment This work was partly supported by the EU Horizon2020 projects BigDataEurope (GA no. 644564), QROWD (GA no. 723088), WDAqua (GA no. 642795) and BigDataOcean (GA no. 732310).  ... 
doi:10.5281/zenodo.3567965 fatcat:24tntp6einggrjawhjwo5c5aj4

Summarizing semantic graphs: a survey

Šejla Čebirić, François Goasdoué, Haridimos Kondylakis, Dimitris Kotzinos, Ioana Manolescu, Georgia Troullinou, Mussab Zneika
2018 The VLDB journal  
There is no single concept of RDF summary, and not a single but many approaches to build such summaries; each is better suited for some uses, and each presents specific challenges with respect to its construction  ...  The explosion in the amount of the available RDF data has lead to the need to explore, query and understand such data sources.  ...  Acknowledgments This research is implemented through IKY scholarships programme and cofinanced by the European Union and Greek national funds through the action entitled "Reinforcement of Postdoctoral  ... 
doi:10.1007/s00778-018-0528-3 fatcat:5yphgy3jajcwxbt5oxshadv4oy

Trav-SHACL: Efficiently Validating Networks of SHACL Constraints [article]

Mónica Figuera and Philipp D. Rohde and Maria-Esther Vidal
2021 arXiv   pre-print
Knowledge graphs have emerged as expressive data structures for Web data.  ...  Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine.  ...  Acknowledgements This work has been partially supported by the EU H2020 projects iASiS (No 727658) and QualiChain (No 822404), and the ERAMed project P4-LUCAT (No 53000015).  ... 
arXiv:2101.07136v1 fatcat:4ehu723265e6heupdy7q2hu6uy

A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases: The 3cixty Use Case

Nandana Mihindukulasooriya, Giuseppe Rizzo, Raphaël Troncy, Óscar Corcho, Raúl García-Castro
2016 Extended Semantic Web Conference  
In the finegrained analysis step, specific tests are developed for a particular knowledge base according to the data model and pre-defined constraints that shape the data.  ...  The main objective of this approach is to detect and to flag potential defects as early as possible in the data publishing process and to eliminate or minimize the undesirable outcomes in the applications  ...  Acknowledgments This work was partially supported by the FPI grant (BES-2014-068449), the innovation activity 3cixty (14523) of EIT Digital, and the 4V project (TIN2013-46238-C4-2-R).  ... 
dblp:conf/esws/Mihindukulasooriya16 fatcat:dcr4ma3zhzaufmibb4zevya5b4

Automatic text summarization: What has been done and what has to be done [article]

Abdelkrime Aries, Djamel eddine Zegour, Walid Khaled Hidouci
2019 arXiv   pre-print
Therefore, a summary must be short, representative and readable. Generating summaries automatically can be beneficial for humans, since it can save time and help selecting relevant documents.  ...  Summaries are important when it comes to process huge amounts of information. Their most important benefit is saving time, which we do not have much nowadays.  ...  ABSTAT 16 [Palmonari et al., 2015] produces a summary of linked data sets which make use of ontologies to describe the semantics of their data.  ... 
arXiv:1904.00688v1 fatcat:xvvjdpu3xzdsdn4piksz2s4pve
« Previous Showing results 1 — 15 out of 20 results