A Survey of Heterogeneous Information Network Analysis [article]

Chuan Shi, Yitong Li, Jiawei Zhang, Yizhou Sun, Philip S. Yu
2015 arXiv   pre-print
We will introduce basic concepts of heterogeneous information network analysis, examine its developments on different data mining tasks, discuss some advanced topics, and point out some future research  ...  Recently, more and more researchers begin to consider these interconnected, multi-typed data as heterogeneous information networks, and develop structural analysis approaches by leveraging the rich semantic  ...  [38] introduce ComClus to promote clustering and ranking performance by applying star schema network with self loop to combine the heterogeneous and homogeneous information.  ... 
arXiv:1511.04854v1 fatcat:n2k3sulq3fbq3e34lrfrv3uoou

Graph-based ETL Processes for Warehousing Statistical Open Data

Alain Berro, Imen Megdiche, Olivier Teste
2015 Proceedings of the 17th International Conference on Enterprise Information Systems  
In the third step, system interacts with users to incrementally transform the integrated RDF graph into a multidimensional schema.  ...  But extracting structures, integrating and defining multidimensional schema from several scattered and heterogeneous tables in the SOD are major problems challenging the traditional ETL (Extract-Transform-Load  ...  define incrementally the multidimensional schema from visual graphs.  ... 
doi:10.5220/0005363302710278 dblp:conf/iceis/BerroMT15 fatcat:4fja6olnvvgr5hxgl2jpo75era

Entity-aware query processing for heterogeneous data with uncertainty and correlations

Ekaterini Ioannou
2009 Proceedings of the 2009 EDBT/ICDT Workshops on - EDBT/ICDT '09  
Many modern systems rely on rich heterogeneous data that has been integrated from a variety of different applications and sources.  ...  My work focuses on addressing this requirement through a new approach for entity-aware query processing over heterogeneous data.  ...  [5] identify the different properties on which the efficiency of such algorithm depends on, and introduce different algorithms to address the possible combinations of the found properties.  ... 
doi:10.1145/1698790.1698818 dblp:conf/edbtw/Ioannou09 fatcat:4tugusglazctbldnwlryizoc7a

Graph Summarization [article]

Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis
2020 arXiv   pre-print
One method for condensing and simplifying such datasets is graph summarization.  ...  As this problem is common to several areas studying graph topologies, different approaches, such as clustering, compression, sampling, or influence detection, have been proposed, primarily based on statistical  ...  The proposed algorithms are time linear in the size of the input graph and incremental.  ... 
arXiv:2004.14794v3 fatcat:4g4l3exin5dxpoe6pdggbtcory

End-to-End Entity Resolution for Big Data: A Survey [article]

Vassilis Christophides, Vasilis Efthymiou, Themis Palpanas, George Papadakis, Kostas Stefanidis
2020 arXiv   pre-print
One of the most important tasks for improving data quality and the reliability of data analytics results is Entity Resolution (ER).  ...  aspects of entity indexing and matching methods in order to cope with more than one of the Big Data characteristics simultaneously.  ...  Fast algorithms are also required to incrementally cluster the graph formed by the matched entities in a way that approximates the optimal performance of correlation clustering [77].  ... 
arXiv:1905.06397v3 fatcat:rs2qoolz2jcppklriew5pjfefq

Data Mining-based Fragmentation of XML Data Warehouses [article]

Hadj Mahboubi
2008 arXiv   pre-print
We experimentally compare its efficiency to classical derived horizontal fragmentation algorithms adapted to XML data warehouses and show its superiority.  ...  With the multiplication of XML data sources, many XML data warehouse models have been proposed to handle data heterogeneity and complexity in a way relational data warehouses fail to achieve.  ...  ACKNOWLEDGMENTS The authors would like to thank Houssem Aissa, Anouar Benzakour, Kevin du Repaire and Hamza El Kartite, who participated in coding our approach in Java.  ... 
arXiv:0811.0741v1 fatcat:mxfgybhr45buffdol72gnlw4uu

Data mining-based fragmentation of XML data warehouses

Hadj Mahboubi, Jérôme Darmont
2008 Proceeding of the ACM 11th international workshop on Data warehousing and OLAP - DOLAP '08  
We experimentally compare its efficiency to classical derived horizontal fragmentation algorithms adapted to XML data warehouses and show its superiority.  ...  With the multiplication of XML data sources, many XML data warehouse models have been proposed to handle data heterogeneity and complexity in a way relational data warehouses fail to achieve.  ...  ACKNOWLEDGMENTS The authors would like to thank Houssem Aissa, Anouar Benzakour, Kevin du Repaire and Hamza El Kartite, who participated in coding our approach in Java.  ... 
doi:10.1145/1458432.1458435 dblp:conf/dolap/MahboubiD08 fatcat:zeypfx4hq5c5nis3sh44tf6oem

Multityped Community Discovery in Time-Evolving Heterogeneous Information Networks Based on Tensor Decomposition

Jibing Wu, Lianfei Yu, Qun Zhang, Peiteng Shi, Lihua Liu, Su Deng, Hongbin Huang
2018 Complexity  
However, they assume that heterogeneous information networks usually follow some simple schemas, such as bityped network and star network schema.  ...  Experimental results on both synthetic and real-world datasets demonstrate the efficiency of our framework.  ...  Acknowledgments This study was supported by the National Science Foundation of China (no. 61401482 and no. 61401483).  ... 
doi:10.1155/2018/9653404 fatcat:4xmuxejlhrd3hh3izz4suajjj4

Efficient query construction for large scale data

Elena Demidova, Xuan Zhou, Wolfgang Nejdl
2013 Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '13  
As these databases are intended to accommodate heterogeneous information and knowledge, they usually comprise a very large schema and billions of instances.  ...  Browsing and searching data on such a scale is not an easy task for a Web user.  ...  [16] , these algorithms not scale on large database schemas.  ... 
doi:10.1145/2484028.2484078 dblp:conf/sigir/DemidovaZN13 fatcat:q4wvos57q5aitll5sbfzxwcs5m

Towards scalable RDF graph analytics on MapReduce

Padmashree Ravindra, Vikas V. Deshpande, Kemafor Anyanwu
2010 Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud - MDAC '10  
the closure of the related graph and hence perform efficient reasoning using the resultant ordering of inferring rules.  ...  Yet another approach optimizes multi-way joins [10] by providing strategies to efficiently partition and replicate the tuples of a relation on reducer processes in a way that minimizes the communication  ...  Related Work Traditional OLAP systems support efficient analytical querying, but they focus on structured data that has been suitably organized in star or snowflake schema.  ... 
doi:10.1145/1779599.1779604 fatcat:tvk3s4hhhrazbo5gxn4i4pus44

Predictive Performance Comparison Analysis of Relational & NoSQL Graph Databases

Wisal Khan, Ejaz Ahmed, Waseem Shahzad
2017 International Journal of Advanced Computer Science and Applications  
In this paper we will compare Oracle relational database and NoSQL graph database using optimized queries and physical database tuning techniques.  ...  Relational databases cannot process properly and manage such large amount of data efficiently.  ...  Clydesdale follows many techniques such as columnar storage, star join and block iteration. Clydesdale is suitable when the workloads fit the data as star schema.  ... 
doi:10.14569/ijacsa.2017.080564 fatcat:bxy5sn3u6nazhiseyqiz3elsba

Workload matters

Güneş Aluç, M. Tamer Özsu, Khuzaima Daudjee
2014 Proceedings of the VLDB Endowment  
The Resource Description Framework (RDF) is a standard for conceptually describing data on the Web, and SPARQL is the query language for RDF.  ...  Existing systems are workload-oblivious, and are therefore unable to provide consistently good performance. We propose a vision for a workload-aware and adaptive system.  ...  Clustering algorithms used in conventional database design are not suitable for runtime execution; clustering is NP-hard and approximations have quadratic complexity [14].  ... 
doi:10.14778/2732951.2732957 fatcat:xg7vimkow5hkxg5kquvhls465a

Warehousing complex data from the Web [article]

Omar Boussaid, Sabine Loudcher
2017 arXiv   pre-print
and data mining techniques.  ...  Our approach includes the integration of complex data in an ODS, under the form of XML documents; their dimensional modeling and storage in an XML data warehouse; and their analysis with combined OLAP  ...  Robert Wrembel and Prof. Jaroslav Pokorný, the editors, for inviting them to publish an article in this special issue.  ... 
arXiv:1701.00398v1 fatcat:64yhgypd4fdlrhy7gwobtljs7y

Warehousing complex data from the web

O. Boussaid, J. Darmont, F. Bentayeb, S. Loudcher
2008 International Journal of Web Engineering and Technology  
Our approach includes the integration of complex data in an ODS, in the form of XML documents; their dimensional modelling and storage in an XML data warehouse; and their analysis with combined OLAP and  ...  Data warehousing and Online Analytical Processing (OLAP) technologies are now moving onto handling complex data that mostly originate from the web.  ...  Robert Wrembel and Prof. Jaroslav Pokorný, the editors, for inviting them to publish an article in this special issue.  ... 
doi:10.1504/ijwet.2008.019942 fatcat:ikqrnhgh7jflrjgne5dsa4tgu4

Impacts of climate change in monetary terms? Issues for developing countries

H. Asbjorn Aaheim
2002 International Journal of Global Environmental Issues  
doi:10.1504/ijgenvi.2002.002401 fatcat:e2uikuijxzh4dkq7ktkzavq3qu