7,043 Hits in 3.7 sec

UDBMS: Road to Unification for Multi-model Data Management [chapter]

Jiaheng Lu, Zhen Hua Liu, Pengfei Xu, Chao Zhang
2018 Msphere  
One of the greatest challenges in big data management is the "Variety" of the data. The data may be presented in various types and formats: structured, semi-structured and unstructured.  ...  In this paper, we envision novel principles and technologies to handle multiple models of data in one unified database system, including model-agnostic storage, unified query processing and indexes, in-memory  ...  The existing results for query optimization and transaction model mainly work on a single model, either structured or semi-structured data.  ... 
doi:10.1007/978-3-030-01391-2_33 fatcat:wr63msuftjck3lvpnoa6y2r2g4

Schema Extraction on Semi-structured Data [article]

Panpan Li, Yikun Gong, Chen Wang
2021 arXiv   pre-print
Schema extraction plays an important role in understanding schemas, optimizing queries, and validating data consistency.  ...  With the continuous development of NoSQL databases, more and more developers choose to use semi-structured data for development and data management, which puts forward requirements for schema management  ...  large-scale organization, storage and analysis of semi-structured data.  ... 
arXiv:2012.08105v2 fatcat:hco64wxnrfawrk3twlff2xia3i

A Decade of XML Data Management: An Industrial Experience Report from Oracle

Zhen Hua Liu, Ravi Murthy
2009 Proceedings / International Conference on Data Engineering  
and query optimisation techniques for different XML use cases.  ...  We discuss the lessons learnt in supporting both data-centric and document-centric XMLDB applications within a single database system and the need for the implementation of different XML storage, index  ...  With schema and non-schema based XML and universal structure, value, path and relational XMLTable indexing strategies for XML, there are many cost based choices to optimize the physical design of XML for  ... 
doi:10.1109/icde.2009.18 dblp:conf/icde/LiuM09 fatcat:oecsthn7q5g5bp2y3x7vbnoq5i

Towards a physical XML independent XQuery/SQL/XML engine

Zhen Hua Liu, Sivasankaran Chandrasekar, Thomas Baby, Hui J. Chang
2008 Proceedings of the VLDB Endowment  
There are industrial efforts for building hybrid XQuery and SQL/XML engines that support both languages so that users can manage and query both relational and XML data on one platform.  ...  with relational indexes, binary XML storage with schema agnostic path-value-order key XMLIndex, SQL/XML view over relational data and relational view over XML.  ...  ACKNOWLEDGEMENTS We gratefully acknowledge the contributions of all the members of the Oracle XML DB development and product management teams.  ... 
doi:10.14778/1454159.1454177 fatcat:jdlyx66mcvhovhj67z7fy56q2a

The Snowflake Elastic Data Warehouse

Benoit Dageville, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, Spyridon Triantafyllis, Philipp Unterbrunner, Thierry Cruanes, Marcin Zukowski (+6 others)
2016 Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16  
Public cloud platforms now offer virtually unlimited compute and storage resources on demand. At the same time, the Software-as-a-Service (SaaS) model brings enterprise-class  ...  We continue to improve and extend core query processing functionality, both for standard SQL and semi-relational extensions.  ...  We are currently working on improving the data access performance by providing additional metadata structures and data re-organization tasks-with a focus on minimal to no user interaction.  ... 
doi:10.1145/2882903.2903741 dblp:conf/sigmod/DagevilleCZAABC16 fatcat:5ehnby2gvffujfsyboulqq2pxu

An LSM-based Tuple Compaction Framework for Apache AsterixDB (Extended Version) [article]

Wail Y. Alkowaileet, Sattam Alsubaiee, Michael J. Carey
2020 arXiv   pre-print
Our focus in this paper is to address the storage overhead issue by introducing a tuple compactor framework that infers and extracts the schema from self-describing semi-structured records during the data  ...  We have implemented and empirically evaluated our approach to measure its impact on storage, data ingestion, and query performance in the context of Apache AsterixDB.  ...  It was also supported by NSF awards IIS-1838248 and CNS-1925610, industrial support from Amazon, Google, Microsoft and Couchbase, and the Donald Bren Foundation (via a Bren Chair).  ... 
arXiv:1910.08185v2 fatcat:dktqsj47i5gbrn4xyoc2aullt4


2020 Al-Azhar Bulletin of Science  
Relational databases are usually used for data storage and retrieval. They are suitable for limited data volume.  ...  But when it comes to Bigdata, we need to use more flexible databases that satisfy the need to handle semi-structured and unstructured data. These databases are called NoSQL (Not only SQL) databases.  ...  Graph databases are schema-free and come up with efficient storage of semi-structured and unstructured data [15, 29] .  ... 
doi:10.21608/absb.2020.210374 fatcat:umexpeuv3veo3o5ihqhn3pcvjq

Analysis and Evaluation of Techniques for Managing Unstructured and Semi-Structured Data in a MapReduce Platform

Dina Darwish
2017 International Journal Of Engineering And Computer Science  
Since, almost most kinds of database systems are designed to manage well-structured data requiring users to design a schema before storing and querying data.  ...  Having a single data platform for managing both well-structured data, unstructured and semi-structured data is beneficial to users; this approach reduces significantly integration, migration, development  ...  The schema based on structured data may have bounded dimensions with unlimited number of elements as formal schema definition, while, schema based on unstructured and semi-structured data has unlimited  ... 
doi:10.18535/ijecs/v6i2.03 fatcat:qonsrnvrtng4fkzvqqluwcqft4

Comparative of Mediator Approach for Database Integration

Mohd Kamir Yusof, Md Yazid Mohd Saman, Wan Nor Shuhadah Wan Nik
2015 Journal of Computer Science  
Six applications which are based on mediator approach have  ...  Acknowledgment The researchers would like to thank Universiti Sultan Zainal Abidin for providing facilities and services to do this research.  ...  Ethics This article is original and contains unpublished material. The corresponding author confirms that all of the other authors have read and approved the manuscript and no ethical issues involved.  ... 
doi:10.3844/jcssp.2015.204.217 fatcat:v3xwfbhyvzfjno33l63qt3aafe

Managing XML Data to optimize Performance into Object-Relational Databases

Iuliana BOTHA
2011 Database Systems Journal  
It is detailed the possibility of storing XML data into such databases, using for exemplification an Oracle database and there are tested some optimizing techniques of the queries over XMLType tables,  ...  This paper propose some possibilities for manage XML data in order to optimize performance into object-relational databases.  ...  : adapted from [12]) CHARACTERISTIC STRUCTURED STORAGE LOB STORAGE Database schema flexibility Limited flexibility for schema changes Good flexibility for schema changes Data integrity and  ... 
doaj:237a7f5a3b404277bc90170b0aa207ea fatcat:lr6u4q4exzhutjjdqcme3ltjzy

Database storage management with object-based storage devices

Sami Iren, Steve Schlosser
2005 Proceedings of the 1st international workshop on Data management on new hardware - DAMON '05  
of schema and disk parameters yields 2D placement 2D data structure access • On-disk storage requires serialization • Access along one dimension is efficient -i.e., sequential • Records Attributes  ...  Create object for relation 2. Attach schema attribute 3.  ...  Solving the "parameter problem" • Relying on storage vendors to expose parameters is problematic -Measuring parameters is difficult and fragile (but not impossible) •  ... 
doi:10.1145/1114252.1114264 fatcat:utjwsoww75fknhsl5mrdjzjg4e

ClothoDecoupling Memory Page Layout from Storage Organization [chapter]

2004 Proceedings 2004 VLDB Conference  
Clotho creates in-memory pages individually tailored for compound and dynamically changing workloads, and enables efficient use of different storage technologies (e.g., disk arrays or MEMS-based storage  ...  Existing techniques, however, do not optimize accesses to all levels of the memory hierarchy and for all the different workloads, because each storage level uses different technology (cache, memory, disks  ...  Logic, Microsoft, Network Appliance, Oracle, Panasas, Seagate, Sun, and Veritas) for their interest, insights, feedback, and support.  ... 
doi:10.1016/b978-012088469-8/50062-0 fatcat:gcez3b6dfjentmf7d3ev3j23si

Handling Evolution in Big Data Architectures

Darja Solodovnikova, Laila Niedrite
2020 Baltic Journal of Modern Computing  
In this paper, we analyze architectures designed for Big Data processing and analysis described in the literature with the purpose to identify the most appropriate solution for the evolution problem.  ...  Data processing and analysis.  ...  Hive is used for the storage of structured source data and Kafka is used for log file data. OLAP analysis is provided by Kylin for precalculated cube data and Impala for ad-hoc queries.  ... 
doi:10.22364/bjmc.2020.8.1.02 fatcat:ghvrfrud7rcblj7a3ww3pzn2dm

UDBMS: Road to Unification for Multi-model Data Management [article]

Jiaheng Lu, Zhen Hua Liu, Pengfei Xu, Chao Zhang
2016 arXiv   pre-print
For example, semi-structured, graph and relational models are examples of data models that may be supported by a new system.  ...  UDBMS will provide several new features such as unified data model and flexible schema, unified query processing, unified index structure and cross-model transaction guarantees.  ...  Flexible schema management. Original ORDBMS assumes the perfect schema based world. Semi-structure data and unstructured data challenges ORDBMS with schema-less design.  ... 
arXiv:1612.08050v1 fatcat:7qek5sljnrbjvhuwpfsh4duj5y

JSON data management

Zhen Hua Liu, Beda Hammerschmidt, Doug McMahon
2014 Proceedings of the 2014 ACM SIGMOD international conference on Management of data - SIGMOD '14  
This SQL/JSON approach offers significant benefits to application developers as they can use one product to manage both relational data and semi-structured flexible schema data.  ...  JSON is a light-weight and flexible semi-structured data format supporting constructs common in most programming languages.  ...  We thank Vikas Arora, Andrew Witkowski for management efforts of SQL/JSON development at Oracle.  ... 
doi:10.1145/2588555.2595628 dblp:conf/sigmod/LiuHM14 fatcat:rrf7sofrfvcdplcbrapk5rb4ji
« Previous Showing results 1 — 15 out of 7,043 results