The state of SQL-on-Hadoop in the cloud

Nicolas Poggi, Josep Ll. Berral, Thomas Fenech, David Carrera, Jose Blakeley, Umar Farooq Minhas, Nikola Vujic
2016 2016 IEEE International Conference on Big Data (Big Data)  
We report experiments with 5 real brands in which more than 1 million real images were analyzed. In order to speed-up the training of custom CNNs we applied a transfer learning strategy.  ...  Some of the CNNs perform generic object recognition tasks while others perform what we call visual brand identity recognition.  ...  ACKNOWLEDGEMENTS This work is partially supported by the Spanish Ministry of Economy and Competitivity under contract TIN2015-65316-P and by the SGR programme (2014-SGR-1051) of the Catalan Government.  ... 
doi:10.1109/bigdata.2016.7840751 dblp:conf/bigdataconf/PoggiBFCBMV16 fatcat:vf4hkq3dtrhu7dvby7kdrpo6c4

SQLMR : A Scalable Database Management System for Cloud Computing

Meng-Ju Hsieh, Chao-Rui Chang, Li-Yung Ho, Jan-Jan Wu, Pangfeng Liu
2011 2011 International Conference on Parallel Processing  
On the other hand, traditional SQL-based data processing is familiar to user but is limited in scalability.  ...  As the size of data set in cloud increases rapidly, how to process large amount of data efficiently has become a critical issue.  ...  ACKNOWLEDGEMENTS This work is supported in part by National Science Council of Taiwan under grant number NSC-99-2218-E-001-009.  ... 
doi:10.1109/icpp.2011.54 dblp:conf/icpp/HsiehCHWL11 fatcat:47zwoz2ringx5g7w3eopk6z6a4

M2M: A simple Matlab-to-MapReduce translator for cloud computing

Junbo Zhang, Dong Xiang, Tianrui Li, Yi Pan
2013 Tsinghua Science and Technology  
However, SQL-to-MapReduce translators mainly focus on SQL-like queries, but not on numerical computation.  ...  Recently, some SQL-to-MapReduce translators have emerged to translate SQL-like queries to MapReduce code and have good performance in cloud systems.  ...  Acknowledgements This work was partially supported by the National Natural Science Foundation of China (Nos. 61175047, 61100117, and 61202043) and the US National Science Foundation (No. OCI-1156733).  ... 
doi:10.1109/tst.2013.6449402 fatcat:kcazalivyneirbmxso3eh3ylsq

An Experimental and Comparative Benchmark Study Examining Resource Utilization in Managed Hadoop Context [article]

Uluer Emre Ozdil, Serkan Ayvaz
2021 arXiv   pre-print
In the study, we aimed to understand managed Hadoop context in terms of resource utilization.  ...  We utilized three experimental Hadoop-on-PaaS proposals as they come out-of-the-box and conducted Hadoop specific workloads of the HiBench Benchmark Suite.  ...  Poggi, N., Berral, J.L., Fenech, T., Carrera, D., Blakeley, J., Minhas, U.F., Vujic, N.: The state of SQL-on-Hadoop in the cloud.  ... 
arXiv:2112.10134v1 fatcat:yuc2gdfd6vbdbnyod2dspmzlfy

Review on the Cloud Computing Programming Model

Chao Shen, Weiqin Tong
2014 International Journal of Advanced Science and Technology  
This paper first analyzes the problems that a cloud computing programming model needs to solve, and then analyzes the characteristics of the cloud computing programming model.  ...  The cloud computing data center is usually composed of thousands of commercial computers, and these computers are connected by a network.  ...  Acknowledgements This work is supported by Innovation Action Plan supported by Science and Technology Commission of Shanghai Municipality (No.11511500200).  ... 
doi:10.14257/ijast.2014.70.02 fatcat:etcuk75oczgabewmzo62sokii4

On managing geospatial big-data in emergency management

Kuien Liu, Yandong Yao, Danhuai Guo
2015 Proceedings of the 1st ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management - EM-GIS '15  
In order to achieve scalability, a number of solutions on big geospatial data management are proposed in recent years.  ...  The purpose of this paper is not only focused on how to program a geospatial data storage platform but also on how to approve the rationality of geospatial big data system that we plan to build.  ...  SQL-on-Hadoop is not yet mature SQL-on-Hadoop technologies have been drawing considerable attention in the big data analytics area of late.  ... 
doi:10.1145/2835596.2835614 dblp:conf/gis/LiuYG15 fatcat:fie2yx5izfb6xl4ljio7zgtmbi

Federated cloud-based big data platform in telecommunications

Chao Deng, Ling Qian, Meng Xu, Yujian Du, Zhiguo Luo, Shaoling Sun
2012 Proceedings of the 2012 workshop on Cloud services, federation, and the 8th open cirrus summit - FederatedClouds '12  
To provide better service to 600 million customers and reduce the cost of IT systems, China Mobile adopted a centralized IT strategy based on cloud computing.  ...  The big data issue becomes the most significant challenge to the cloud computing based China Mobile IT structure. This paper presents the China Mobile's big data platform based on the cloud.  ...  Big Cloud includes a series of systems, supporting IaaS, PaaS and SaaS. The infrastructure of Big Cloud is based on commercial PC server cluster and Hadoop platform.  ... 
doi:10.1145/2378975.2378987 fatcat:xqkldvzrxrfgroypjhxxtzhiye

GENMR: Generalized Query Processing through Map Reduce In Cloud Database Management System [article]

Shweta Malhotra, Mohammad Najmud Doja, Bashir Alam, Mansaf Alam
2016 arXiv   pre-print
For all the new techniques one common thing is that they deal with Data, not just Data but the Big Data. Users store their various kinds of data on cloud repositories.  ...  Big Data, Cloud computing, Cloud Database Management techniques, Data Science and many more are the fantasizing words which are the future of IT industry.  ...  HIVE [16] is considered as SQL-Like language in which users put their queries in SQL form and with the help of Hadoop framework their queries internally gets converted into MapReduce and users without  ... 
arXiv:1603.08102v1 fatcat:osc4s6dgzvce3ecncfruqqfed4

Authorization of Data In Hadoop Using Apache Sentry

N Sirisha, K V.D. Kiran
2018 International Journal of Engineering & Technology  
However, it is a significant target in the Hadoop system, security model was not designed and became the major drawback of Hadoop software.  ...  With the importance of Hadoop in today's enterprises, there is also an increasing trend in providing high security features in enterprises.  ...  When the machines are in OFF state, the data will be available at rest. Some of them include data on hard drives, flash drives and USB.  ... 
doi:10.14419/ijet.v7i3.6.14978 fatcat:6dg5be2r3zaerphoi2dp7fetjy

Using Big Data in the Academic Environment

Banica Logica, Radulescu Magdalena
2015 Procedia Economics and Finance  
Also, we have designed a three-step system architecture for a consortium of universities, based on actual software solutions, having the purpose to analyze, organize and access huge data sets in the Cloud  ...  In order to efficiently analyze this large quantity of raw data, the concept of Big Data was introduced.  ...  A model for Big Learning Data on Cloud architecture. Second level - Attaching Domain/Subdomain Metadata to the NoSQL and SQL databases. First level - Gathering and Processing data with Hadoop Apache.  ... 
doi:10.1016/s2212-5671(15)01712-8 fatcat:4bgdnzdcgndkpdpldowseaulsy

A Study of SQL-on-Hadoop Systems [chapter]

Yueguo Chen, Xiongpai Qin, Haoqiong Bian, Jun Chen, Zhaoan Dong, Xiaoyong Du, Yanjie Gao, Dehai Liu, Jiaheng Lu, Huijie Zhang
2014 Lecture Notes in Computer Science  
This leads to the quick emergence of dozens of SQL-on-Hadoop systems that try to support interactive SQL query processing to the data stored in HDFS.  ...  Then we test and compare the performance of five representative SQL-on-Hadoop systems, based on some queries selected or derived from the TPC-DS benchmark.  ...  Even though, we find that the existing SQL-on-Hadoop systems have benefited a lot from the application of many state-of-the-art parallel query processing techniques (such as columnar storage, MPP architecture  ... 
doi:10.1007/978-3-319-13021-7_12 fatcat:sjmcohh6y5ga7box3bzsou3zvq

REX: Recursive, Delta-Based Data-Centric Computation [article]

Svilen R. Mihaylov, Zachary G. Ives, Sudipto Guha
2012 arXiv   pre-print
We seek to unify the strengths of both styles of platforms, with a focus on supporting iterative computations in which changes, in the form of deltas, are propagated from iteration to iteration, and state  ...  DBMSs that support recursive SQL are more efficient in that they propagate only the changes in each step -- but they still accumulate each iteration's state, even if it is no longer useful.  ...  HadoopDB [1] is a hybrid of Hadoop (MapReduce) and PostgreSQL: queries are posed in a variant of SQL based on Hive, the basic core of Hadoop manages the computations, but the computations are partly  ... 
arXiv:1208.0089v1 fatcat:kc55jzf3e5fqlme5xxwkrqplu4

REX

Svilen R. Mihaylov, Zachary G. Ives, Sudipto Guha
2012 Proceedings of the VLDB Endowment  
We seek to unify the strengths of both styles of platforms, with a focus on supporting iterative computations in which changes, in the form of deltas, are propagated from iteration to iteration, and state  ...  DBMSs that support recursive SQL are more efficient in that they propagate only the changes in each step -- but they still accumulate each iteration's state, even if it is no longer useful.  ...  HadoopDB [1] is a hybrid of Hadoop (MapReduce) and PostgreSQL: queries are posed in a variant of SQL based on Hive, the basic core of Hadoop manages the computations, but the computations are partly  ... 
doi:10.14778/2350229.2350246 fatcat:pfzv4hkikjgjzcwwt5hv6gg4vi

BINARY: A framework for big data integration for ad-hoc querying

Azadeh Eftekhari, Farhana Zulkernine, Patrick Martin
2016 2016 IEEE International Conference on Big Data (Big Data)  
Our approach is validated with a proof-of-concept prototype implemented on the OpenStack cloud system.  ...  It has features such as a SQL-like query language, a Metastore to hold metadata and file formats to support access to various frameworks on Hadoop and beyond.  ...  Also, some of the SQL-on-Hadoop approaches used in analytical platforms are presented.  ... 
doi:10.1109/bigdata.2016.7840922 dblp:conf/bigdataconf/EftekhariZM16 fatcat:4jd5ycet45df7bhxt76crevzlm

PROPOSING A NEW GRADUATE DEGREE IN DATA ENGINEERING AND ANALYTICS

2019 Issues in Information Systems  
In the state of Utah, over 6,996 technology companies that employ over 143,000 people paying average salaries 88% higher than the average Metropolitan Statistical Areas (MSA) wage (CompTIA, 2019).  ...  in Python • Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform unstructured data using Spark and ML APIs Pre-requisites: Google Cloud Platform Big Data & Machine Learning Fundamentals  ... 
doi:10.48009/1_iis_2019_157-167 fatcat:hyti4qupbrcfxezge7bf7st5ly