Filters








35 Hits in 3.9 sec

BlobSeer: Next-generation data management for large scale infrastructures

Bogdan Nicolae, Gabriel Antoniu, Luc Bougé, Diana Moise, Alexandra Carpen-Amarie
2011 Journal of Parallel and Distributed Computing  
The emergence of highly scalable infrastructures, e.g. for cloud computing and for petascale computing and beyond introduces additional issues for which scalable data management becomes an immediate need  ...  First, it proposes a set of principles for designing highly scalable distributed storage systems that are optimized for heavy data access concurrency.  ...  ALADDIN-G5K experimental testbed, an initiative from the French Ministry of Research through the ACI GRID incentive action, INRIA, CNRS and RENATER and other contributing partners (see http://www. grid5000.fr/ for  ... 
doi:10.1016/j.jpdc.2010.08.004 fatcat:ddnor4rosja47a3djyqgjvec2i

Bringing introspection into BlobSeer: Towards a self-adaptive distributed data management system

Alexandra Carpen-Amarie, Alexandru Costan, Jing Cai, Gabriel Antoniu, Luc Bougé
2011 International Journal of Applied Mathematics and Computer Science  
This paper discusses the requirements for an introspection layer in a data management system for large-scale distributed infrastructures.  ...  We focus on the case of BlobSeer, a large-scale distributed system for storing massive data.  ...  for large-scale distributed data management.  ... 
doi:10.2478/v10006-011-0017-y fatcat:czd6zxk3tjgxvopq6v3qbsoc5i

BlobSeer: Efficient data management for data-intensive applications distributed at large-scale

Bogdan Nicolae, Gabriel Antoniu, Luc Bouge
2010 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)  
to enable efficient data management for dataintensive applications at large scale.  ...  While this approach enables efficient fine-grain data transfers at large scale, unlike in the case of BlobSeer, data management transparency is missing, as the user must be aware of data location and manage  ... 
doi:10.1109/ipdpsw.2010.5470802 dblp:conf/ipps/NicolaeAB10 fatcat:uevzz3ze4ndvvbp5c3gun4qk34

Bringing Introspection Into the BlobSeer Data-Management System Using the MonALISA Distributed Monitoring Framework

Alexandra Carpen-Amarie, Jing Cai, Alexandru Costan, Gabriel Antoniu, Luc Bougé
2010 2010 International Conference on Complex, Intelligent and Software Intensive Systems  
This paper discusses the requirements for an introspection layer in a data-management system for large-scale distributed infrastructures.  ...  We focus on the case of BlobSeer, a large-scale distributed system for storing massive data.  ...  ACKNOWLEDGMENTS The authors thank Bogdan Nicolae for his crucial technical support regarding BlobSeer.  ... 
doi:10.1109/cisis.2010.37 dblp:conf/cisis/Carpen-AmarieCCAB10 fatcat:fyblpout2bda5c6vn5c6yucyey

BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications

Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bouge, Matthieu Dorier
2010 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)  
We substitute the original HDFS layer of Hadoop with a new, concurrency-optimized data storage layer based on the BlobSeer data management service.  ...  Thereby, the efficiency of Hadoop is significantly improved for data-intensive Map/Reduce applications, which naturally exhibit a high degree of data access concurrency.  ...  BLOBSEER AS A CONCURRENCY-OPTIMIZED FILE SYSTEM FOR HADOOP In this section we introduce BlobSeer, a system for managing massive data in a large-scale distributed context [12] .  ... 
doi:10.1109/ipdps.2010.5470433 dblp:conf/ipps/NicolaeMABD10 fatcat:2owfpp4mzjcg3pdmtxrafmmtvu

Improving the Hadoop map/reduce framework to support concurrent appends through the BlobSeer BLOB management system

Diana Moise, Gabriel Antoniu, Luc Bougé
2010 Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing - HPDC '10  
We provide support for concurrent appends by building a concurrency-optimized data storage layer based on the BlobSeer data management service.  ...  Besides, measurements with an application available with Hadoop show that the support for concurrent appends to shared file is introduced with no extra cost, whereas the number of files managed by the  ...  Instead of managing very large sets of small files, a better approach for handling such very large data sets of small pieces of data consists in packing these pieces of data together into huge files (e.g  ... 
doi:10.1145/1851476.1851596 dblp:conf/hpdc/MoiseAB10 fatcat:jr377eteevcvjncgc6b7szxk4e

BIGhybrid -- A Toolkit for Simulating MapReduce in Hybrid Infrastructures

Julio C.S. dos Anjos, Gilles Fedak, Claudio F.R. Geyer
2014 2014 International Symposium on Computer Architecture and High Performance Computing Workshop  
Cloud computing has increasingly been used as a platform for running large business and data processing applications.  ...  Merging cloud computing and desktop grids into a hybrid infrastructure can provide a feasible low-cost solution for big data analysis.  ...  BitDew-MapReduce Simulation Module BitDew [8] is a middleware for large scale data management in hybrid distributed computing infrastructures.  ... 
doi:10.1109/sbac-padw.2014.8 dblp:conf/sbac-pad/AnjosFG14 fatcat:2iz4zeicovatzfnz6fc3y64rpa

Managing Data Access on Clouds: A Generic Framework for Enforcing Security Policies

Cristina Basescu, Alexandra Carpen-Amarie, Catalin Leordeanu, Alexandru Costan, Gabriel Antoniu
2011 2011 IEEE International Conference on Advanced Information Networking and Applications  
Malicious behaviors such as Denial of Service attacks, especially when targeting large-scale data management systems, cannot be detected by typical authentication mechanisms and are responsible for drastically  ...  In this paper we propose a generic security management framework allowing providers of Cloud data management systems to define and enforce complex security policies.  ...  We integrated the proposed Security Management Framework into BlobSeer [13] , a data-management system designed for large-scale infrastructures, which addresses these requirements.  ... 
doi:10.1109/aina.2011.61 dblp:conf/aina/BasescuCLCA11 fatcat:7o4vokejwfex7msmys6zehuk3u

BIGhybrid: a simulator for MapReduce applications in hybrid distributed infrastructures validated with the Grid5000 experimental platform

Julio C. S. Anjos, Gilles Fedak, Claudio F. R. Geyer
2015 Concurrency and Computation  
The related work demonstrates BlobSeer is a DFS that manages a huge amount of data in a flat sequence of bytes called BLOBs (Binary Large Objects).  ...  The incremental update is necessary for data management in a hybrid infrastructure. BitDew-MapReduce BitDew is a middleware that exploits protocols like P2P, http, BitTorrent and ftp.  ...  Both architectures are suitable for large-scale parallel processing.  ... 
doi:10.1002/cpe.3665 fatcat:3xpufibuyzaflmmvos3mccmk2u

On the Benefits of Transparent Compression for Cost-Effective Cloud Data Storage [chapter]

Bogdan Nicolae
2011 Lecture Notes in Computer Science  
Our solution builds on BlobSeer, a distributed data management service specifically designed to sustain a high throughput for concurrent accesses to huge data sequences that are distributed at large scale  ...  for users that need to manipulate huge data sets and a large number of VM images.  ...  ALADDIN-G5K experimental testbed, an initiative from the French Ministry of Research through the ACI GRID incentive action, INRIA, CNRS and RE-NATER and other contributing partners (see http://www.grid5000.fr/ for  ... 
doi:10.1007/978-3-642-23074-5_7 fatcat:jg5sygkarbg7vmyyztjeqhx3ei

Optimizing intermediate data management in MapReduce computations

Diana Moise, Thi-Thu-Lan Trieu, Luc Bougé, Gabriel Antoniu
2011 Proceedings of the First International Workshop on Cloud Computing Platforms - CloudCP '11  
To meet this goal, we rely on a fault-tolerant, concurrencyoptimized data storage layer based on the BlobSeer data management service.  ...  Many cloud computations process large datasets.  ...  Using BlobSeer as storage for intermediate data Our approach aims at using BlobSeer as storage layer for the intermediate data generated by MapReduce applications.  ... 
doi:10.1145/1967422.1967427 fatcat:4hiyvk6yujbovmhqrjnw7aqewy

Security and Data Compression in Cloud Computing Using BlobSeer Technique

Ashwin Dhivakar
unpublished
at a large scale.  ...  Moreover shifting paradigm is the next major issues faced by IT infrastructures now a days.  ...  The public key is outsourced to everyone hence it is used for encrypting the message. The encrypted message can be decrypted only using the private key with a time limit. Key Generation Algorithm 1.  ... 
fatcat:loirfjoxffak7mjkz7u5ydfcb4

Intelligent services for Big Data science

C. Dobre, F. Xhafa
2014 Future generations computer systems  
Finally, we present a solution to handle efficient storage of context data on a large scale. The combination of these services provide support for intelligent Smart City applications, for  ...  In this paper we present our solutions designed to support next-generation Big Data applications.  ...  BlobSeer [23] is a large-scale, distributed, binary storage service.  ... 
doi:10.1016/j.future.2013.07.014 fatcat:4d5wkb5xi5cjjb433dqlpgecpa

Going back and forth

Bogdan Nicolae, John Bresnahan, Kate Keahey, Gabriel Antoniu
2011 Proceedings of the 20th international symposium on High performance distributed computing - HPDC '11  
Large-scale experiments on hundreds of nodes demonstrate excellent performance results: speedup for concurrent VM deployments ranges from a factor of 2 up to 25, with a reduction in bandwidth utilization  ...  One of those challenges is the need to deploy a large number (hundreds or even thousands) of VM instances simultaneously.  ...  When taking frequent snapshots for a large number of VMs, such approaches generate a large number of files and interdependencies among them, which are difficult to manage and which interfere with the ease-of-use  ... 
doi:10.1145/1996130.1996152 dblp:conf/hpdc/NicolaeBKA11 fatcat:g2wqnbesljghthr4tyjh24tvbq

Optimizing Multi-deployment on Clouds by Means of Self-adaptive Prefetching [chapter]

Bogdan Nicolae, Franck Cappello, Gabriel Antoniu
2011 Lecture Notes in Computer Science  
Large scale experiments under concurrency on hundreds of nodes show that introducing such a prefetching mechanism can achieve a speed-up of up to 35% when compared to simple on-demand fetching.  ...  With Infrastructure-as-a-Service (IaaS) cloud economics getting increasingly complex and dynamic, resource costs can vary greatly over short periods of time.  ...  For this purpose, we deploy BlobSeer on all of the 120 compute nodes and store the initial 2 GB large image in a striped fashion into it.  ... 
doi:10.1007/978-3-642-23400-2_46 fatcat:4dh3imc7y5e5lgm5zprasekpoe
« Previous Showing results 1 — 15 out of 35 results