Filters








7 Hits in 6.3 sec

Evaluation and Analysis of GreenHDFS: A Self-Adaptive, Energy-Conserving Variant of the Hadoop Distributed File System

Rini T. Kaushik, Milind Bhandarkar, Klara Nahrstedt
2010 2010 IEEE Second International Conference on Cloud Computing Technology and Science  
We present a detailed evaluation and sensitivity analysis of an energy-conserving, highly scalable variant of the Hadoop Distributed File System (HDFS) called Green-HDFS.  ...  Detailed lifespan analysis of the files in a large-scale production Hadoop cluster at Yahoo! points at the viability of GreenHDFS. Simulation results with realworld Yahoo!  ...  The views and conclusions contained in this paper are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of NSF or the U.S. government  ... 
doi:10.1109/cloudcom.2010.109 dblp:conf/cloudcom/KaushikBN10 fatcat:5e3s6yauh5hgdatyor6qu4f7em

Effective and Efficient Web Reviews Extraction Based on Hadoop [chapter]

Jian Wan, Jiawei Yan, Congfeng Jiang, Li Zhou, Zujie Ren, Yongjian Ren
2013 Lecture Notes in Computer Science  
We design a Hadoop-based web reviews automatic extraction system. At last, we test the extraction system using the massive web reviews page sets.  ...  The experimental results show that this extraction system can achieve accuracy of more than 96%, and also can obtain a higher speedup, compared with the traditional web extraction.  ...  [13] proposes the GreenHDFS, a self-adaptive, energy-conserving variant of the HDFS. It can cut down on energy consumption of Hadoop cluster and reduce the running cost.  ... 
doi:10.1007/978-3-642-37804-1_12 fatcat:n6ti7jsml5c6lanbnkr5jycl74

Exploring Energy-Consistency Trade-Offs in Cassandra Cloud Storage System

Houssem-Eddine Chihoub, Shadi Ibrahim, Yue Li, Gabriel Antoniu, Maria S. Perez, Luc Bouge
2015 2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)  
This further analysis indicates that the uneven distribution of the load amongst different nodes also impacts the energy consumption in Cassandra.  ...  As power bills have become a substantial part of the monetary cost for operating a data-center, this paper aims to provide a clearer understanding of the interplay between consistency and energy consumption  ...  ACKNOWLEDGMENTS Experiments presented in this paper were carried out using the Grid'5000 testbed, supported by a scientific interest group hosted by Inria and including CNRS, RENATER and several Universities  ... 
doi:10.1109/sbac-pad.2015.28 dblp:conf/sbac-pad/ChihoubILAPB15 fatcat:g5i4ei5gkbd7zmujtdeviqgwiq

A comprehensive view of Hadoop research—A systematic literature review

Ivanilton Polato, Reginaldo Ré, Alfredo Goldman, Fabio Kon
2014 Journal of Network and Computer Applications  
Context: In recent years, the valuable knowledge that can be retrieved from petabyte scale datasetsknown as Big Dataled to the development of solutions to process information based on parallel and distributed  ...  Recently, the number of publications in journals and conferences about Hadoop has increased consistently, which makes it difficult for researchers to comprehend the full body of research and areas that  ...  Table A1 and A2 Table A1 Studies with implementation and/or experiments (MapReduce and data storage & manipulation categories). Appendix A.  ... 
doi:10.1016/j.jnca.2014.07.022 fatcat:4xjveqy6mrctzjc4ou7llyy4u4

Energy-Efficient Big Data Analytics in Datacenters [chapter]

Farhad Mehdipour, Hamid Noori, Bahman Javadi
2016 Advances in Computers  
Also, as the scale of the datacenter is increasingly expanding, minimizing energy consumption and operational cost is a vital concern.  ...  The volume of generated data increases by the rapid growth of Internet of Things (IoT), leading to the big data proliferation and more opportunities for data centers.  ...  Energy-Efficient DFS A solution for improving the energy efficiency of a distributed file system such as HDFS is to recast the data layout and task distribution of the file system to enable significant  ... 
doi:10.1016/bs.adcom.2015.10.002 fatcat:xpecmdmje5avvphkdrdnyv4fqi

An evaluation of deep hashing for high-dimensional similarity search on embedded data [article]

Rutuja Shivraj Pawar, Universitäts- Und Landesbibliothek Sachsen-Anhalt, Martin-Luther Universität, Gunter Saake, Gabriel Campero Durand
2019
Second, based on well-defined metrics, we experimentally evaluate the efficiency and classi-fication accuracy of LSH - Super-Bit, with a focus on the task of supervised entity resolution. T [...]  ...  In doing so, we evaluate the impact of similarity-preserving hashing on helping with data blocking and skipping for ML applications of supervised entity resolution and top-k similarity search.  ...  Distributed File System (HDD) A Distributed File System (DFS) is a le system utilized towards e cient storage and retrieval of data in a distributed client-server setup.  ... 
doi:10.25673/31719 fatcat:76okmnvxnrgyliqq3vbky3e5zq

Energy-efficient Transitional Near-* Computing

Pablo Karl Graubner, Mathematik Und Informatik, Freisleben, Bernd (Prof. Dr.)
2019
It transfers multi-mechanism transitions, a recently developed paradigm for a highly adaptable future Internet, from the field of communication systems to computing systems.  ...  a mobile System-on-a-Chip (SoC).  ...  Typically, computing instances are isolated environments The product of both (a) and (b) reflects the electromecanical efficiency, called True PUE (T P U E = P U E * SP U E).  ... 
doi:10.17192/z2019.0052 fatcat:blcx4sw2d5eljhyj35mehamaha