A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2012; you can also visit the original URL.
The file type is application/pdf.
Evaluation and Analysis of GreenHDFS: A Self-Adaptive, Energy-Conserving Variant of the Hadoop Distributed File System
2010
2010 IEEE Second International Conference on Cloud Computing Technology and Science
We present a detailed evaluation and sensitivity analysis of an energy-conserving, highly scalable variant of the Hadoop Distributed File System (HDFS) called GreenHDFS. ...
Detailed lifespan analysis of the files in a large-scale production Hadoop cluster at Yahoo! points at the viability of GreenHDFS. Simulation results with real-world Yahoo! ...
The views and conclusions contained in this paper are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of NSF or the U.S. government ...
doi:10.1109/cloudcom.2010.109
dblp:conf/cloudcom/KaushikBN10
fatcat:5e3s6yauh5hgdatyor6qu4f7em
Effective and Efficient Web Reviews Extraction Based on Hadoop
[chapter]
2013
Lecture Notes in Computer Science
We design a Hadoop-based web reviews automatic extraction system. At last, we test the extraction system using the massive web reviews page sets. ...
The experimental results show that this extraction system can achieve accuracy of more than 96%, and also can obtain a higher speedup, compared with the traditional web extraction. ...
[13] proposes the GreenHDFS, a self-adaptive, energy-conserving variant of the HDFS. It can cut down on energy consumption of Hadoop cluster and reduce the running cost. ...
doi:10.1007/978-3-642-37804-1_12
fatcat:n6ti7jsml5c6lanbnkr5jycl74
Exploring Energy-Consistency Trade-Offs in Cassandra Cloud Storage System
2015
2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
This further analysis indicates that the uneven distribution of the load amongst different nodes also impacts the energy consumption in Cassandra. ...
As power bills have become a substantial part of the monetary cost for operating a data-center, this paper aims to provide a clearer understanding of the interplay between consistency and energy consumption ...
ACKNOWLEDGMENTS Experiments presented in this paper were carried out using the Grid'5000 testbed, supported by a scientific interest group hosted by Inria and including CNRS, RENATER and several Universities ...
doi:10.1109/sbac-pad.2015.28
dblp:conf/sbac-pad/ChihoubILAPB15
fatcat:g5i4ei5gkbd7zmujtdeviqgwiq
A comprehensive view of Hadoop research—A systematic literature review
2014
Journal of Network and Computer Applications
Context: In recent years, the valuable knowledge that can be retrieved from petabyte scale datasets, known as Big Data, led to the development of solutions to process information based on parallel and distributed ...
Recently, the number of publications in journals and conferences about Hadoop has increased consistently, which makes it difficult for researchers to comprehend the full body of research and areas that ...
Table A1 and A2 Table A1 Studies with implementation and/or experiments (MapReduce and data storage & manipulation categories).
Appendix A. ...
doi:10.1016/j.jnca.2014.07.022
fatcat:4xjveqy6mrctzjc4ou7llyy4u4
Energy-Efficient Big Data Analytics in Datacenters
[chapter]
2016
Advances in Computers
Also, as the scale of the datacenter is increasingly expanding, minimizing energy consumption and operational cost is a vital concern. ...
The volume of generated data increases by the rapid growth of Internet of Things (IoT), leading to the big data proliferation and more opportunities for data centers. ...
Energy-Efficient DFS A solution for improving the energy efficiency of a distributed file system such as HDFS is to recast the data layout and task distribution of the file system to enable significant ...
doi:10.1016/bs.adcom.2015.10.002
fatcat:xpecmdmje5avvphkdrdnyv4fqi
An evaluation of deep hashing for high-dimensional similarity search on embedded data
[article]
2019
Second, based on well-defined metrics, we experimentally evaluate the efficiency and classification accuracy of LSH - Super-Bit, with a focus on the task of supervised entity resolution. T [...] ...
In doing so, we evaluate the impact of similarity-preserving hashing on helping with data blocking and skipping for ML applications of supervised entity resolution and top-k similarity search. ...
Distributed File System (HDD) A Distributed File System (DFS) is a file system utilized towards efficient storage and retrieval of data in a distributed client-server setup. ...
doi:10.25673/31719
fatcat:76okmnvxnrgyliqq3vbky3e5zq
Energy-efficient Transitional Near-* Computing
2019
It transfers multi-mechanism transitions, a recently developed paradigm for a highly adaptable future Internet, from the field of communication systems to computing systems. ...
a mobile System-on-a-Chip (SoC). ...
Typically, computing instances are isolated environments The product of both (a) and (b) reflects the electromechanical efficiency, called True PUE (TPUE = PUE * SPUE). ...
doi:10.17192/z2019.0052
fatcat:blcx4sw2d5eljhyj35mehamaha