IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY HADOOP BASED APPLICATION USING MULTINODE CLUSTERS

Vanshika Bhati, Meenakshi Sharma, Ajay Agarwal
unpublished
In the present era, data is considered as precious as gold by many organizations, and its management and storage are of utmost importance. Data is now generated in massive quantities every day, so storing and processing it with conventional methods such as an RDBMS is neither efficient nor effective. New approaches have therefore evolved to manage this massive amount of data, termed Big Data. Big Data is a combination of both structured and ... unstructured data. Hadoop is open source software that helps to store and process Big Data. Hadoop divides the data into blocks, stores them on different nodes, and replicates the blocks for fault tolerance. The Hadoop Distributed File System (HDFS) and MapReduce are the two key components of Hadoop; MapReduce is used to process the data. In this paper, a 3-node cluster is proposed to store a file and process the data for a word-count application.
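The word-count processing mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation and it does not use any Hadoop API: it is a single-process Python simulation of the map, shuffle, and reduce steps, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map step: emit a (word, 1) pair for each whitespace-separated word."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle step: group all emitted counts under their word (the key)."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(grouped):
    """Reduce step: sum the grouped counts to get one total per word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    return reduce_phase(shuffle_phase(mapped))


if __name__ == "__main__":
    # Hypothetical input; on a real cluster each line would come from an HDFS block.
    sample = ["big data needs hadoop", "hadoop stores big data"]
    print(word_count(sample))
```

On a real cluster the map step would run in parallel on the nodes holding each block, and the framework would perform the shuffle across the network; here all three steps run in one process so that the data flow is easy to follow.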
fatcat:rdpaltz2irdwjii76zfxc2ymkm