Moving Hadoop to the cloud for big data analytics

Irina Astrova, Arne Koschel, Felix Heine, Ahto Kalja
2020
Hadoop is a Java-based open source programming framework, which supports the processing and storage of large volumes of data sets in a distributed computing environment. On the other hand, an overwhelming majority of organizations are moving their big data processing and storing to the cloud to take advantage of cost reduction – the cloud eliminates the need for investing heavily in infrastructures, which may or may not be used by organizations. This paper shows how organizations can alleviate
more » ... ome of the obstacles faced when trying to make Hadoop run in the cloud.
doi:10.25968/opus-1555 fatcat:dxbtgqrgk5gilptv2oo55zgs6m