A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
A Proposal for Duplicate Data Detection in Big Data
2018
International Journal for Research in Applied Science and Engineering Technology
Big Data is now the most talked about research subject. Over the years with the internet and storage space expansions vast swaths of data are available for would be searcher. But the problem that plagues the internet storage space is that multiple copies of the same data exits. This not only degrades the search results but also concedes time. Also it prevents accurate data analysis. In order to solve these problems a novel proposal has been proposed here. Traditional data mining approaches work
doi:10.22214/ijraset.2018.3361
fatcat:5jqbsrotebcujllazwt3lazjii