A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining
2008
2008 Eighth IEEE International Conference on Data Mining
We propose the Distributed Co-clustering (DisCo) framework, which introduces practical approaches for distributed data pre-processing, and co-clustering. ...
In particular, we focus on co-clustering, which has been studied in many applications such as text mining, collaborative filtering, bio-informatics, graph mining. ...
This paper proposes a comprehensive Distributed Co-clustering (DisCo) solution from the raw data to the end clusters. ...
doi:10.1109/icdm.2008.142
dblp:conf/icdm/PapadimitriouS08
fatcat:yyab4tkyivg25f5swpo6cbzxdi
A Novel and Efficient Method for Protecting Internet Usage from Unauthorized Access Using Map Reduce
2013
International Journal of Information Technology and Computer Science
Hadoop consists of different elements out of which Map Reduce is a scalable tool that enables to process a huge data in parallel. ...
We proposed a Novel and Efficient User Profile Characterization under distributed environment. In this frame work the network ano malies are detected by using Hadoop Map Reduce technique. ...
Spiros Papadimitriou [11] written a paper on DisCo: Distributed Coclustering with Map-Reduce A Case Study Towards Petabyte-Scale End-to-End Mining. ...
doi:10.5815/ijitcs.2013.03.06
fatcat:3m4ytmmnwbfqvcklqk6abb3ati
Distributed data management using MapReduce
2014
ACM Computing Surveys
MapReduce is a framework for processing and managing large scale data sets in a distributed cluster, which has been used for applications such as generating search indexes, document clustering, access ...
In this paper we aim to provide a comprehensive review of a wide range of proposals and systems that focusing fundamentally on the support of distributed data management and processing using the MapReduce ...
Disco supports a distributed index, called Discodex, which is distributed over the cluster nodes and stored in the DDFS. ...
doi:10.1145/2503009
fatcat:nxfuh67rnrhwvh3c5zxmdkyvae
Predictive Analytics On Big Data - An Overview
2019
Informatica (Ljubljana, Tiskana izd.)
While research works carried out continuously to handle big data is at one end, processing it to develop the business insights is a hot topic to work on the other end. ...
This paper presents an overview on predictive analytics with big data. ...
Using distributed platform like mapreduce prevents the over utilization of the resources. A detailed survey on map reduce technology is discussed in [6] . ...
doi:10.31449/inf.v43i4.2577
fatcat:hqi45o6t7jb63dr3aaesink6l4
Mining Tera-Scale Graphs: Theory, Engineering and Discoveries
2018
In this thesis, we propose PEGASUS, a large scale graph mining system implemented on the top of the HADOOP platform, the open source version of MAPREDUCE. ...
How do we find patterns and anomalies, on graphs with billions of nodes and edges, which do not fit in memory? How to use parallelism for such Tera- or Peta-scale graphs? ...
Large Scale Graph Mining Given a very large graph spanning Terabytes or Petabytes, how to find patterns and anomalies? ...
doi:10.1184/r1/6720629.v1
fatcat:eid3eckey5fzbomvqje5emwl6a
Coimbatore-6410 4243
unpublished
s Map-Reduce-Merge. ...
IN A MAP-REDUCE ENVIRONMENT Implementations of map-reduce are being used to perform many operations on very large data. ...
LOCAL vs EXTENDED The attacker makes control of various base stations or any vehicles to an local network is said to be an Local attacker, whereas the extended attacker makes control of a variety of base ...
fatcat:bq2eelzqujhjzfn6qevs75mdim