A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is
We propose the Distributed Co-clustering (DisCo) framework, which introduces practical approaches for distributed data pre-processing, and co-clustering. ... In particular, we focus on co-clustering, which has been studied in many applications such as text mining, collaborative filtering, bio-informatics, graph mining. ... This paper proposes a comprehensive Distributed Co-clustering (DisCo) solution from the raw data to the end clusters. ...doi:10.1109/icdm.2008.142 dblp:conf/icdm/PapadimitriouS08 fatcat:yyab4tkyivg25f5swpo6cbzxdi
Hadoop consists of different elements out of which Map Reduce is a scalable tool that enables to process a huge data in parallel. ... We proposed a Novel and Efficient User Profile Characterization under distributed environment. In this frame work the network ano malies are detected by using Hadoop Map Reduce technique. ... Spiros Papadimitriou  written a paper on DisCo: Distributed Coclustering with Map-Reduce A Case Study Towards Petabyte-Scale End-to-End Mining. ...doi:10.5815/ijitcs.2013.03.06 fatcat:3m4ytmmnwbfqvcklqk6abb3ati
MapReduce is a framework for processing and managing large scale data sets in a distributed cluster, which has been used for applications such as generating search indexes, document clustering, access ... In this paper we aim to provide a comprehensive review of a wide range of proposals and systems that focusing fundamentally on the support of distributed data management and processing using the MapReduce ... Disco supports a distributed index, called Discodex, which is distributed over the cluster nodes and stored in the DDFS. ...doi:10.1145/2503009 fatcat:nxfuh67rnrhwvh3c5zxmdkyvae
While research works carried out continuously to handle big data is at one end, processing it to develop the business insights is a hot topic to work on the other end. ... This paper presents an overview on predictive analytics with big data. ... Using distributed platform like mapreduce prevents the over utilization of the resources. A detailed survey on map reduce technology is discussed in  . ...doi:10.31449/inf.v43i4.2577 fatcat:hqi45o6t7jb63dr3aaesink6l4
In this thesis, we propose PEGASUS, a large scale graph mining system implemented on the top of the HADOOP platform, the open source version of MAPREDUCE. ... How do we find patterns and anomalies, on graphs with billions of nodes and edges, which do not fit in memory? How to use parallelism for such Tera- or Peta-scale graphs? ... Large Scale Graph Mining Given a very large graph spanning Terabytes or Petabytes, how to find patterns and anomalies? ...doi:10.1184/r1/6720629.v1 fatcat:eid3eckey5fzbomvqje5emwl6a
s Map-Reduce-Merge. ... IN A MAP-REDUCE ENVIRONMENT Implementations of map-reduce are being used to perform many operations on very large data. ... LOCAL vs EXTENDED The attacker makes control of various base stations or any vehicles to an local network is said to be an Local attacker, whereas the extended attacker makes control of a variety of base ...fatcat:bq2eelzqujhjzfn6qevs75mdim