DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining

Spiros Papadimitriou, Jimeng Sun
2008 2008 Eighth IEEE International Conference on Data Mining  
We propose the Distributed Co-clustering (DisCo) framework, which introduces practical approaches for distributed data pre-processing, and co-clustering.  ...  In particular, we focus on co-clustering, which has been studied in many applications such as text mining, collaborative filtering, bio-informatics, graph mining.  ...  This paper proposes a comprehensive Distributed Co-clustering (DisCo) solution from the raw data to the end clusters.  ... 
doi:10.1109/icdm.2008.142 dblp:conf/icdm/PapadimitriouS08 fatcat:yyab4tkyivg25f5swpo6cbzxdi

A Novel and Efficient Method for Protecting Internet Usage from Unauthorized Access Using Map Reduce

P. Srinivasa Rao, K. Thammi Reddy, MHM. Krishna Prasad
2013 International Journal of Information Technology and Computer Science  
Hadoop consists of different elements out of which Map Reduce is a scalable tool that enables to process a huge data in parallel.  ...  We proposed a Novel and Efficient User Profile Characterization under distributed environment. In this framework the network anomalies are detected by using Hadoop Map Reduce technique.  ...  Spiros Papadimitriou [11] written a paper on DisCo: Distributed Coclustering with Map-Reduce A Case Study Towards Petabyte-Scale End-to-End Mining.  ... 
doi:10.5815/ijitcs.2013.03.06 fatcat:3m4ytmmnwbfqvcklqk6abb3ati

Distributed data management using MapReduce

Feng Li, Beng Chin Ooi, M. Tamer Özsu, Sai Wu
2014 ACM Computing Surveys  
MapReduce is a framework for processing and managing large scale data sets in a distributed cluster, which has been used for applications such as generating search indexes, document clustering, access  ...  In this paper we aim to provide a comprehensive review of a wide range of proposals and systems that focus fundamentally on the support of distributed data management and processing using the MapReduce  ...  Disco supports a distributed index, called Discodex, which is distributed over the cluster nodes and stored in the DDFS.  ... 
doi:10.1145/2503009 fatcat:nxfuh67rnrhwvh3c5zxmdkyvae

Predictive Analytics On Big Data - An Overview

Gayathri Nagarajan, Dhinesh Babu L.D
2019 Informatica (Ljubljana, Tiskana izd.)  
While research works carried out continuously to handle big data is at one end, processing it to develop the business insights is a hot topic to work on the other end.  ...  This paper presents an overview on predictive analytics with big data.  ...  Using distributed platform like mapreduce prevents the over utilization of the resources. A detailed survey on map reduce technology is discussed in [6].  ... 
doi:10.31449/inf.v43i4.2577 fatcat:hqi45o6t7jb63dr3aaesink6l4

Mining Tera-Scale Graphs: Theory, Engineering and Discoveries

U Kang
In this thesis, we propose PEGASUS, a large scale graph mining system implemented on the top of the HADOOP platform, the open source version of MAPREDUCE.  ...  How do we find patterns and anomalies, on graphs with billions of nodes and edges, which do not fit in memory? How to use parallelism for such Tera- or Peta-scale graphs?  ...  Large Scale Graph Mining Given a very large graph spanning Terabytes or Petabytes, how to find patterns and anomalies?  ... 
doi:10.1184/r1/6720629.v1 fatcat:eid3eckey5fzbomvqje5emwl6a

Coimbatore-6410 4243

Tadeusz Hryniewicz, Tomasz Borowski, B Ramani, Anand Paul, Mr Kumar, Mr Ravichandran Advisor, T Devi, Latha Parameswaran, Chairperson, Ms Sampath, Ms Shaju, Mr Prashanth
s Map-Reduce-Merge.  ...  IN A MAP-REDUCE ENVIRONMENT Implementations of map-reduce are being used to perform many operations on very large data.  ...  LOCAL vs EXTENDED An attacker who takes control of various base stations or vehicles in a local network is said to be a local attacker, whereas an extended attacker takes control of a variety of base  ...