Using R for Iterative and Incremental Processing

Shivaram Venkataraman, Indrajit Roy, Alvin AuYoung, Robert S. Schreiber
2012 USENIX Workshop on Hot Topics in Cloud Computing  
It is cumbersome to write complex machine learning and graph algorithms in existing data-parallel models like MapReduce. Many of these algorithms are, by nature, iterative and perform incremental computations, neither of which are efficiently supported by current frameworks. We argue that array-based languages, like R [1], are ideal to express these algorithms, and we should extend these languages for processing in the cloud. In this paper we present the challenges and abstractions to extend R.
more » ... Early results show that many computations are an order of magnitude faster than processing in Hadoop.
dblp:conf/hotcloud/VenkataramanRAS12 fatcat:mgpn3xv6anh4nnu3uuqvptx7um