Filters








2 Hits in 0.68 sec

Apache hadoop goes realtime at Facebook

Dhruba Borthakur, Samuel Rash, Rodrigo Schmidt, Amitanand Aiyer, Jonathan Gray, Joydeep Sen Sarma, Kannan Muthukkaruppan, Nicolas Spiegelberg, Hairong Kuang, Karthik Ranganathan, Dmytro Molkov, Aravind Menon
2011 Proceedings of the 2011 international conference on Management of data - SIGMOD '11  
Facebook recently deployed Facebook Messages, its first ever user-facing application built on the Apache Hadoop platform. Apache HBase is a database-like layer built on Hadoop designed to support billions of messages per day. This paper describes the reasons why Facebook chose Hadoop and HBase over other systems such as Apache Cassandra and Voldemort and discusses the application's requirements for consistency, availability, partition tolerance, data model and scalability. We explore the
more » ... ments made to Hadoop to make it a more effective realtime system, the tradeoffs we made while configuring the system, and how this solution has significant advantages over the sharded MySQL database scheme used in other applications at Facebook and many other web-scale companies. We discuss the motivations behind our design choices, the challenges that we face in day-to-day operations, and future capabilities and improvements still under development. We offer these observations on the deployment as a model for other companies who are contemplating a Hadoop-based solution over traditional sharded RDBMS deployments.
doi:10.1145/1989323.1989438 dblp:conf/sigmod/BorthakurGSMSKRMMRSA11 fatcat:gnicex2fwzbotcpmlaqhz7k2mm

Data warehousing and analytics infrastructure at facebook

Ashish Thusoo, Zheng Shao, Suresh Anthony, Dhruba Borthakur, Namit Jain, Joydeep Sen Sarma, Raghotham Murthy, Hao Liu
2010 Proceedings of the 2010 international conference on Management of data - SIGMOD '10  
We would like to thank Scott Chen, Dmytro Molkov and Rodrigo Schmidt for contributing a number of enhancements to Hadoop -including htop, resource aware scheduling, dynamic clouds, Hadoop RAID etc.  ... 
doi:10.1145/1807167.1807278 dblp:conf/sigmod/ThusooSABJSML10 fatcat:fby2qni47belhnx5lgfuk5lh2y