High performance state-machine replication

Parisa Jalili Marandi, Marco Primi, Fernando Pedone
2011 2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN)  
State-machine replication is a well-established approach to fault tolerance. The idea is to replicate a service on multiple servers so that it remains available despite the failure of one or more servers. From a performance perspective, state-machine replication has two drawbacks. First, it introduces some overhead in service response time, due to the requirement to totally order commands. Second, service throughput cannot be augmented by adding replicas to the system. We address the two issues in this paper. We use speculative execution to reduce the response time and state partitioning to increase the throughput of state-machine replication. We illustrate these techniques with a highly available B-Tree service.
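
The throughput argument in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the key-value `Replica`, the `partition_of` rule, and the command format are all assumptions made for illustration, ordering is simulated by a plain Python list, and neither speculative execution nor commands that span partitions are modeled. The sketch only shows that under full replication every replica executes every command, whereas under state partitioning each partition executes only the commands on its own keys.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """A deterministic key-value state machine (hypothetical stand-in for the service)."""
    state: dict = field(default_factory=dict)
    applied: int = 0  # number of commands this replica has executed

    def apply(self, command):
        op, key, value = command
        if op == "put":
            self.state[key] = value
        elif op == "delete":
            self.state.pop(key, None)
        self.applied += 1


def replicate(log, n_replicas):
    """Full replication: every replica executes every command in the same total order."""
    replicas = [Replica() for _ in range(n_replicas)]
    for command in log:
        for replica in replicas:
            replica.apply(command)
    return replicas


def partition_of(key, n_partitions):
    """Assumed partitioning rule: sum of character codes modulo the partition count."""
    return sum(map(ord, key)) % n_partitions


def replicate_partitioned(log, n_partitions):
    """State partitioning: each partition executes only commands on keys it owns."""
    partitions = [Replica() for _ in range(n_partitions)]
    for command in log:
        owner = partition_of(command[1], n_partitions)
        partitions[owner].apply(command)
    return partitions


# The list order stands in for the total order agreed on by the replicas.
log = [
    ("put", "a", 1),
    ("put", "b", 2),
    ("put", "a", 3),
    ("delete", "b", None),
    ("put", "c", 4),
]

full = replicate(log, n_replicas=3)
parts = replicate_partitioned(log, n_partitions=2)

print([r.applied for r in full])   # work per replica under full replication
print([p.applied for p in parts])  # work per partition under state partitioning
```

In this sketch, adding replicas to `replicate` never reduces the work done by any single replica, while adding partitions to `replicate_partitioned` spreads the same log over more state machines. In a fault-tolerant deployment each partition would itself be replicated, which the sketch leaves out for brevity.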
doi:10.1109/dsn.2011.5958258 dblp:conf/dsn/MarandiPP11 fatcat:rbdwp5ecafdfpeonlub7dimqoy