9,225 Hits in 3.9 sec

Dynamic data replication: an approach to providing fault-tolerant shared memory clusters

R. Christodoulopoulou, R. Azimi, A. Bilas
The Ninth International Symposium on High-Performance Computer Architecture, 2003. HPCA-9 2003. Proceedings.  
We design extensions to an existing SVM protocol that has been tuned for low-latency, high-bandwidth interconnects and SMP nodes and we achieve reliability through dynamic replication of application shared  ...  In this paper we address this problem in shared virtual memory (SVM) clusters at the programming abstraction layer.  ...  Acknowledgments We would like to thank the members of the ATHLOS project for the useful discussions during the course of this work.  ... 
doi:10.1109/hpca.2003.1183538 dblp:conf/hpca/ChristodoulopoulouAB03 fatcat:nkmdwxj5znbdzczzampbx4tdai

Parallel Data Processing in Dynamic Hybrid Computing Environment Using MapReduce [chapter]

Bing Tang, Haiwu He, Gilles Fedak
2014 Lecture Notes in Computer Science  
CPU speed, memory size and I/O bandwidth.  ...  HybridMR relies on a hybrid distributed file system called HybridDFS, and a time-out method has been used in HybridDFS to prevent volatility of desktop PCs, and file replication mechanism is used to realize  ...  Therefore, replication approach is utilized to achieve fault-tolerance. HybridDFS is designed to support large files.  ... 
doi:10.1007/978-3-319-11194-0_1 fatcat:qoeevh2ghzhavmvjqapeostady

Replication-Based Fault-Tolerance for Large-Scale Graph Processing

Peng Wang, Kaiyuan Zhang, Rong Chen, Haibo Chen, Haibing Guan
2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks  
This paper proposes Imitator, a new fault tolerance mechanism, which supports cheap maintenance of vertex states by replicating them to their replicas during normal message exchanges, and provides fast  ...  The increasing algorithm complexity and dataset sizes necessitate the use of networked machines for many graph-parallel algorithms, which also makes fault tolerance a must due to the increasing scale of  ...  Conclusion This paper presented a replication-based approach called Imitator to provide low-overhead fault tolerance and fast crash recovery.  ... 
doi:10.1109/dsn.2014.58 dblp:conf/dsn/WangZCCG14 fatcat:vfuicg3rqrf3lbivizr52ggwr4

Dynamic Data Deduplication in Cloud Storage

Waraporn Leesakul, Paul Townend, Jie Xu
2014 IEEE 8th International Symposium on Service Oriented System Engineering  
Due to this problem, many approaches and techniques have been proposed that not only provide solutions to achieve storage efficiency, but also to improve its fault tolerance.  ...  They proposed an approach to improve reliability by developing a method to weigh and measure the importance of each chunk by examining the number of data files that share the chunk, and use this weight  ... 
doi:10.1109/sose.2014.46 dblp:conf/sose/LeesakulTX14 fatcat:ptveyusctjeslkd4x2urydegtq

DistHash: A robust P2P DHT-based system for replicated objects [article]

Ciprian Dobre, Florin Pop, Valentin Cristea
2011 arXiv   pre-print
We present original solutions to achieve optimal message routing in hop-count and throughput, provide an adequate consistency approach among replicas, as well as provide a fault-tolerant substrate.  ...  In this paper we present DistHash, a P2P overlay network designed to share large sets of replicated distributed objects in the context of large-scale highly dynamic infrastructures.  ...  We presented the original adopted solutions to achieve optimal message routing in hop-count and throughput, provide an adequate consistency approach among replicas, as well as provide a fault-tolerant  ... 
arXiv:1106.5299v1 fatcat:5yvmsgaovrh35pctt5ijrcu5ti

J2EE server scalability through EJB replication

Sylvain Sicard, Noel De Palma, Daniel Hagimont
2006 Proceedings of the 2006 ACM symposium on Applied computing - SAC '06  
In this context, the JOnAS web application server provides an example of EJB replication system called CMI (Cluster Method Invocation).  ...  The aim of this paper is to compare these approaches. In a J2EE web application server, one important component is the EJB tier.  ...  Related work Resource replication (at any level: disk, process, machine, etc.) has been much more studied in the purpose to provide fault tolerance than in the goal to provide scalability.  ... 
doi:10.1145/1141277.1141455 dblp:conf/sac/SicardPH06 fatcat:sxn5mlsd4ra77hv2fajbnj3udu

Dynamic load balancing algorithm for large data flow in distributed complex networks

Zhuo Zhang
2018 Open Physics  
Information society brings convenience to people, but also produces a lot of data. Relational databases are not suitable for processing big data due to architecture defects.  ...  The most commonly used system to store and process large amounts of data is the NoSQL (Not only Structured Query Language) database.  ...  Partition fault tolerance If the data in a large-scale system exceeds the capacity of a single machine, then the system needs to consider replication to ensure reliability, load balancing and data partitioning  ... 
doi:10.1515/phys-2018-0089 fatcat:uze6lojkbfdxdj7hr3wolmyy5e

Workload Adaptive Checkpoint Scheduling of Virtual Machine Replication

Balazs Gerofi, Yutaka Ishikawa
2011 IEEE 17th Pacific Rim International Symposium on Dependable Computing  
Checkpoint-recovery based Virtual Machine (VM) replication is an emerging approach towards accommodating VM installations with high availability, especially, due to its inherent capability of tackling  ...  Our algorithm adapts dynamically to the properties of the workload being executed in the VM, such as changes in the number of dirtied memory pages, network and disk I/O operations, as well as to the network  ...  Reduction of replication data is an orthogonal approach to scheduling, and therefore it may provide further improvements.  ... 
doi:10.1109/prdc.2011.32 dblp:conf/prdc/GerofiI11 fatcat:adbeqaz5pzhxnheg5uxh6i36dm

Invalidation-Based Protocols for Replicated Datastores [article]

Antonios Katsarakis
2021 arXiv   pre-print
The multiprocessor regime uses invalidations to afford strongly consistent replication with high performance but neglects fault tolerance.  ...  The primary contribution of this thesis is in adapting invalidating protocols to the nuances of replicated datastores, which include skewed data accesses, fault tolerance, and distributed transactions.  ...  To guarantee fault tolerance and strong consistency, they replicate data and rely on replication protocols.  ... 
arXiv:2112.02405v1 fatcat:toi3wflouvbzngpdfzypw3l4li

Practical Database Replication [chapter]

Alfrânio Correia, José Pereira, Luís Rodrigues, Nuno Carvalho, Rui Oliveira
2010 Lecture Notes in Computer Science  
Chapter 6 Conclusion Shared-nothing clusters have been proposed as a cost-effective approach for fault-tolerance and scalability.  ...  Experimental results show that AKARA is a promising approach to providing both performance and fault tolerance on database clusters.  ...  Appendix A Requirements for Replication-friendly Databases A.1 Requirements To achieve modularity without losing performance, the replicator must have access to a set of features provided by the DBMSs  ... 
doi:10.1007/978-3-642-11294-2_13 fatcat:oxlujsinxzdwdplq46dxje4lhm

Database replication policies for dynamic content applications

Gokul Soundararajan, Cristiana Amza, Ashvin Goel
2006 ACM SIGOPS Operating Systems Review  
In this paper, we propose using database replication to support multiple applications on a shared cluster.  ...  Our evaluation shows that dynamic replication requires fewer resources than static partitioning or full overlap replication policies and provides over 90% latency compliance to each application under a  ...  This dynamic replication approach enables a unified approach to load management as well as fault tolerance.  ... 
doi:10.1145/1218063.1217945 fatcat:mawpwjty4nd3paqwvisdkgbucu

Database replication policies for dynamic content applications

Gokul Soundararajan, Cristiana Amza, Ashvin Goel
2006 Proceedings of the 2006 EuroSys conference on - EuroSys '06  
In this paper, we propose using database replication to support multiple applications on a shared cluster.  ...  Our evaluation shows that dynamic replication requires fewer resources than static partitioning or full overlap replication policies and provides over 90% latency compliance to each application under a  ...  This dynamic replication approach enables a unified approach to load management as well as fault tolerance.  ... 
doi:10.1145/1217935.1217945 dblp:conf/eurosys/SoundararajanAG06 fatcat:easrybeyrnfopjiupty5xbnzqy

Fault tolerant adaptive parallel and distributed simulation through functional replication

Gabriele D'Angelo, Stefano Ferretti, Moreno Marzolla
2019 Simulation modelling practice and theory  
Results from an analytical model and from an experimental evaluation show that FT-GAIA provides a high degree of fault tolerance, at the cost of a moderate increase in the computational load of the execution  ...  This paper presents FT-GAIA, a software-based fault-tolerant parallel and distributed simulation middleware.  ...  Conclusions and Future Work In this paper we described an approach to provide fault tolerance through functional replication in parallel and distributed simulations.  ... 
doi:10.1016/j.simpat.2018.09.012 fatcat:hog6ldbfivghhohjvsmwwytiky

Distributed Wisdom: Designing a Replication Service for Large Peer-to-Peer Data Grids

A.V. Srinivas, M.V. Reddy, D. Janakiram
2006 IEEE Distributed Systems Online  
The middleware (platform) components must also be adaptive to these dynamics. Data consistency. Data might be replicated for reasons of performance, fault tolerance, maintenance, and so on.  ...  P2P systems have addressed scalability and fault tolerance quite well.  ... 
doi:10.1109/mdso.2006.17 fatcat:wazo653dozhvpl24v63jijtqq4

STAR: Scaling Transactions through Asymmetric Replication [article]

Yi Lu, Xiangyao Yu, Samuel Madden
2019 arXiv   pre-print
In this paper, we present STAR, a new distributed in-memory database with asymmetric replication.  ...  STAR outperforming systems that employ conventional concurrency control and replication algorithms by up to one order of magnitude.  ...  Non-partitioned Systems A typical approach to build a fault tolerant non-partitioned system is to adopt the primary/backup model.  ... 
arXiv:1811.02059v2 fatcat:p4tixpcuefcqfkdsrkpybyttpm