11 Hits in 8.6 sec

Analysis of a new intra-disk redundancy scheme for high-reliability RAID storage systems in the presence of unrecoverable errors

Ajay Dholakia, Evangelos Eleftheriou, Xiao-Yu Hu, Ilias Iliadis, Jai Menon, KK Rao
2006 Proceedings of the joint international conference on Measurement and modeling of computer systems - SIGMETRICS '06/Performance '06  
INTRA-DISK REDUNDANCY SCHEME  ...  PERFORMANCE EVALUATION Impact of Intra-Disk Redundancy on I/O Performance  ... 
doi:10.1145/1140277.1140326 dblp:conf/sigmetrics/DholakiaEHIMR06 fatcat:nfg2ionlnbfvdoc2vgm4vscrw4

Online availability upgrades for parity-based RAIDs through supplementary parity augmentations

Lei Tian, Qiang Cao, Hong Jiang, Dan Feng, Changsheng Xie, Qin Xin
2011 ACM Transactions on Storage  
The basic idea of SPA is to store and update the supplementary parity units on one or a few newly augmented spare disks for on-line RAID systems in the operational mode, thus achieving the goals of improving  ...  In this paper, we propose a simple but powerful on-line availability upgrade mechanism, Supplementary Parity Augmentations (SPA), to address the availability issue for parity-based RAID systems.  ...  As a new form of intra-disk redundancy, IPC [12] is an error recovery method that adds an additional redundancy level on top of the RAID redundancy across multiple disks by adding parity of segments  ... 
doi:10.1145/1970338.1970341 fatcat:4kemju36d5hurlgqa4jyi7ps24

Understanding latent sector errors and how to protect against them

Bianca Schroeder, Sotirios Damouras, Phillipa Gill
2010 ACM Transactions on Storage  
LSEs are a critical factor in data reliability, since a single LSE can lead to data loss when encountered during RAID reconstruction after a disk failure or in systems without redundancy.  ...  Our second contribution is an evaluation of five different scrubbing policies and five different intra-disk redundancy schemes and their potential in protecting against LSEs.  ...  We also thank Jay Wylie for sharing his insights regarding intra-disk redundancy codes and the anonymous reviewers for their useful feedback.  ... 
doi:10.1145/1837915.1837917 fatcat:m7zmijqj3bgafhueg6vk5h7vqe

A Flash-aware Intra-disk Redundancy scheme for high reliable All Flash Array

Wei Yi, Hui Xu, Qiyou Xie, Nan Li
2015 IEICE Electronics Express  
In order to combat the two types of errors, we combine the RAID-5 scheme with a Flash-aware Intra-disk Redundancy (FAIDR) scheme.  ...  For saving parity space, the ratio of redundancy in the FAIDR scheme grows with the increasing of sector errors. However, the parity handling overhead is significant.  ...  Introduction In the past few years, Flash-based SSDs have gained prevalence in enterprise storage systems for their high I/O performance.  ... 
doi:10.1587/elex.12.20150295 fatcat:zw4b37ges5bpdp7dg4l7ptouoa

Effect of Replica Placement on the Reliability of Large-Scale Data Storage Systems

Vinodh Venkatesan, Ilias Iliadis, Xiao-Yu Hu, Robert Haas, Christina Fragouli
2010 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems  
We therefore use one of the alternate measures of reliability that have been proposed in the literature, namely, the probability of data loss during rebuild in the critical mode of the system.  ...  Several systems described in the literature are designed based on the premise that minimizing the rebuild times maximizes the system reliability.  ...  ACKNOWLEDGMENT The authors would like to thank Rüdiger Urbanke of EPFL for his participation and support in the discussion of this work, and the reviewers for comments, which helped improve the presentation  ... 
doi:10.1109/mascots.2010.17 dblp:conf/mascots/VenkatesanIHHF10 fatcat:fv42ipgqjbeuhmbrwraogezwrm

Architectural Techniques to Enable Reliable and Scalable Memory Systems [article]

Prashant J. Nair
2017 arXiv   pre-print
High capacity and scalable memory systems play a vital role in enabling our desktops, smartphones, and pervasive technologies like Internet of Things (IoT).  ...  Today, memory reliability is seen as the key impediment towards using high-density devices, adopting new technologies, and even building the next Exascale supercomputer.  ...  The size of the RAID-Group determines the storage overhead for storing the parity lines, the latency for performing error correction using RAID-4, and the overall reliability of the scheme.  ... 
arXiv:1704.03991v1 fatcat:e4i5pbtuujagnprene3wwv3un4

Using complexity to protect elections

Piotr Faliszewski, Edith Hemaspaandra, Lane A. Hemaspaandra
2010 Communications of the ACM  
in which the content of a file in storage changes with no explanation or recorded errors, in state-of-the-art storage systems.  ...  The NetApp study looked at the incidence of silent storage corruption in individual disks in RAID arrays.  ...  All application material should be sent by e-mail to search@cs.appstate.edu in a single PDF file attachment or by mail to the chair of the search committee: Dr.  ... 
doi:10.1145/1839676.1839696 fatcat:hbqpm5boabe3jcpa4jcs7czf6y

Measurement of the CMB Polarization at 95 GHz from QUIET [article]

Immanuel Buder
2012 arXiv   pre-print
We also evaluated the systematic errors in the blind stage of the analysis before the result was known.  ...  Inflation, an exponential expansion in the first 10^-36s, is a promising potential explanation.  ...  These files were automatically copied to the Storage PC in the control room, which had an attached RAID with space for 10 weeks of data. We copied newly created data files to Blu-ray disks (25 GB).  ... 
arXiv:1209.1277v1 fatcat:3gg25gnzf5h3tksak7vlpsuxxa

Anomaly symptom recognition in distributed IT systems

Alexander Acker, Technische Universität Berlin, Odej Kao
2021
The number of components such as sensors, actuators, computing, storage, and network nodes, as well as a variety of service applications increases and results in IT systems of high complexity.  ...  For this purpose, artificial intelligence for IT system operations (AIOps) is being explored to improve the availability, maintainability, and reliability of IT systems.  ...  The blue bars represent the measured runtime values for the density grid comparison, while the orange bars show the density grid sizes, which is the number of non-zero cells in the density grid pattern  ... 
doi:10.14279/depositonce-14761 fatcat:jk5ttfibufewjdmo2crzmnhdii

DEPEND 2011 Committee DEPEND Advisory Chairs

Pascal Lorenz, Syed Naqvi, Sergio Hidalgo, Manuel Gil Perez, Reijo Savola, Afonso Neto, Yudistira Dwi Wardhana Asnar, Jorge Bernal Bernabé, Nicola Dragoni (+47 others)
2011 DEPEND 2011: The Fourth International Conference on Dependability   unpublished
a series of special events related to the new challenges in dependability on critical and complex information systems Most of critical activities in the areas of communications (telephone, Internet), energy  ...  We also gratefully thank the members of the DEPEND 2011 organizing committee for their help in handling the logistics and for their work that is making this professional meeting a success.  ... 
fatcat:yzdyhvwddfhu3aidycnpz742fm

Application-aware On-line Failure Recovery for Extreme-scale HPC Environments

Marc Gamell Balmana, Manish Parashar
2017 unpublished
Since these systems are expected to contain a large number of components, reliability is one of the key anticipated challenges.  ...  While the illusion of a failure-free machine, implemented either via hardware or system software strategies, is adequate for current HPC systems, they may prove too costly in future extreme-scale machines  ...  The code incorporates high order explicit central difference schemes for spatial derivatives and explicit low-storage Runge-Kutta schemes for temporal integration [25].  ... 
fatcat:vdo7onbo4vdvbclrob6ercltdi