157 Hits in 4.6 sec

Mining block correlations to improve storage performance

Zhenmin Li, Zhifeng Chen, Yuanyuan Zhou
2005 ACM Transactions on Storage  
In this article, we propose two algorithms, C-Miner and C-Miner*, that use a data mining technique called frequent sequence mining to discover block correlations in storage systems.  ...  C-Miner is a direct application of a frequent-sequence mining algorithm with a few modifications; compared with C-Miner, C-Miner* is redesigned for mining block correlations by making concessions for the  ...  ACKNOWLEDGMENTS We appreciate Kimberly Keeton from HP labs for the constructive discussion and thank HP Storage System Labs for providing us Cello traces.  ... 
doi:10.1145/1063786.1063790 fatcat:xnzli6qgtneprhwaxsfcpsn6u4

Mining based file caching in hybrid system

Sungwoo Hong, Yohwan Kim, Youjip Won
2010 2010 10th International Conference on Intelligent Systems Design and Applications  
The objective of this research is to expedite the application launch latency in legacy Operating System. We apply data mining algorithm to extract the correlated file access pattern.  ...  In this work, we propose mining based file caching scheme for hybrid storage with NAND Flash and Hard disk drive.  ...  The main idea of C-Miner is to discover block correlations in storage systems. One can map a block to a file, and an access sequence to file access in the storage device.  ... 
doi:10.1109/isda.2010.5687025 dblp:conf/isda/HongKW10 fatcat:p4qe5me5sbacfausvrp37p2d6y

AMP: An Affinity-Based Metadata Prefetching Scheme in Large-Scale Distributed Storage Systems

Lin Lin, Xuemin Li, Hong Jiang, Yifeng Zhu, Lei Tian
2008 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID)  
In distributed storage systems, prefetching for metadata files is critical for the overall system performance.  ...  In this paper, an Affinity-based Metadata Prefetching (AMP) scheme is proposed for metadata servers in large-scale distributed storage systems to provide aggressive metadata prefetching.  ...  [30] proposed C-Miner using data mining techniques to find block correlations on storage server to direct prefetching.  ... 
doi:10.1109/ccgrid.2008.117 dblp:conf/ccgrid/LinLJZT08 fatcat:nyumvi3rdvf3nfkg2e7stqsjeu

MITHRIL: Mining Sporadic Associations for Cache Prefetching [article]

Juncheng Yang, Reza Karimi, Trausti Sæmundsson, Avani Wildani, Ymir Vigfusson
2017 arXiv   pre-print
Existing approaches to history-based prefetching, in particular, provide too few benefits for real systems for the resources they cost.  ...  Through evaluation of 135 block-storage traces, we show that MITHRIL is effective, giving an average of a 55% hit ratio increase over LRU and PROBABILITY GRAPH, a 36% hit ratio gain over AMP at reasonable  ...  C-Miner [18] and QuickMine [23] employ this technique to discover block correlations in storage systems. However, precise data mining technique comes with high overhead.  ... 
arXiv:1705.07400v1 fatcat:zoj7chgrore7poh62jqfeqlrwu

Using machine learning techniques to enhance the performance of an automatic backup and recovery system

Dan Pelleg, Eran Raichstein, Amir Ronen
2010 Proceedings of the 3rd Annual Haifa Experimental Systems Conference on - SYSTOR '10  
A typical disaster recovery system will have mirrored storage at a site that is geographically separate from the main operational site.  ...  In both cases we significantly improve the performance of the system.  ...  Basic algorithms that are easy to implement, and novel algorithms which are based on C-miner [8] -an algorithm that mines frequent block sequences.  ... 
doi:10.1145/1815695.1815697 dblp:conf/systor/PellegRR10 fatcat:gzoky7osufcldgijmpbvsqehbe


Peng Xia, Dan Feng, Hong Jiang, Lei Tian, Fang Wang
2008 Proceedings of the 17th international symposium on High performance distributed computing - HPDC '08  
, especially in large-scale distributed storage systems.  ...  File correlations have become an increasingly important consideration for performance enhancement in peta-scale storage systems.  ...  Acknowledgments This work is supported in part by the National Basic Research Program of China (973 Program) under Grant No. 2004CB318201, and the US NSF under Grant No. CCF-0621526.  ... 
doi:10.1145/1383422.1383445 dblp:conf/hpdc/XiaFJTW08 fatcat:km3kjoxzyjavzn6le4ovvwmkhm

CTDGM: A Data Grouping Model Based on Cache Transaction for Unstructured Data Storage Systems [article]

Dongjie Zhu, Haiwen Du, Yundong Sun, Zhaoshuo Tian
2020 arXiv   pre-print
degradation in the performance of data access.  ...  Cache prefetching technology has become the mainstream data access optimization strategy in the data centers.  ...  Therefore, block layer correlations are common semantic patterns in storage systems [17] .  ... 
arXiv:2009.14414v1 fatcat:sd3xp36qprb6ra53jzkidfmv5y

Modeling the aging process of flash storage by leveraging semantic I/O

Yuhui Deng, Lijuan Lu, Qiang Zou, Shuqiang Huang, Jipeng Zhou
2014 Future generations computer systems  
The analysis results in this paper provide useful insights for understanding and designing effective flash memory based storage systems.  ...  Many research efforts have been invested in employing flash memory to build high performance and large-scale storage systems for data-intensive applications.  ...  C-Miner [27] employs a data mining technique called frequent sequence mining to discover block correlations in storage systems. The method is very useful to exploit complex block correlations.  ... 
doi:10.1016/j.future.2013.09.002 fatcat:l6lidnlbibettjv77ryr6zxdbm

Exploiting the performance gains of modern disk drives by enhancing data locality

Yuhui Deng
2009 Information Sciences  
Based on the characteristics and the observation that data access on disk drives is highly skewed, the frequently accessed data blocks and the correlated data blocks are clustered into objects and moved  ...  Disk drives have experienced dramatic development since the first disk drive was announced in 1956. This paper investigates some important characteristics of modern disk drives.  ...  In addition, I am grateful to Prof. Witold Pedrycz for giving me the opportunity to clarify my thoughts.  ... 
doi:10.1016/j.ins.2009.02.002 fatcat:3xuj2q3pynf7tnyhtft2spmts4

Hybrid mining

Krishnendu Chatterjee, Amir Kafshdar Goharshady, Arash Pourdamghani
2019 Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing - SAC '19  
Various other mining protocols have been suggested, including proof of stake, in which a miner's chance of adding the next block is proportional to her current balance.  ...  Then, miners can choose to compete in solving these problems, in lieu of Hashcash, for adding a new block.  ...  Hence, creating an incentive for them to solve real-world useful problems. (1.c) Miners should have the option to use Hashcash for adding a new block, e.g. in cases when they are not able to solve any  ... 
doi:10.1145/3297280.3297319 dblp:conf/sac/ChatterjeeGP19 fatcat:7ey3gc43qfgmzgzixtnwektvwa

Dynamic Process Migration Based on Block Access Patterns Occurring in Storage Servers

Jianwei Liao, François Trahay, Guoqiang Xiao
2016 ACM Transactions on Architecture and Code Optimization (TACO)  
In order to reduce the time required for I/O data exchange and data transfer in the coupled systems or other applications, this paper proposes a dynamic process migration mechanism on the basis of block  ...  We first introduce the scheme of block access counting diagram, to profile the process access pattern during a time period on the storage server.  ...  Zhou et al. proposed a data mining approach called C-miner to explore block correlations in the file server on a local machine.  ... 
doi:10.1145/2899002 fatcat:kcltvgnik5fspfg6q5quht53au

Tombolo: Performance enhancements for cloud storage gateways

Suli Yang, Kiran Srinivasan, Kishore Udayashankar, Swetha Krishnan, Jingxin Feng, Yupu Zhang, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau
2016 2016 32nd Symposium on Mass Storage Systems and Technologies (MSST)  
Object-based cloud storage has been widely adopted for their agility in deploying storage with a very low up-front cost.  ...  We find that when equipped with the right techniques, cloud gateways can provide competitive performance and price compared to on-premise storage.  ...  Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and may not reflect the views of NSF or other institutions.  ... 
doi:10.1109/msst.2016.7897076 dblp:conf/mss/YangSUKFZAA16 fatcat:avvx4affuzfv3dwuajxnabqlp4


Anna Povzner, Kimberly Keeton, Arif Merchant, Charles B. Morrey, Mustafa Uysal, Marcos K. Aguilera
2009 ACM SIGOPS Operating Systems Review  
We describe a system called Autograph that exploits this repeatability to discover files that belong to an application workflow.  ...  Our approach examines traces of file accesses, finds repeated and correlated accesses, and infers which files likely belong to the same workflow.  ...  C-Miner [15] is an algorithm that uses the CloSpan frequent sequence mining algorithm to discover correlations of access to data blocks in a storage system.  ... 
doi:10.1145/1496909.1496925 fatcat:ztdt2lq6b5emfhov5tke5eelqa

Monitoring Grassland Management Effects on Soil Organic Carbon—A Matter of Scale

Alexandra Crème, Cornelia Rumpel, Sparkle L. Malone, Nicolas P. A. Saby, Emmanuelle Vaudour, Marie-Laure Decau, Abad Chabbi
2020 Agronomy  
We measured changes in SOC storage over this 9-year experiment on loamy soil and investigated physicochemical soil parameters.  ...  On 0.4-ha blocks, the continuous grassland increased SOC in the soil profile with highest gains in the first 30 cm, while losses were recorded under continuous cropping.  ...  The contrasting SOC stock responses to fertilization may be related to strong variability of ecosystems' response to management and can be due to increased C mineralization rates following fresh organic  ... 
doi:10.3390/agronomy10122016 fatcat:c72nnenjkveediuxhtoxdu47aa

A self-tuning client-side metadata prefetching scheme for wide area network file systems

Bing Wei, Limin Xiao, Yao Song, Guangjun Qin, Jinbin Zhu, Baicheng Yan, Chaobo Wang, Zhisheng Huo
2021 Science China Information Sciences  
Client-side metadata prefetching is commonly used in wide area network (WAN) file systems because it can effectively hide network latency.  ...  It uses a directory-directed prefetching strategy to prefetch the related file metadata in the same directory, and a correlation-directed prefetching strategy to prefetch the related file metadata accessed  ...  There are several sophisticated prefetching schemes for storage servers. C-Miner [7] uses frequent sequence matching on block I/O data in storage servers to benefit I/O optimization. Hsu et al.  ... 
doi:10.1007/s11432-019-2833-1 fatcat:h66h5astznfuxakkrtzzczhcua
« Previous Showing results 1 — 15 out of 157 results