A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2012; you can also visit the original URL.
The file type is application/pdf
.
Filters
Mining block correlations to improve storage performance
2005
ACM Transactions on Storage
In this article, we propose two algorithms, C-Miner and C-Miner*, that use a data mining technique called frequent sequence mining to discover block correlations in storage systems. ...
C-Miner is a direct application of a frequent-sequence mining algorithm with a few modifications; compared with C-Miner, C-Miner* is redesigned for mining block correlations by making concessions for the ...
ACKNOWLEDGMENTS We appreciate Kimberly Keeton from HP labs for the constructive discussion and thank HP Storage System Labs for providing us Cello traces. ...
doi:10.1145/1063786.1063790
fatcat:xnzli6qgtneprhwaxsfcpsn6u4
Mining based file caching in hybrid system
2010
2010 10th International Conference on Intelligent Systems Design and Applications
The objective of this research is to expedite the application launch latency in legacy Operating System. We apply data mining algorithm to extract the correlated file access pattern. ...
In this work, we propose mining based file caching scheme for hybrid storage with NAND Flash and Hard disk drive. ...
The main idea of C-Miner is to discover block correlations in storage systems. One can map a block to a file, and an access sequence to file access in the storage device. ...
doi:10.1109/isda.2010.5687025
dblp:conf/isda/HongKW10
fatcat:p4qe5me5sbacfausvrp37p2d6y
AMP: An Affinity-Based Metadata Prefetching Scheme in Large-Scale Distributed Storage Systems
2008
2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID)
In distributed storage systems, prefetching for metadata files is critical for the overall system performance. ...
In this paper, an Affinity-based Metadata Prefetching (AMP) scheme is proposed for metadata servers in large-scale distributed storage systems to provide aggressive metadata prefetching. ...
[30] proposed C-Miner using data mining techniques to find block correlations on storage server to direct prefetching. ...
doi:10.1109/ccgrid.2008.117
dblp:conf/ccgrid/LinLJZT08
fatcat:nyumvi3rdvf3nfkg2e7stqsjeu
MITHRIL: Mining Sporadic Associations for Cache Prefetching
[article]
2017
arXiv
pre-print
Existing approaches to history-based prefetching, in particular, provide too few benefits for real systems for the resources they cost. ...
Through evaluation of 135 block-storage traces, we show that MITHRIL is effective, giving an average of a 55% hit ratio increase over LRU and PROBABILITY GRAPH, a 36% hit ratio gain over AMP at reasonable ...
C-Miner [18] and QuickMine [23] employ this technique to discover block correlations in storage systems. However, precise data mining technique comes with high overhead. ...
arXiv:1705.07400v1
fatcat:zoj7chgrore7poh62jqfeqlrwu
Using machine learning techniques to enhance the performance of an automatic backup and recovery system
2010
Proceedings of the 3rd Annual Haifa Experimental Systems Conference on - SYSTOR '10
A typical disaster recovery system will have mirrored storage at a site that is geographically separate from the main operational site. ...
In both cases we significantly improve the performance of the system. ...
Basic algorithms that are easy to implement, and novel algorithms which are based on C-miner [8] -an algorithm that mines frequent block sequences. ...
doi:10.1145/1815695.1815697
dblp:conf/systor/PellegRR10
fatcat:gzoky7osufcldgijmpbvsqehbe
FARMER
2008
Proceedings of the 17th international symposium on High performance distributed computing - HPDC '08
, especially in large-scale distributed storage systems. ...
File correlations have become an increasingly important consideration for performance enhancement in peta-scale storage systems. ...
Acknowledgments This work is supported in part by the National Basic Research Program of China (973 Program) under Grant No. 2004CB318201, and the US NSF under Grant No. CCF-0621526. ...
doi:10.1145/1383422.1383445
dblp:conf/hpdc/XiaFJTW08
fatcat:km3kjoxzyjavzn6le4ovvwmkhm
CTDGM: A Data Grouping Model Based on Cache Transaction for Unstructured Data Storage Systems
[article]
2020
arXiv
pre-print
degradation in the performance of data access. ...
Cache prefetching technology has become the mainstream data access optimization strategy in the data centers. ...
Therefore, block layer correlations are common semantic patterns in storage systems [17] . ...
arXiv:2009.14414v1
fatcat:sd3xp36qprb6ra53jzkidfmv5y
Modeling the aging process of flash storage by leveraging semantic I/O
2014
Future generations computer systems
The analysis results in this paper provide useful insights for understanding and designing effective flash memory based storage systems. ...
Many research efforts have been invested in employing flash memory to build high performance and large-scale storage systems for data-intensive applications. ...
C-Miner [27] employs a data mining technique called frequent sequence mining to discover block correlations in storage systems. The method is very useful to exploit complex block correlations. ...
doi:10.1016/j.future.2013.09.002
fatcat:l6lidnlbibettjv77ryr6zxdbm
Exploiting the performance gains of modern disk drives by enhancing data locality
2009
Information Sciences
Based on the characteristics and the observation that data access on disk drives is highly skewed, the frequently accessed data blocks and the correlated data blocks are clustered into objects and moved ...
Disk drives have experienced dramatic development since the first disk drive was announced in 1956. This paper investigates some important characteristics of modern disk drives. ...
In addition, I am grateful to Prof. Witold Pedrycz for giving me the opportunity to clarify my thoughts. ...
doi:10.1016/j.ins.2009.02.002
fatcat:3xuj2q3pynf7tnyhtft2spmts4
Hybrid mining
2019
Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing - SAC '19
Various other mining protocols have been suggested, including proof of stake, in which a miner's chance of adding the next block is proportional to her current balance. ...
Then, miners can choose to compete in solving these problems, in lieu of Hashcash, for adding a new block. ...
Hence, creating an incentive for them to solve real-world useful problems. (1.c) Miners should have the option to use Hashcash for adding a new block, e.g. in cases when they are not able to solve any ...
doi:10.1145/3297280.3297319
dblp:conf/sac/ChatterjeeGP19
fatcat:7ey3gc43qfgmzgzixtnwektvwa
Dynamic Process Migration Based on Block Access Patterns Occurring in Storage Servers
2016
ACM Transactions on Architecture and Code Optimization (TACO)
In order to reduce the time required for I/O data exchange and data transfer in the coupled systems or other applications, this paper proposes a dynamic process migration mechanism on the basis of block ...
We first introduce the scheme of block access counting diagram, to profile the process access pattern during a time period on the storage server. ...
Zhou et al. proposed a data mining approach called C-miner to explore block correlations in the file server on a local machine. ...
doi:10.1145/2899002
fatcat:kcltvgnik5fspfg6q5quht53au
Tombolo: Performance enhancements for cloud storage gateways
2016
2016 32nd Symposium on Mass Storage Systems and Technologies (MSST)
Object-based cloud storage has been widely adopted for their agility in deploying storage with a very low up-front cost. ...
We find that when equipped with the right techniques, cloud gateways can provide competitive performance and price compared to on-premise storage. ...
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and may not reflect the views of NSF or other institutions. ...
doi:10.1109/msst.2016.7897076
dblp:conf/mss/YangSUKFZAA16
fatcat:avvx4affuzfv3dwuajxnabqlp4
Autograph
2009
ACM SIGOPS Operating Systems Review
We describe a system called Autograph that exploits this repeatability to discover files that belong to an application workflow. ...
Our approach examines traces of file accesses, finds repeated and correlated accesses, and infers which files likely belong to the same workflow. ...
C-Miner [15] is an algorithm that uses the CloSpan frequent sequence mining algorithm to discover correlations of access to data blocks in a storage system. ...
doi:10.1145/1496909.1496925
fatcat:ztdt2lq6b5emfhov5tke5eelqa
Monitoring Grassland Management Effects on Soil Organic Carbon—A Matter of Scale
2020
Agronomy
We measured changes in SOC storage over this 9-year experiment on loamy soil and investigated physicochemical soil parameters. ...
On 0.4-ha blocks, the continuous grassland increased SOC in the soil profile with highest gains in the first 30 cm, while losses were recorded under continuous cropping. ...
The contrasting SOC stock responses to fertilization may be related to strong variability of ecosystems' response to management and can be due to increased C mineralization rates following fresh organic ...
doi:10.3390/agronomy10122016
fatcat:c72nnenjkveediuxhtoxdu47aa
A self-tuning client-side metadata prefetching scheme for wide area network file systems
2021
Science China Information Sciences
Client-side metadata prefetching is commonly used in wide area network (WAN) file systems because it can effectively hide network latency. ...
It uses a directory-directed prefetching strategy to prefetch the related file metadata in the same directory, and a correlation-directed prefetching strategy to prefetch the related file metadata accessed ...
There are several sophisticated prefetching schemes for storage servers. C-Miner [7] uses frequent sequence matching on block I/O data in storage servers to benefit I/O optimization. Hsu et al. ...
doi:10.1007/s11432-019-2833-1
fatcat:h66h5astznfuxakkrtzzczhcua
« Previous
Showing results 1 — 15 out of 157 results