
Analyzing large scale genomic data on the cloud with Sparkhit

Liren Huang, Jan Krüger, Alexander Sczyrba, Inanc Birol
<span title="2017-12-15">2017</span> <i title="Oxford University Press (OUP)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/wmo54ba2jnemdingjj4fl3736a" style="color: black;">Bioinformatics</a> </i> &nbsp;
It runs 92-157 times faster than MetaSpark on metagenomic fragment recruitment and 18-32 times faster than Crossbow on data pre-processing.  ...  We analyzed 100 terabytes of data across four genomic projects in the cloud in 21 h, which includes the run times of cluster deployment and data downloading.  ...  All Amazon cloud benchmarks and applications are funded by an Amazon research grant. Conflict of Interest: none declared.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1093/bioinformatics/btx808">doi:10.1093/bioinformatics/btx808</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/29253074">pmid:29253074</a> <a target="_blank" rel="external noopener" href="https://pubmed.ncbi.nlm.nih.gov/PMC5925781/">pmcid:PMC5925781</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/avbzcngwmzdjxib67thpdirc2u">fatcat:avbzcngwmzdjxib67thpdirc2u</a> </span>

CSAM: Compressed SAM format

Rodrigo Cánovas, Alistair Moffat, Andrew Turpin
<span title="2016-08-18">2016</span> <i title="Oxford University Press (OUP)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/wmo54ba2jnemdingjj4fl3736a" style="color: black;">Bioinformatics</a> </i> &nbsp;
We thank Vadim Zalunin for helping with the CramTools usage; and Wei Shi and Jan Schröder for sharing their knowledge of the area. Conflict of Interest: none declared.  ...  Funding This work was supported by the NICTA Victorian Research Laboratory, and funded by the Australian Government as represented by the Department of Broadband, Communications and the Digital Economy  ...  Compression and decompression times were taken as the mean of ten consecutive runs for each file, after an initial run to prime the cache memory. We also computed the SD of the 10 runs.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1093/bioinformatics/btw543">doi:10.1093/bioinformatics/btw543</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/27540265">pmid:27540265</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/o7fuy22nmfephpvcl7dhqd6cb4">fatcat:o7fuy22nmfephpvcl7dhqd6cb4</a> </span>

Rose

Russell Sears, Mark Callaghan, Eric Brewer
<span title="2008-08-01">2008</span> <i title="VLDB Endowment"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a> </i> &nbsp;
It increases replication throughput by reducing sequential I/O, and enables efficient tree lookups by supporting small page sizes and doubling as an index of the values it stores.  ...  Rose avoids random I/O during replication and scans, leaving more I/O capacity for queries than existing systems, and providing scalable, real-time replication of seek-bound workloads.  ...  ACKNOWLEDGEMENTS We would like to thank Petros Maniatis, Tyson Condie, Jens Dittrich and the anonymous reviewers for their feedback. Portions of this work were performed at Intel Research, Berkeley.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/1453856.1453914">doi:10.14778/1453856.1453914</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/phivdc64ifd2ngui7ptn6r4v2m">fatcat:phivdc64ifd2ngui7ptn6r4v2m</a> </span>

Survey on Data Deduplication for Cloud Storage to Reduce Fragmentation

Reshma A., R. D.
<span title="2016-01-15">2016</span> <i title="Foundation of Computer Science"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/b637noqf3vhmhjevdfk3h5pdsu" style="color: black;">International Journal of Computer Applications</a> </i> &nbsp;
Cost and maintenance of Information backup storage system for major enterprises can be minimized by storing it on Cloud Storage.  ...  By giving each application differently and storing the associated information distinctly the overall disk usage can be enhanced to a great level.  ...  stored using checking its qualities against an index.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5120/ijca2016907942">doi:10.5120/ijca2016907942</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ii5fsud575evzm4xysfhdjoezu">fatcat:ii5fsud575evzm4xysfhdjoezu</a> </span>

Hash based disk imaging using AFF4

Michael Cohen, Bradley Schatz
<span title="">2010</span> <i title="Elsevier BV"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/mpetrfjxlffapitphr7jkntyou" style="color: black;">Digital Investigation. The International Journal of Digital Forensics and Incident Response</a> </i> &nbsp;
For larger bevies however, storing the index inline within the RDF information file is inefficient.  ...  For example when segmenting a file with a file extension such as .mp3 or .avi, it is extremely unlikely to be compressible and we can avoid spending time compressing it by dumping it as an uncompressed  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1016/j.diin.2010.05.015">doi:10.1016/j.diin.2010.05.015</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/femrout7knaz5hdsefolmwafxi">fatcat:femrout7knaz5hdsefolmwafxi</a> </span>

A Study on Various Data De-duplication Systems

Rashmi Vikraman, Abirami S
<span title="2014-05-16">2014</span> <i title="Foundation of Computer Science"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/b637noqf3vhmhjevdfk3h5pdsu" style="color: black;">International Journal of Computer Applications</a> </i> &nbsp;
For doing so, it is necessary to implement a good backup and recovery plan.  ...  Data deduplication is one such solution that discovers and removes the redundancies among the data blocks.  ...  Else the unique chunk is stored in the disk and the new fingerprint is stored in the chunk index for further process.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5120/16334-5616">doi:10.5120/16334-5616</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/dfmh3ioir5a3lf4fadjkck4bgi">fatcat:dfmh3ioir5a3lf4fadjkck4bgi</a> </span>

Optimizing positional index structures for versioned document collections

Jinru He, Torsten Suel
<span title="">2012</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR &#39;12</a> </i> &nbsp;
In this paper, we study index organization and compression techniques for such versioned full-text index structures.  ...  Thus, versioned document collections are usually stored using special differential (delta) compression techniques, and a number of researchers have recently studied how to exploit this redundancy to obtain  ...  We also thank the Internet Archive for providing access to the Ireland data set.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2348283.2348319">doi:10.1145/2348283.2348319</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/HeS12.html">dblp:conf/sigir/HeS12</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/yrfiloglrnfmvoclbxp45agep4">fatcat:yrfiloglrnfmvoclbxp45agep4</a> </span>

BEETL-fastq: a searchable compressed archive for DNA reads

L. Janin, O. Schulz-Trieglaff, A. J. Cox
<span title="2014-06-20">2014</span> <i title="Oxford University Press (OUP)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/wmo54ba2jnemdingjj4fl3736a" style="color: black;">Bioinformatics</a> </i> &nbsp;
Motivation: FASTQ is a standard file format for DNA sequencing data, which stores both nucleotides and quality scores.  ...  Results: We show that 6.6 terabytes of human reads in FASTQ format can be transformed into 1.7 terabytes of indexed files, from where we can search for 1, 10, 100, 1000 and a million of 30-mers in 3, 8  ...  The read IDs and quality scores are each dealt with in the same way: compressed with razip and augmented with an index that, for every 1024th read, stores the offset in the file at which the data associated  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1093/bioinformatics/btu387">doi:10.1093/bioinformatics/btu387</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/24950811">pmid:24950811</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/eol2df6h75faxlp6vxenk7b37e">fatcat:eol2df6h75faxlp6vxenk7b37e</a> </span>

SQL-on-Hadoop

Avrilia Floratou, Umar Farooq Minhas, Fatma Özcan
<span title="2014-08-01">2014</span> <i title="VLDB Endowment"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a> </i> &nbsp;
Both systems optimize their data ingestion via columnar storage, and promote different file formats: ORC and Parquet.  ...  Among many systems providing some SQL support over Hadoop, Hive is the first native Hadoop system that uses an underlying framework such as MapReduce or Tez to process SQL-like statements.  ...  In the reduce phase, it performs a global aggregation and stores speedup. An additional speedup of 1.2X is gained by compressing the ORC file using the snappy compression algorithm.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/2732977.2733002">doi:10.14778/2732977.2733002</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/7onmassrafh33dp3kpo4eod2jy">fatcat:7onmassrafh33dp3kpo4eod2jy</a> </span>

BEETL-fastq: a searchable compressed archive for DNA reads [article]

Lilian Janin and Ole Schulz-Trieglaff and Anthony J. Cox
<span title="2014-06-17">2014</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Motivation: FASTQ is a standard file format for DNA sequencing data which stores both nucleotides and quality scores.  ...  Results: We show that 6.6 terabytes of human reads in FASTQ format can be transformed into 1.7 terabytes of indexed files, from where we can search for 1, 10, 100, 1000, a million of 30-mers in respectively  ...  The read IDs and quality scores are each dealt with in the same way: compressed with razip and augmented with an index that, for every 1024th read, stores the offset in the file at which the data associated  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1406.4376v1">arXiv:1406.4376v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/k2n33cm34be4jpfknal5ca7eum">fatcat:k2n33cm34be4jpfknal5ca7eum</a> </span>

Optimizing I/O for Irregular Applications on Distributed-Memory Machines [chapter]

Jesús Carretero, Jaechun No, Alok Choudhary
<span title="">1999</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
The run-time library has been optimized by applying in-memory compression mechanisms to the collective I/O operations.  ...  and the associated sorting/merging steps.  ...  An index has been included into the library to store the parameters of each compressed chunk. Every time an application creates a data array, an index is associated with the array.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/3-540-49164-3_45">doi:10.1007/3-540-49164-3_45</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vg35rumawbfplf64frbkpinp3e">fatcat:vg35rumawbfplf64frbkpinp3e</a> </span>

Emerging Database Systems in Support of Scientific Data [chapter]

Per Svensson, Peter Boncz, Milena Ivanova, Martin Kersten, Niels Nes, Doron Rotem
<span title="2009-12-16">2009</span> <i title="Chapman and Hall/CRC"> Scientific Data Management </i> &nbsp;
This is followed by an example of using MonetDB for the SkyServer data, and the query processing improvements it offers.  ...  The topics discussed in this chapter include the evolution of storage structures from the 1970s till now, data compression techniques, and query processing techniques for single- and multi-variable queries  ...  In most of these algorithms, scanning a compressed transposed file is done by using the sequential read interface which retains run-compressed data.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1201/9781420069815-c7">doi:10.1201/9781420069815-c7</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ft3mckhzr5agfhopo6awmhwk7e">fatcat:ft3mckhzr5agfhopo6awmhwk7e</a> </span>

A Cloud Storage overlay to aggregate heterogeneous Cloud services

Guilherme Sperb Machado, Thomas Bocek, Michael Ammann, Burkhard Stiller
<span title="">2013</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2zaau7xitzdupllip6mf432p5m" style="color: black;">38th Annual IEEE Conference on Local Computer Networks</a> </i> &nbsp;
As opposed to P2P file sharing, where data and indices are stored on peers, PiCsMu uses Cloud storage systems for data storage, while maintaining a distributed index.  ...  The main contribution of this work is to show the feasibility to store arbitrary data in different Cloud services for private use and/or for file sharing.  ...  ACKNOWLEDGEMENTS This work was supported partially by the Smart-enIT and the FLAMINGO projects, funded by the EU FP7 Program under Contract No. FP7-2012-ICT-317846 and No.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/lcn.2013.6761296">doi:10.1109/lcn.2013.6761296</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/lcn/MachadoBAS13.html">dblp:conf/lcn/MachadoBAS13</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/isfwvhi37rblfh75ny3tw3fgky">fatcat:isfwvhi37rblfh75ny3tw3fgky</a> </span>

SAP HANA adoption of non-volatile memory

Mihnea Andrei, Surendra Vishnoi, Daniel Booss, Thomas Peh, Ivan Schreter, Werner Thesing, Mehul Wagle, Thomas Willhalm, Christian Lemke, Günter Radestock, Robert Schulze, Carsten Thiel (+4 others)
<span title="2017-08-01">2017</span> <i title="VLDB Endowment"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p6rqwwpkkjbcldejepcehaalby" style="color: black;">Proceedings of the VLDB Endowment</a> </i> &nbsp;
Non-Volatile RAM (NVRAM) is a novel class of hardware technology which is an interesting blend of two storage paradigms: byte-addressable DRAM and block-addressable storage (e.g. HDD/SSD).  ...  As we present our solutions for the NVRAM integration, we also give, as a basis, a detailed description of the relevant HANA internals.  ...  The data structures optimal for storing and processing compressed data are however not update-friendly. As an example, let us consider the dictionary.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.14778/3137765.3137780">doi:10.14778/3137765.3137780</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/te3swz4up5ettpay6kott4xv5i">fatcat:te3swz4up5ettpay6kott4xv5i</a> </span>

A Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique

Jyoti Malhotra, Priya Ghyare
<span title="2014-07-20">2014</span> <i title="Ess &amp; Ess Research Publications"> International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering </i> &nbsp;
been stored. Unfortunately, it is impractical to keep such an index in RAM, and a disk-based index with one seek per incoming chunk is far too slow.  ...  Source deduplication is useful in cloud backup that saves network bandwidth and reduces network space. Deduplication is the process by breaking up an incoming stream into relatively large segments and deduplicating  ...  For this system use different chunking methods for whole file chunking as file contain compressed file and static and dynamic chunking on uncompressed files.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.15662/ijareeie.2014.0307040">doi:10.15662/ijareeie.2014.0307040</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6ozmpr6xhbbbpm4vllxwuapvvu">fatcat:6ozmpr6xhbbbpm4vllxwuapvvu</a> </span>