Filters








15,234 Hits in 11.2 sec

Characterizing Same Work Relationships in Large-Scale Digital Libraries [chapter]

Peter Organisciak, Summer Shetenhelm, Danielle Francisco Albuquerque Vasques, Krystyna Matusiak
2019 Lecture Notes in Computer Science  
These serve to contextualize the complexities of same-work alignment in digital libraries, ground future discussion around content similarity, and inform methods to better identify duplicates in large-scale  ...  They provide unique opportunities for resource discovery, but their scale and aggregated models lead to challenges presented by duplicates and variants.  ...  Large-scale DLs like the 16.7 million work HathiTrust Digital Library present great potential value in their scale and coverage but are also quite complex.  ... 
doi:10.1007/978-3-030-15742-5_40 fatcat:vjpg2x4iwza4pa33mmgsyjjn5i

Digital preservation at Big Data scales: proposing a step-change in preservation system architectures

David Maynard Gerrard, James Edward Mooney, Dave Thompson
2018 Library hi tech  
Proposing a step-change in preservation system architectures Structured Abstract Purpose To consider how Digital Preservation system architectures will support Business Analysis of large-scale collections  ...  Design / methodology / approach Architectural reviews of existing systems. Experimental surveys of large digital collections using existing Digital Preservation tools at Big Data scales.  ...  Acknowledgements We would like to thank the Polonsky Foundation for funding our research, and the other Polonsky Digital Preservation Fellows, Edith Halvarsson, Somaya Langley, Sarah Mason and Lee Pretlove  ... 
doi:10.1108/lht-06-2017-0122 fatcat:gimmullbd5hitcmq5blpxfkugq

Ink on Our Hands: Mapping the Integrated Digital Scholarship Ecosystem

Kimberly Silk
2016 Scholarly and Research Communication  
The Canadian Research Knowledge Network (CRKN) is a consortia of Canadian university libraries dedicated to expanding digital content for the academic research enterprise in Canada.  ...  The Integrated Digital Scholarship Ecosystem (IDSE) project addresses these themes by mapping activities in the Canadian digital scholarship landscape, with a view to understanding the complexity of the  ...  , " in Whistler, BC.  ... 
doi:10.22230/src.2016v7n2/3a249 fatcat:hdaaytyo5jdevhj6p2lra2v26q

Large-Scale Cover Song Detection in Digital Music Libraries Using Metadata, Lyrics and Audio Features [article]

Albin Andrew Correya, Romain Hennequin, Mickaël Arcos
2018 arXiv   pre-print
In this work, we investigate whether textual music information (such as metadata and lyrics) can be used along with audio for large-scale cover identification problem in a wide digital music library.  ...  systems in digital music libraries with metadata.  ...  The authors would also like to thank everyone in the Deezer R&D team for their valuable comments and helpful suggestions. This work was funded and supported by Deezer S.A, Paris, France.  ... 
arXiv:1808.10351v1 fatcat:awaw5wsgvzburobqcki47twc5q

A Big Data Modeling Methodology for Apache Cassandra

Artem Chebotko, Andrey Kashlev, Shiyong Lu
2015 2015 IEEE International Congress on Big Data  
With increasingly wider adoption of Cassandra for online transaction processing by hundreds of Web-scale companies, there is a growing need for a rigorous and practical data modeling approach that ensures  ...  modeling, iii) presents visual diagrams for Cassandra logical and physical data models, and iv) demonstrates a data modeling tool that automates the entire data modeling process.  ...  ACKNOWLEDGEMENTS Artem Chebotko would like to thank Anthony Piazza, Patrick McFadin, Jonathan Ellis, and Tim Berglund for their support at various stages of this effort.  ... 
doi:10.1109/bigdatacongress.2015.41 dblp:conf/bigdata/ChebotkoKL15 fatcat:6esta7p35zfnlpq5iqqvnm7gwe

Last Copies: What's at Risk?*

Lynn Silipigni Connaway, Edward T. O'Neill, Chandra Prabha
2006 College and Research Libraries  
This study proposes a conceptual model derived from the examination of materials held exclusively by Vanderbilt University Libraries. The libraries hold approximately 1.5 million items in WorldCat.  ...  Last copies are a class of unique library materials that are at risk and warrant consideration for long-term preservation.  ...  In most large-scale preservation and digitization projects, library materials are not selected on an item-by-item basis.  ... 
doi:10.5860/crl.67.4.370 fatcat:4eyxcvstpffwpnginhsi7fotru

The SCAM Approach to Copy Detection in Digital Libraries

Narayanan Shivakumar, Hector Garcia-Molina
1995 D-Lib Magazine  
Funding for this cooperative agreement is also provided by ARPA, NASA, and the industrial partners of the Stanford Digital Libraries Project. .  ...  Its content does not necessarily reflect the position or the policy of the Government, CNRI or other sponsoring parties, and no official endorsement should be inferred.  ...  Since the number of digital documents is increasing at a fast rate, an important area of research is how to make copy detection mechanisms scale to such large number of articles without losing accuracy  ... 
doi:10.1045/november95-shivakumar fatcat:kd2ynerasndypcglmbuhnbydjm

Sketch-Based Image Queries in Topographic Databases

Peggy Agouris, Anthony Stefanidis, James D Carswell
1999 Journal of Visual Communication and Image Representation  
In this paper we present the development of a system prototype for sketch-based queries for the content-based retrieval of digital images from topographic databases.  ...  The query tools devised in this research are employing user-provided sketches of the shape and spatial configuration of the object(s) which should appear in the images to be retrieved.  ...  In particular, digital photogrammetric applications have become more robust, moving from the experimental use of a few images to large scale projects which employ numerous images.  ... 
doi:10.1006/jvci.1999.0414 fatcat:hyhiqzqq7fglncqvpd2bqkqkju

Rethinking Research Library Collections

Dan Hazen
2010 Library resources & technical services  
Analog materials remain both prevalent and indispensable as the digital explosion continues apace. Despite the persistence of print, large-scale digitization is transforming the library world.  ...  Prestige based on both size and scarcity may diminish as large-scale digitization weakens the once obvious benefits of local ownership.  ... 
doi:10.5860/lrts.54n2.115 fatcat:i2nexeqwhndtfmv5zzavq5zmja

WEON: towards a software ecosystem ONtology

Claudio Gutierrez, Romain Robbes
2013 Proceedings of the 2013 International Workshop on Ecosystem Architectures - WEA 2013  
The natural distributed character of software ecosystems calls for a shared conceptualization and language to describe their architecture and their evolution.  ...  In this paper: we argue in favor of such an approach by showing that there is succesful experience applying ontologies to the fields of software engineering and software architecture; show the issues arising  ...  The natural scaling implicit in the notion of software ecosystems calls for a shared conceptualization to describe the architectures of such systems, which naturally are composed of smaller and distributed  ... 
doi:10.1145/2501585.2501589 dblp:conf/sigsoft/GutierrezR13 fatcat:5qycpwghrfdkjeq7mn23yqfv6q

Improving Text Relationship Modeling with Artificial Data [article]

Peter Organisciak, Maggie Ryan
2020 arXiv   pre-print
We apply and evaluate a synthetic data approach to relationship classification in digital libraries, generating artificial books with relationships that are common in digital libraries but not easier inferred  ...  Data augmentation uses artificially-created examples to support supervised machine learning, adding robustness to the resulting models and helping to account for limited availability of labelled data.  ...  Emerging areas of study make use of the historical breadth and depth of large bibliographic collections to learn from digital library content at scale.  ... 
arXiv:2010.14640v1 fatcat:sti52eutxnbwtoyncuonnydbxe

Toward Virtual Community Knowledge Evolution

Michael Bieber, Douglas Engelbart, Richard Furuta, Starr Roxanne Hiltz, John Noll, Jennifer Preece, Edward A. Stohr, Murray Turoff, Bartel Van De Walle
2002 Journal of Management Information Systems  
, advanced hypermedia features, and conceptual knowledge structures.  ...  members would actively use in everyday tasks and regularly update.  ...  at the New Jersey Institute of Technology (NJIT), the New Jersey Department of Transportation, and the New Jersey Commission of Science and Technology.  ... 
doi:10.1080/07421222.2002.11045707 fatcat:dlf55k2qkja3laam2xzngqaxdm

2dF grows up: Echidna for the AAT

Andrew McGrath, Sam Barden, Stan Miziarski, William Rambold, Greg Smith, Ian S. McLean, Mark M. Casali
2008 Ground-based and Airborne Instrumentation for Astronomy II  
Casali, Downloaded from SPIE Digital Library on 13 Oct 2010 to 129.94.163.103.  ...  Terms of Use: http://spiedl.org/terms Proc. of SPIE Vol. 7014 70144K-3 Downloaded from SPIE Digital Library on 13 Oct 2010 to 129.94.163.103. Terms of Use: http://spiedl.org/terms  ...  Large scale spectroscopic surveys are now recognized as very important for ongoing work, and as a result surveys are presently in progress (e.g.  ... 
doi:10.1117/12.788605 fatcat:pvac2i3awbcdlozc5s4bs5kane

Archival quality and long-term preservation: a research framework for validating the usefulness of digital surrogates

Paul Conway
2011 Archival Science  
It outlines a research project that is designed to develop and test measures of quality for digital content preserved in HathiTrust, a large-scale preservation repository.  ...  Increasingly, stakeholders are creating large-scale digital repositories to ingest surrogates of archival resources or digitized books whose intellectual value as surrogates may exceed that of the original  ...  at a large scale.  ... 
doi:10.1007/s10502-011-9155-0 fatcat:236mrshj4vb7znnvayom5bqedu

Reflections on Collective Collections

Brian Lavoie, Lorcan Dempsey, Constance Malpas
2020 College and Research Libraries  
print, digitization, and group-scale discovery and fulfillment. .  ...  Constructing, understanding, and operationalizing collective collections is an increasingly important aspect of collection management for many libraries.  ...  In a similar way, scale drives the global diversity of the collection.  ... 
doi:10.5860/crl.81.6.981 fatcat:xptx6kwjwrfw5ocila6qahqpcu
« Previous Showing results 1 — 15 out of 15,234 results