Advanced Information Systems for Archival Appraisals of Contemporary Documents

William McFadden, Kenton McHenry, Rob Kooper, Michal Ondrejcek, Alex Yahja, Peter Bajcsy
2008 2008 IEEE Fourth International Conference on eScience  
This work addresses the problem of designing a scalable framework for archival appraisals of contemporary PDF documents. The motivation for our work is to provide an e-Science solution that (a) fuses the independent research methodologies focusing on specific information types to one comprehensive analytical framework, (b) optimizes tradeoffs between computational requirements and preservation costs, and (b) bridges the small scale and large scale computational studies. The e-Science solution
more » ... esented here consists of (1) a methodology for comprehensive comparisons of contemporary documents containing text, images and vector graphics, (2) a framework for including 3D and 3D+time data sets into the appraisal analyses, (3) interfaces supporting exploratory archival appraisal analyses with small scale data sets, and (4) infrastructure supporting the transition from small scale to large scale computations using commodity and high performance computing resources. The novelty of our work is in designing methodologies, mathematical frameworks and prototypes for comprehensive and scalable document appraisals that include text, images, vector graphics , and high dimensional data.
doi:10.1109/escience.2008.140 dblp:conf/eScience/McFaddenMKOYB08 fatcat:fcttvlz6b5b3dkyq3o3vbded5u