4,688 Hits in 7.2 sec

Wrapping PDF Documents Exploiting Uncertain Knowledge [chapter]

S. Flesca, S. Garruzzo, E. Masciari, A. Tagarelli
2006 Lecture Notes in Computer Science  
In this paper we address the problem of wrapping PDF documents, which raises new challenges in the information extraction field.  ...  The proposal is based on a novel bottom-up wrapping approach to extract information tokens and integrate them into groups related according to the logical structure of a document.  ...  To date, the problem of wrapping PDF documents has not been studied at all, in spite of its applicability to a wide variety of scenarios.  ... 
doi:10.1007/11767138_13 fatcat:rwvjwrcbdrfand5zkuzxvtuq7y

Acoustic Study of a Neonatal Intensive Care Unit: Preliminary Results

Ganna Raboshchuk, Climent Nadeu, Blanca Muñoz Mahamud, Ana Riverola de Veciana, Santiago Navarro Hervas
2014 International Work-Conference on Bioinformatics and Biomedical Engineering  
Preliminary results of the acoustical analysis of sounds are presented along with the experiments on automatic detection of the presence of vocalizations.  ...  The acoustic environment of a typical neonatal intensive care unit is very rich and may contain a large number of different sounds, which reflect the activities taking place in it.  ...  Preliminary results on a vocalizations detection task are rather encouraging.  ... 
dblp:conf/iwbbio/RaboshchukNMVH14 fatcat:cthzjd5qczbvzeiz4pdycrkq6u

10. Convict labour in early colonial Northern Nigeria: a preliminary study [chapter]

Mohammed Bashir Salau
2015 From Dust to Digital: Ten Years of the Endangered Archives Programme  
The project EAP373: Documenting, conserving and archiving the Tai Ahom manuscripts of Assam ( a4d?  ...  1 Since 2007, this work on Ahom was funded first by the DoBeS Documentation of Endangered Languages project, financed by the Volkswagen Stiftung, based at the Max Planck Institute in Nijmegen, and later  ...  Markham, A History of the Abyssinian Expedition other preliminary activities continued in the open air, with a fallen tree trunk serving as the studio workbench.  ... 
doi:10.11647/obp.0052.10 fatcat:4uejoyxb25bwlcfaiuuw36yasq

A Preliminary Impact Study of CYGNSS Ocean Surface Wind Speeds on Numerical Simulations of Hurricanes Harvey and Irma (2017)

Zhiqiang Cui, Zhaoxia Pu, Vijay Tallapragada, Robert Atlas, Christopher S. Ruf
2019 Geophysical Research Letters  
This study demonstrates the influence of assimilating an early version of CYGNSS observations of OSWS on numerical simulations of two notable landfalling hurricanes, Harvey and Irma (2017).  ...  A research version of the NCEP operational Hurricane Weather Research and Forecasting model and the Grid-point Statistical Interpolation based hybrid ensemble-3-dimensional variational data assimilation  ...  The goal of this study is to demonstrate the impact of a preliminary version of CYGNSS-retrieved OSWS on numerical simulations of hurricanes.  ... 
doi:10.1029/2019gl082236 fatcat:dmlaul4fcneyxlgebwvnw5mrwa

Pattern-based segmentation of digital documents

Angelo Di Iorio
2008 ACM SIGWEB Newsletter  
This thesis proposes a new document model, according to which any document can be segmented in some independent components and transformed in a patternbased projection, that only uses a very small set  ...  IML is a general and extensible language, which basically adopts an XHTML syntax, able to capture a posteriori the only content of a digital document.  ...  Note that patterns do not allow text to appear wherever in a document, but only wrapped by a block container.  ... 
doi:10.1145/1350502.1350505 fatcat:jjczizjwjfbi7ovtznqrmanlsq

Using ART1 Neural Networks for Clustering Computer Forensics Documents

Georger Araújo, Célia Ralha
2012 The International Journal of Forensic Computer Science  
Furthermore, real world forensic experiments were carried out to validate the model using a two-fold approach with a quantitative and a qualitative analysis method.  ...  Classification methods should be an aid in the exploration of such corpora, but they do not help in the task of thematically grouping together documents.  ...  Some preliminary results of our approach to cluster digital forensic documents were presented in [40] .  ... 
doi:10.5769/j201201003 fatcat:62gwhkvbjnfepmlijvknb57y44

CED2AR: The Comprehensive Extensible Data Documentation and Access Repository

Carl Lagoze, Lars Vilhuber, Jeremy Williams, Benjamin Perry, William C. Block
2014 IEEE/ACM Joint Conference on Digital Libraries  
We describe the design, implementation, and deployment of the Comprehensive Extensible Data Documentation and Access Repository (CED 2 AR).  ...  This is a metadata repository system that allows researchers to search, browse, access, and cite confidential data and metadata through either a web-based user interface or programmatically through a search  ...  The project had not previously released DDI metadata, and used an internal custom metadata store to generate a PDF as the sole documentation.  ... 
doi:10.1109/jcdl.2014.6970178 dblp:conf/jcdl/LagozeVWPB14 fatcat:tpqyllstqnewvbvnp4etc46p5q

Food web changes documented in Lake Tahoe

2003 California Agriculture  
In addition, a preliminary study of eight State University Sacramento; D.  ...  Vol. 1, tistics/PDFs/2016Report.pdf Publ Adm Res Theor 22:1–29. United States: Case studies from Part 5.  ... 
doi:10.3733/ca.v057n04p103b fatcat:ka7kd5tnwnheni5yzdycyvhhua

Art and Documentation no.22

various authors
2020 Sztuka i Dokumentacja  
Some bits of the plaster fall on the wrapping paper.  ...  /journal_37.pdf. 12 Moeglin-Delcroix, Esthétique du livre d'artiste, 101.  ...  for the exhibition in Schwerin is a document from the Intellectual Benefits of Art Mail Art project announced in 1980.  ... 
doi:10.32020/artanddoc/22/2020/30 doaj:0cfeac1eebdb4b41a8b31d8510bcfcf1 fatcat:26s6xzya4bae7aacjwvc5uprcm

Lost in OCR-Translation: Pixel-based Text Reflow to the Rescue

Frode Eika Sandnes
2022 The15th International Conference on PErvasive Technologies Related to Assistive Environments  
The implementation provides low-vision users a practical alternative for simplified access to the content of archival documents with a different view than the state-of-the-art technologies.  ...  This paper discusses the capabilities of key stateof-the-art technologies and describes a browser-based document magnification implementation that reflow document contents at the pixel-level to prevent  ...  Microsoft has recently introduced a feature called PDF-reflow in a relatively recent version of Office. Users open a pdf document in Word.  ... 
doi:10.1145/3529190.3534734 fatcat:p2nj4gox2vhw3hw63xhcipfgke

An Algorithm for Transforming XPath Expressions According to Schema Evolution

Kazuma Hasegawa, Kosetsu Ikeda, Nobutaka Suzuki
2013 ACM Symposium on Document Engineering  
XML is a de-fact standard format on the Web. In general, schemas of XML documents are continuously updated according to changes in real world.  ...  We also show some preliminary experimental results.  ...  INTRODUCTION XML [5] is a de-fact standard format on the Web. An XML document is usually stored with its schema so that the structural consistency of the document is ensured.  ... 
dblp:conf/doceng/HasegawaIS13 fatcat:ctsdnp455bgftop7z4wdx4opqi

Librarians as Co-Teachers and Curators: Integrating Information Literacy in a Studio Art Course at a Liberal Arts College

Lijuan Xu, Nestor Gil
2017 Art Documentation  
as a contribution or response.  ...  The authors describe a faculty-librarian team-teaching approach to building information literacy in a studio class.  ...  a meat necklace around her neck and cradling a piece of beef wrapped in a blanket (Figure 3) .  ... 
doi:10.1086/691376 fatcat:7oab26khxzakjf47rmdwehulp4

Optical character recognition: an illustrated guide to the frontier

George Nagy, Thomas A. Nartker, Stephen V. Rice, Daniel P. Lopresti, Jiangying Zhou
1999 Document Recognition and Retrieval VII  
The analysis of a series of "snippets" from this perspective provides insight into the strengths and weaknesses of current systems, and perhaps a road map to future progress.  ...  We offer a perspective on the performance of current OCR systems by illustrating and explaining actual OCR errors made by three commercial devices.  ...  The study of invariant features that describe printed and hand-printed characters remains a topic of continuing interest.  ... 
doi:10.1117/12.373511 dblp:conf/drr/NagyNR00 fatcat:yzyw6gx5zzasvcjokaas4gkkli

Theory in practice. digital documentation in developing effective methods for preserving cultural heritage. The study case of Raksila wooden neighbourhood in Oulu, Finland

Sara Porzilli, Anna-Maija Ylimaula
2019 Zenodo  
The contribution describes these themes by presenting the research carried out in Raksila neighborhood, Oulu (Finland) by the team of the "Department of History of Architecture and Restoration Studies"  ...  This contribution wants to address the obvious need to rewrite some central restoration guidelines still prevailing, in spite of the recent technological advancements provided by new and radical documentation  ... .pdf. 16 INTRODUCTION Heritage, what we see today, is a micro-cosmic part of the earth; a visible  ... 
doi:10.5281/zenodo.3559606 fatcat:wzdeyj7qkbegni655cbmwjvzam

Bluima: a UIMA-based NLP Toolkit for Neuroscience

Renaud Richardet, Jean-Cédric Chappelier, Martin Telefont
2013 German Society for Computational Linguistics  
The second component is a common analysis structure (CAS) store based on Mon-goDB, to perform incremental annotation of large document corpora.  ...  This paper describes Bluima, a natural language processing (NLP) pipeline focusing on the extraction of neuroscientific content and based on the UIMA framework.  ...  A PDF reader was developed to provide robust and precise text extraction from scientific articles in PDF format.  ... 
dblp:conf/gldv/RichardetCT13 fatcat:6l46vlncefd7pg7nn4nkvk5njy
« Previous Showing results 1 — 15 out of 4,688 results