On Benchmarking of Invoice Analysis Systems
[chapter]
2006
Lecture Notes in Computer Science
The state of the art of benchmarking of document analysis systems is presented, based on the processing levels: Document Page Segmentation, Text Recognition, Document Classification, and Information Extraction ...
An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. ...
Document Page Segmentation Two main classes of approaches for the benchmarking of document page segmentation can be distinguished in the literature: bitmap based approaches that operate at pixel level ...
doi:10.1007/11669487_28
fatcat:vollb74orzewxhu5ty3cfvqalq
Document image analysis: What is missing?
[chapter]
1995
Lecture Notes in Computer Science
The realization that accurate document image analysis requires fairly specific pre-stored information has resulted in the investigation of new data structures for knowledge bases and for the representation ...
Driven partly by document distribution on CD-ROM and via the World Wide Web, there is more interest in the preservation of layout and format attributes to increase legibility (sometimes called "page reconstruction ...
This work has been conducted in the New York State Center for Advanced Technology (CAT) in Manufacturing, Automation and Robotics, which is partially funded by a block grant from the New York. ...
doi:10.1007/3-540-60298-4_317
fatcat:nqgzemstcfhnpitdrsecr6a4du
Bridging the paper and electronic worlds
1993
Proceedings of the SIGCHI conference on Human factors in computing systems - CHI '93
Despite these advantages, paper remains a second class citizen in the electronic world. In this paper, we present a new technology for bridging the paper and the electronic worlds. ...
In the new technology, the user interface moves beyond the workstation and onto paper itself. We describe paper user interface technology and its implementation in a particular system called XAX. ...
Characters and fonts that can be recognized in a clean 400 spot-per-inch image become unrecognizable in a skewed and stretched 200x100 spot-per-inch fax-based image. ...
doi:10.1145/169059.169445
dblp:conf/chi/JohnsonJKRC93
fatcat:ubjnlsykejaknhjz5oauy765vq
Development of an Automated Roughness Estimation Algorithm for Sidescan Imagery
2007
OCEANS 2007 - Europe
In repeated trials, results of the automated roughness algorithm correlated well (as high as 87%) with manual roughness estimations. ...
Features such as pockmarks, sand ripples, and rocks on the seafloor are visible in SSI as bright spots ("brights") with adjacent shadows. ...
In other words, a single shadow threshold value (i_min) suffices for more values of x than a single bright threshold value (i_max). ...
doi:10.1109/oceanse.2007.4302388
fatcat:7ot5xgp5tjclzjhw7ts2s5egti
A Study on Optical Character Recognition Techniques
2017
International Journal of Computational Science Information Technology and Control Engineering
In general the alphabets are in the variety of pixel pictures and it could be either handwritten or stamped, of any series, shape or direction etc. ...
In our survey we study on the various OCR techniques. In this paper we resolve and examine the hypothetical and numerical models of Optical Character Identification. ...
in a bitmap format. ...
doi:10.5121/ijcsitce.2017.4101
fatcat:tvyygemmeng7hl7i34w5w34xnm
An old greek handwritten OCR system based on an efficient segmentation-free approach
2007
International Journal on Document Analysis and Recognition
which are written in lowercase letters and originate from St. ...
In this paper, we focus on the problem of recognizing Old Greek manuscripts and propose a novel recognition technique that has been tested in a large number of important historical manuscript collections ...
In the case of historical documents, Manmatha and Croft [23] presented a method for word spotting wherein matching was based on the comparison of entire words rather than individual characters. ...
doi:10.1007/s10032-006-0031-z
fatcat:vobtuts65bffvjn4rnm66fka5a
Ottoman archives explorer
2009
ACM Journal on Computing and Cultural Heritage
The prototype system provides CBR of document images using both example-based queries and a virtual keyboard to construct query words. ...
In particular, sliding-window and histogram segmentation methods are coupled with recognition approaches using spatial features, neural networks, and a graph-based model. ...
In Ataer and Duygulu [2006], a retrieval system for Ottoman documents is proposed that involves first segmenting lines and words in a document, and then comparing whole words while querying. ...
doi:10.1145/1658346.1658348
fatcat:kihwabab25b4lhajjoj2bwqqcq
Graphical information resources
1985
Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '85
There is no analog of the simple keyword systems that handle textual documents in any subject area. ...
Again, there are a variety of styles of data representation used in printing documents. ...
This turns, for example "sum from 0 to inf x sub i" into $\sum_{0}^{\infty} x_i$, with the items sum, from, to and inf being reserved words. ...
doi:10.1145/253495.253504
dblp:conf/sigir/Lesk85
fatcat:dnlogk2mqfhvzftisikciosa3y
The University Licensing program (TULIP):
1994
Serials: The Journal for the Serials Community
document delivery systems, PostScript or SGML database files or network development. ...
Each university receives, without charge during the project, the electronic full-text (bitmapped and ASCII) for those journals to which it subscribes in paper. ...
doi:10.1629/0702169
fatcat:kcirdvty5za3bjwmb7n7vmjcea
Given a document patch image, BWC utilizes the layout, i.e. relative locations, of word boxes in order to determine the original file, page, and the location on the page. ...
Instead, we propose a document recognition algorithm that automatically determines the location of a patch of text in a large collection of document images given a small document image. ...
For documents, a specially designed printer driver saves high resolution bitmap files for each printed page. ...
doi:10.1145/1459359.1459413
dblp:conf/mm/ErolAH08
fatcat:ytghlfxxufgd3hnmyerbv3kswm
On the applications of multimedia processing to communications
1998
Proceedings of the IEEE
The packet network is primarily accessed by client programs in PC's, and so it is inherently PC oriented and client/server driven. ...
Today, the Internet (the largest of the existing packet networks) connects more than 40 million computers in some 140 countries. ...
Howard (fax and image coding), A. Puri, J. Ostermann, T. Chen, and A. ...
doi:10.1109/5.664272
fatcat:yhn4kb3mmbctbbfus7br26oox4
Word spotting via spatial point processes
Proceedings of 3rd IEEE International Conference on Image Processing
This paper presents a statistically based method for spotting target words in documents. ...
This is accomplished by extracting the coordinate pairs, i.e. pixel locations, where the binary bitmap values of the word are non-zero. ...
It would be useful if this process could be automated and work reliably on documents with different fonts, font sizes, and noise contamination, e.g. in faxed documents. ...
doi:10.1109/icip.1996.560733
dblp:conf/icip/ONeillHW96
fatcat:pz6e4ocxivhapimlockw4uvvaq
Developing and Publishing High-Quality Electronic Documents on CD-ROM
[article]
1997
Eurographics State of the Art Reports
These notes explore the opportunities and challenges in this kind of publishing and help the reader evaluate the different kinds of approaches that may be taken in this area. ...
Electronic publishing is the enterprise of producing content in electronic form that can be delivered by electronic media. ...
search on the document or on a set of documents. • Play a movie that is in-line in the document. • Execute a link on the Web to an online document. • Use indexes to speed up word search (if the user is ...
doi:10.2312/egt.19971007
fatcat:m36r4jkb75hetgpx3q5ggv3rva
166 Attentional capture and oculomotor control
1998
International Journal of Psychophysiology
Larger bitmap time intervals gave better results. ...
@cf.ac.uk) Can colour cues attract attention in a manner similar to those documented for luminance cues? ...
doi:10.1016/s0167-8760(98)90166-0
fatcat:7o5vmrk5zvbizd23bmpcbkrs7q
The Design of a Human Memory Prosthesis
1994
Computer journal
These guidelines define this new class of application, provide the basis for our continuing work in support of the problems of everyday office life, and offer a new challenge for computer systems research ...
The next morning I saw my "name switch" said "IN". • Off to fax room with letter to be faxed-forgot to take the fax number (again!). ...
So we're looking to see if we can find it. • I've just forgotten the word for "taking a responsibility". • Confusion about whether I had given Mary something (some documents). ...
doi:10.1093/comjnl/37.3.153
fatcat:zzebva5abzgg7jupyjzdmvilma