Filters








17,202 Hits in 6.2 sec

A Novel Approach of Data Extraction from Indian Degraded Historical Documents using Gamma Variation and Histogram Balancing Method

Neelu Maheshwari, Anurag Maloo, Pankaj Singh Parihar
2015 International Journal of Engineering Research and  
Advanced Digital Image processing can help enhance the images of these manuscripts in order to empower recovery of the written content from these degraded documents.  ...  This enhanced image is send to a trained OCR engine for extracting Sanskrit data contents written on manuscripts.  ...  Proposed Solution In this paper a novel method is proposed for digitally enhancing and data extraction from timely degraded historical documents.  ... 
doi:10.17577/ijertv4is060470 fatcat:me4qoqopkzhkdbqy3oy7k5nkp4

Construction of an International Digital Sharing Platform of Dongba Manuscripts and Dongba Hieroglyphs

Xu Xiaoli, Li Dong, Jiang Zhanglei, Li Ning, Wu Guoxin, Wang Hongjun, Zhang Xu, Bai Feng
2019 Computer systems science and engineering  
This platform provides digital resources comprising Dongba manuscripts and related literature, tools for deciphering Dongba manuscripts, an environment for undertaking and sharing research, and dynamic  ...  This platform provides: a channel for resource sharing and academia exchange of a widely scattered collection of Dongba manuscripts among a number of researchers; a means of digitally preserving Dongba  ...  several research institutions were established, and new ways of achieving long-lasting digital protection and bequeathing of Dongba manuscripts and Dongba hieroglyphs were provided.  ... 
doi:10.32604/csse.2019.34.191 fatcat:dkf2p3vpuvhtpoxioll7zqlbw4

Automatic CNN-Based Arabic Numeral Spotting and Handwritten Digit Recognition by Using Deep Transfer Learning in Ottoman Population Registers

Yekta Said Can, M. Erdem Kabadayı
2020 Applied Sciences  
Recent developments in the digital humanities field and the need for extracting information from the historical documents have fastened the digitization processes.  ...  Historical manuscripts and archival documentation are handwritten texts which are the backbone sources for historical inquiry.  ...  After spotting the numerals, Arabic digits should be recognized for information retrieval from the historical manuscripts.  ... 
doi:10.3390/app10165430 fatcat:mxj2ep5cbjesrhwaga35mmcgzq

You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine [article]

Thibault Clérice
2022 arXiv   pre-print
The ability of identifying main body of text from marginal text or running titles makes the difference between extracting the work full text of a digitized book and noisy outputs.  ...  We propose to shift, for efficiency, the task from a pixel classification-based polygonization to an object detection using isothetic rectangles.  ...  I INTRODUCTION In recent years, automatic text extraction has become an important activity in digital philology and, in general, in corpus creation for historical documents.  ... 
arXiv:2207.11230v1 fatcat:ubc7ebgi5zdh3mxnsnmo6rfw24

Automatic segmentation of digitalized historical manuscripts

Costantino Grana, Daniele Borghesani, Rita Cucchiara
2010 Multimedia tools and applications  
The artistic content of historical manuscripts provides a lot of challenges in terms of automatic text extraction, picture segmentation and retrieval by similarity.  ...  In particular this work addresses the problem of automatic extraction of meaningful pictures, distinguishing them from handwritten text and floral and abstract decorations.  ...  The former stores the high resolution digitized manuscripts, while the latter contains both the automatically extracted knowledge and later in the future it will contain also the historical comments added  ... 
doi:10.1007/s11042-010-0561-8 fatcat:vmdy54b6wnbuhp465ebne5ghi4

About the authors

2004 Focus on Geography  
His primary focus is on extracting thematic information from remotely sensed imagery.  ...  FOCUS on Geography welcomes article ideas and manuscripts from geographers and those who write geographically.  ... 
doi:10.1111/j.1949-8535.2004.tb00050.x fatcat:u6gzop6qejeularl5i3xeynjnu

Submission guidelines

2004 Focus on Geography  
His primary focus is on extracting thematic information from remotely sensed imagery.  ...  FOCUS on Geography welcomes article ideas and manuscripts from geographers and those who write geographically.  ... 
doi:10.1111/j.1949-8535.2004.tb00051.x fatcat:sskipwiq7zgn3pr4jdksngq67i

ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents [article]

Vincent Christlein and Anguelos Nicolaou and Mathias Seuret and Dominique Stutzmann and Andreas Maier
2019 arXiv   pre-print
This competition investigates the performance of large-scale retrieval of historical document images based on writing style.  ...  of (i) manuscript books, (ii) letters, (iii) charters and legal documents.  ...  The different configurations of oriented Basic Image Features (oBIFs) columns histograms [11] , [12] are extracted from historical document samples and concatenated for generating a feature vector.  ... 
arXiv:1912.03713v1 fatcat:7lhdkywotfay5mc36v43cjixha

A Deep Learning Approach for Recognizing the Cursive Tamil Characters in Palm Leaf Manuscripts

Gayathri Devi S, Subramaniyaswamy Vairavasundaram, Yuvaraja Teekaraman, Ramya Kuppusamy, Arun Radhakrishnan, Syed Ahmad Chan Bukhari
2022 Computational Intelligence and Neuroscience  
Because of the necessity for digitalization and transcription, recognizing the cursive characters found in palm leaf manuscripts remains an open problem.  ...  Finally, the extracted cursive characters are given as input to the CNN technique for final classification.  ...  step to obtain useful information from digital text images.  ... 
doi:10.1155/2022/3432330 pmid:35310599 pmcid:PMC8933122 fatcat:bfok3bdwnfdi5kxuwf3wibywbi

A new Connected Component Analysis based System for Text Segmentation in Degraded Historical Document Images

2020 VOLUME-8 ISSUE-10, AUGUST 2019, REGULAR ISSUE  
So, there is a need for text segmentation and feature extraction to convert these manuscripts into machine editable format.  ...  Historical documents contain valuable heritage information. These documents are preserved in the manuscript preservation center and archaeological departments.  ...  These documents cannot be accessed until it is converted into digital format, so these manuscripts are scanned and digitized to preserve these documents from degradation and parchment.  ... 
doi:10.35940/ijitee.f3503.049620 fatcat:kz5joulttnarzljot6adjn3p5e

The use of historical sources in a multi-layered methodology for karez research in Turpan, China

Sophie Barbaix, Alishir Kurban, Philippe De Maeyer, Xi Chen, Jean Bourgeois
2020 Water History  
These problems make it harder to read the landscape, which is why we have to start extracting our data from maps, reports, photographs, and satellite imagery.  ...  In this article, we will present an overview of possible research methods to handle historical sources, in the specific case of karez landscapes.  ...  Most of all, many of the pictures were selected by the researchers to show to the public, once returned from the fieldwork, from a larger set.  ... 
doi:10.1007/s12685-020-00259-z pmid:33224321 pmcid:PMC7672419 fatcat:uiytrsrubfdsbignb3mwyvy2ie

ANALYZING DIFFERENT ALGORITHMS AND TECHNIQUES TO FIND OPTICAL CHARACTER RECOGNITION FOR TAMIL SCRIPTS

Rajkumar N
2020 JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES  
A system that does not include obtaining either Standard size and shape or the color difference between background and foreground to recognize Palm Leaf Manuscript and stone inscriptions and obtaining  ...  This paper focuses mainly in particular on OCR for the digitalization and conservation of texts and inscriptions in the Tamil language.  ...  The research community was interested in the extract of text from document images via a tenner, but very little research was done in the digitalization of inscription images of historical monuments.  ... 
doi:10.26782/jmcms.2020.02.00029 fatcat:t3szo4mhavcvjluba6mhysdre4

A study of Japanese landscapes using structure from motion derived DSMs and DEMs based on historical aerial photographs: New opportunities for vegetation monitoring and diachronic geomorphology

Christopher Gomez, Yuichi Hayakawa, Hiroyuki Obanawa
2015 Geomorphology  
Acknowledgment The authors are in debt to two anonymous reviewers and Lucian Dragut for the numerous feedbacks they provided and which greatly improved the original manuscript.  ...  A C C E P T E D M A N U S C R I P T ACCEPTED MANUSCRIPT Firstly, the suitability of historical aerial photographs for extracting topographic and vegetation data was investigated using the two first datasets  ...  the DSM and extract DEM (digital elevation model) data semiautomatically, and (3) perform measures from the 3D imagery including diachronic geomorphologic analysis.  ... 
doi:10.1016/j.geomorph.2015.02.021 fatcat:3umfightvjawrfugvsudt2yx2a

"Inside the bible"

Costantino Grana, Daniele Borghesani, Simone Calderara, Rita Cucchiara
2008 Proceeding of the 1st ACM international conference on Multimedia information retrieval - MIR '08  
In this paper we present a system for automatic segmentation, annotation and image retrieval based on content, focused on illuminated manuscripts and in particular the Borso D'Este Holy Bible.  ...  standard keyword-based retrieval approach in a commentary with a modern visual-based retrieval by appearance similarity: an entire software user interface for exploration and visual search of illuminated manuscripts  ...  In particular, the high resolution digitalized replicas of the Bible's pages constitutes the image database, while the annotation database contains both the automatically extracted knowledge and the historical  ... 
doi:10.1145/1460096.1460158 dblp:conf/mir/GranaBCC08 fatcat:4suehiqf35hxnpfpt6bcm4h32q

Towards a Digital Infrastructure for Illustrated Handwritten Archives [chapter]

Andreas Weber, Mahya Ameryan, Katherine Wolstencroft, Lise Stork, Maarten Heerlien, Lambert Schomaker
2018 Lecture Notes in Computer Science  
Large and important parts of cultural heritage are stored in archives that are difficult to access, even after digitization.  ...  Documents and notes are written in hard-to-read historical handwriting and are often interspersed with illustrations.  ...  , classification and linking of 11 knowledge from historical manuscript collections.  ... 
doi:10.1007/978-3-319-75826-8_13 fatcat:ytej62msh5b4xfmgyuh4srnq3a
« Previous Showing results 1 — 15 out of 17,202 results