Filters








6 Hits in 1.4 sec

CloudScan - A configuration-free invoice analysis system using recurrent neural networks [article]

Rasmus Berg Palm, Ole Winther, Florian Laws
2017 arXiv   pre-print
We present CloudScan; an invoice analysis system that requires zero configuration or upfront annotation.  ...  We describe a recurrent neural network model that can capture long range context and compare it to a baseline logistic regression model corresponding to the current CloudScan production system.  ...  This is the goal of CloudScan: to be a simple, configuration and maintenance free invoice analysis system that can convert documents from both previously seen and unseen templates with high levels of accuracy  ... 
arXiv:1708.07403v1 fatcat:3zxolqamfncb3m5qdb67pji5xq

CloudScan - A Configuration-Free Invoice Analysis System Using Recurrent Neural Networks

Rasmus Berg Palm, Ole Winther, Florian Laws
2017 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)  
We present CloudScan; an invoice analysis system that requires zero configuration or upfront annotation.  ...  We describe a recurrent neural network model that can capture long range context and compare it to a baseline logistic regression model corresponding to the current CloudScan production system.  ...  This is the goal of CloudScan: to be a simple, configuration and maintenance free invoice analysis system that can convert documents from both previously seen and unseen templates with high levels of accuracy  ... 
doi:10.1109/icdar.2017.74 dblp:conf/icdar/PalmWL17 fatcat:myxovi3lpnbqxnkatihx7jdybq

Context-based Information Classification on Hungarian Invoices

Gábor Szegedi, Diána Bajdikné Veres, Imre Lendák, Tomás Horváth
2020 Conference on Theory and Practice of Information Technologies  
Our goal here was to create a solution capable of finding information on scanned invoices without knowing the template of the invoice.  ...  First we feed the invoice image to a commercially available Optical Character Recognition (OCR) engine which returns the extracted texts with their bounding boxes.  ...  CloudScan -A configuration-free invoice analysis system using recurrent neural networks is another article from 2017 in which the authors have created a fully fledged endto-end pipeline for parsing invoices  ... 
dblp:conf/itat/SzegediVLH20 fatcat:j3vny5u2x5c5vdwvzlbyl74eq4

Efficient Automated Processing of the Unstructured Documents using Artificial Intelligence: A Systematic Literature Review and Future Directions

Dipali Baviskar, Swati Ahirrao, Vidyasagar Potdar, Ketan Kotecha
2021 IEEE Access  
Our SLR also reveals a need for a close association between the businesses and researchers to handle various challenges of the unstructured data analysis.  ...  Our SLR discovered that AI-based approaches have a strong potential to extract useful information from unstructured documents automatically.  ...  It is an invoice analysis system with a Graphical User Interface (GUI) with zero configuration and requires no upfront annotation.  ... 
doi:10.1109/access.2021.3072900 fatcat:lrbzlmo5gnczhadnrxd2aoqz4u

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents [article]

Zhanzhan Cheng, Peng Zhang, Can Li, Qiao Liang, Yunlu Xu, Pengfei Li, Shiliang Pu, Yi Niu, Fei Wu
2022 arXiv   pre-print
This paper proposes a unified end-to-end information extraction framework from visually rich documents, where text reading and information extraction can reinforce each other via a well-designed multi-modal  ...  ., tickets and resumes) has become a hot and vital research topic due to its widespread commercial value.  ...  Inspired by this idea, [4] proposed CloudScan, an invoice analysis system, which used recurrent neural networks to extract entities of interest from VRDs instead of templates of invoice layout. [5]  ... 
arXiv:2207.06744v1 fatcat:lo3yowbpxzaqrhjuw6qntd6ggy

Cost-effective End-to-end Information Extraction for Semi-structured Document Images

Wonseok Hwang, Hyunji Lee, Jinyeong Yim, Geewook Kim, Minjoon Seo
2021 Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing   unpublished
Cloudscan - A configuration-free invoice Kristina Toutanova. 2018. BERT: pre-training analysis system using recurrent neural networks.  ...  Recurrent neural network Kaiser, and Illia Polosukhin. 2017. Attention grammars. In Proceedings of the 2016 Conference is all you need. In I. Guyon, U. V.  ... 
doi:10.18653/v1/2021.emnlp-main.271 fatcat:fz7cw2lthrfg7o7khaoagplf4i