Recurrent Neural Network Approach for Table Field Extraction in Business Documents

Clement Sage, Alexandre Aussem, Haytham Elghazel, Veronique Eglin, Jeremy Espinas
2019 2019 International Conference on Document Analysis and Recognition (ICDAR)  
Efficiently extracting information from documents issued by their partners is crucial for companies that face huge daily document flows. Particularly, tables contain most valuable information of business documents. However, their contents are challenging to automatically parse as tables from industrial contexts may have complex and ambiguous physical structure. Bypassing their structure recognition, we propose a generic method for end-to-end table field extraction that starts with the sequence
more » ... f document tokens segmented by an OCR engine and directly tags each token with one of the possible field types. Similar to the state-of-the-art methods for non-tabular field extraction, our approach resorts to a token level recurrent neural network combining spatial and textual features. We empirically assess the effectiveness of recurrent connections for our task by comparing our method with a baseline feedforward network having local context knowledge added to its inputs. We train and evaluate both approaches on a dataset of 28,570 purchase orders to retrieve the ID numbers and quantities of the ordered products. Our method outperforms the baseline with micro F1 score on unknown document layouts of 0.821 compared to 0.764.
doi:10.1109/icdar.2019.00211 dblp:conf/icdar/SageAEEE19 fatcat:47u4bzvomzdhhhppyga2tbx3n4