Extraction and visualization of numerical and named entity information from a large number of documents

Masaki Murata, Qing Ma, Kentaro Torisawa, Masakazu Iwatate, Tamotsu Shirado, Koji Ichii, Toshiyuki Kanamaru
2008 2008 International Conference on Natural Language Processing and Knowledge Engineering  
We have developed a system that can semiautomatically extract numerical and named entity sets from a large number of Japanese documents and can create various kinds of tables and graphs.  ...  From this perspective, we concluded that our system is useful and convenient for extracting information from a large number of documents. We have constructed a demonstration system.  ...  Shinyama et al. extracted related NE information from a large number of documents [15], but they did not extract numerical information or graphs.  ... 
doi:10.1109/nlpke.2008.4906795 dblp:conf/nlpke/MurataMTISIK08 fatcat:mcevktdgsreulaugbnouj57b6e

Extraction and Representation of Financial Entities from Text [chapter]

Tim Repke, Ralf Krestel
2021 Data Science for Economics and Finance  
Suitable visualization techniques can overcome this requirement and enable users to explore large sets of documents.  ...  This chapter provides an overview of corpora commonly used in research and highlights related work and state-of-the-art approaches to extract and represent financial entities and relations. The second part  ...  ANNIE, A Nearly-New Information Extraction system, is the component for named entity extraction implementing a more traditional recognition model [15].  ... 
doi:10.1007/978-3-030-66891-4_11 fatcat:ssd2taqbezg3bp36w2lbo5te4u

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction [article]

Guozhi Tang, Lele Xie, Lianwen Jin, Jiapeng Wang, Jingdong Chen, Zhen Xu, Qianying Wang, Yaqiang Wu, Hui Li
2021 arXiv   pre-print
Visual Information Extraction (VIE) task aims to extract key information from multifarious document images (e.g., invoices and purchase receipts).  ...  Notably, to the best of our knowledge, MatchVIE may be the first attempt to tackle the VIE task by modeling the relevancy between keys and values and it is a good complement to the existing methods.  ...  Introduction The Visual Information Extraction (VIE) aims to extract key information from document images (invoices, purchase receipts, ID cards, and so on), instead of plain texts.  ... 
arXiv:2106.12940v1 fatcat:zz33x5ah3bblfff7ijnilvowyu

Entity timelines

Arturas Mazeika, Tomasz Tylenda, Gerhard Weikum
2011 Proceedings of the 20th ACM international conference on Information and knowledge management - CIKM '11  
Analytics of the evolution of the entities poses many challenges including extraction, disambiguation, and canonization of entities from large text collections as well as introduction of specific analysis  ...  To this end, we have extracted, disambiguated, canonicalized, and connected named entities with the YAGO ontology. To analyze the evolution we have developed a visual analytics system.  ...  Hoffart, A. Anand, and M. Spaniol for valuable discussions and comments.  ... 
doi:10.1145/2063576.2064026 dblp:conf/cikm/MazeikaTW11 fatcat:4w6hredj25dcvfpwmbbgfvucri

Applying a text mining framework to the extraction of numerical parameters from scientific literature in the biotechnology domain

André Santos, Regina Nogueira, Anália Lourenço
2012 Advances in Distributed Computing and Artificial Intelligence Journal  
In this context, we have developed a data curation workflow, based on text mining techniques, to extract numerical parameters from scientific literature, and applied it to the biotechnology domain.  ...  This work describes the implementation of the workflow, identifies achievements and current limitations in the overall process, and presents the results obtained for a corpus of 50 full-text documents.  ...  Exporting annotated documents After annotating the documents and extracting the desired information, the results must be exported with two main goals: • Visualization: Graphical and intuitive visualization  ... 
doi:10.14201/adcaij20121118 doaj:2c84ee5fac5d4ceab9fcc85f6d01294f fatcat:xhgtp357inhhxgo6ldocfvdyd4

Text analysis and entity extraction in asymmetric threat response and prediction

Erwin Chan, Jason Ginsburg, Brian Ten Eyck, Jerzy Rozenblit, Mike Dameron
2010 2010 IEEE International Conference on Intelligence and Security Informatics  
ATRAP consists of a set of tools for annotating and automatically extracting entities and relationships from documents, visualizing this information in relational, geographic, and temporal dimensions,  ...  Subsequently, we describe linguistic characteristics of intelligence reports, and describe ATRAP's named entity recognition system.  ...  ACKNOWLEDGMENTS We are grateful to Ephibian, Inc. for their software engineering assistance, and to Neil Garra of the S2 Company for his subject matter expertise.  ... 
doi:10.1109/isi.2010.5484737 dblp:conf/isi/ChanGERD10 fatcat:ss7p5k7kezarhbivhmifdcdfd4

GeoCAM: A geovisual analytics workspace to contextualize and interpret statements about movement

Anuj Jaiswal, Scott Pezanowski, Prasenjit Mitra, Xiao Zhang, Sen Xu, Ian Turton, Alexander Klippel, Alan M. MacEachren
2011 Journal of Spatial Information Science  
This article focuses on integrating computational and visual methods in a system that supports analysts to identify, extract, map, and relate linguistic accounts of movement.  ...  We have built a set of geo-enabled, computational methods to identify documents containing movement statements, and a visual analytics environment that uses natural language processing methods iteratively  ...  The views, opinions, and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed  ... 
doi:10.5311/josis.2011.3.55 fatcat:hagcq4tcdrb6xpsttwsi42u7wq

Information Extraction from Visually Rich Documents with Font Style Embeddings [article]

Ismail Oussaid, William Vanhuffel, Pirashanth Ratnamogan, Mhamed Hajaiej, Alexis Mathey, Thomas Gilles
2021 arXiv   pre-print
Information extraction (IE) from documents is an intensive area of research with a large set of industrial applications.  ...  We propose to challenge the usage of computer vision in the case where both token style and visual representation are available (i.e native PDF documents).  ...  However, information extraction from native PDF documents represents a large and growing number of real-world use cases.  ... 
arXiv:2111.04045v1 fatcat:i375efp5one6djax4fitvsxbzu

ChemEx: information extraction system for chemical data curation

Atima Tharatipyakul, Somrak Numnark, Duangdao Wichadakul, Supawadee Ingsriswang
2012 BMC Bioinformatics  
ChemEx facilitates and speeds up chemical data curation by extracting compounds, organisms, and assays from a large collection of publications.  ...  Text annotator is able to extract compound, organism, and assay entities from text content while structure image recognition enables translation of chemical raster images to machine readable format.  ...  Acknowledgements This work was supported by National Center for Genetic Engineering and Biotechnology (BIOTEC).  ... 
doi:10.1186/1471-2105-13-s17-s9 pmid:23282330 pmcid:PMC3521388 fatcat:quqcssngxvcr3f4w4z4d3mwwkm

Multidimensional Visualization and Navigation in Search Results [chapter]

Will Archer Arentz, Aleksander Øhrn
2004 Lecture Notes in Computer Science  
By employing advanced linguistic and entity extraction techniques, the scheme can also be applied in situations where the original documents consist of unstructured text.  ...  Search engines traditionally index unstructured text and return ranked lists of documents that match a given query.  ...  By applying automatic entity extraction, elements like date, to- and from-addresses, topics, company names, personal names and more, can be extracted.  ... 
doi:10.1007/978-3-540-30132-5_86 fatcat:o4zetf3dgvemfn7qquxqrgicma

A Collaborative Framework for Discovering the Organizational Structure of Social Networks Using NER Based on NLP
NLP기반 NER을 이용해 소셜 네트워크의 조직 구조 탐색을 위한 협력 프레임 워크

Frank I. Elijorde, Hyun-Ho Yang, Jae-Wan Lee
2012 Journal of Internet Computing and services  
This paper combined a number of natural language processing methods such as NER (named entity recognition), sentence extraction, and part of speech tagging to carry out text analysis.  ...  ABSTRACT Many methods had been developed to improve the accuracy of extracting information from a vast amount of data.  ...  This can be useful in extracting information from unstructured documents in which unknown entities can be revealed and analyzed.  ... 
doi:10.7472/jksii.2012.13.2.99 fatcat:ivyxq6g3sjcmtdntrl55ggvdem

Geo-Quantities: A Framework for Automatic Extraction of Measurements and Spatial Context from Scientific Documents

Thorge Petersen, Muhammad Asif Suryani, Christian Beth, Hardik Patel, Klaus Wallmann, Matthias Renz
2021 17th International Symposium on Spatial and Temporal Databases  
In this paper we will introduce a system Geo-Quantities that supports the automatic extraction of quantitative, spatial and temporal information of a given measurement entity from scientific literature  ...  Quantitative information derived from scientific documents provides an important source of data for studies in almost all domains, however, manual extraction of this information is very time consuming.  ...  However, the MAR data were never compiled in a database but reported in a very large number of individual publications.  ... 
doi:10.1145/3469830.3470911 fatcat:yajwobh6inam3gmljj6xkm4nwu

A Summarization on Text Mining Techniques for Information Extracting from Applications and Issues

G. Ravi Kumar
2020 Journal of Mechanics of Continua and Mathematical Sciences  
Text mining is an extract from a huge number of text documents for interesting and nontrivial trends.  ...  The discovery of relevant patterns and trends for analyzing text documents from a huge volume of information is a major issue.  ...  A. Information Extraction Information Extraction (IE) is a method that generates useful information from a large amount of text.  ... 
doi:10.26782/jmcms.spl.5/2020.01.00026 fatcat:jbesfbnafvhcfa3suzghossqhi

Jigsaw: Supporting Investigative Analysis through Interactive Visualization

John Stasko, Carsten Görg, Zhicheng Liu
2008 Information Visualization  
As the number of documents and the corresponding number of concepts and entities within the documents grow larger, sense-making processes become more and more difficult for the analysts.  ...  Jigsaw provides multiple coordinated views of document entities with a special emphasis on visually illustrating connections between entities across the different documents.  ...  Department of Homeland Security Program, under the auspices of the Southeast Regional Visualization and Analytics Center.  ... 
doi:10.1057/palgrave.ivs.9500180 fatcat:4uvpglqgirfq5eerjowtzj7y5u

Jigsaw: Supporting Investigative Analysis through Interactive Visualization

John Stasko, Carsten Görg, Zhicheng Liu, Kanupriya Singhal
2007 2007 IEEE Symposium on Visual Analytics Science and Technology  
As the number of documents and the corresponding number of concepts and entities within the documents grow larger, sense-making processes become more and more difficult for the analysts.  ...  Jigsaw provides multiple coordinated views of document entities with a special emphasis on visually illustrating connections between entities across the different documents.  ...  Department of Homeland Security Program, under the auspices of the Southeast Regional Visualization and Analytics Center.  ... 
doi:10.1109/vast.2007.4389006 dblp:conf/ieeevast/StaskoGLS07 fatcat:ea34jltzgrhgneca76ue2t7kme