4,385 Hits in 7.6 sec

One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction [article]

Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei Lin
2020 arXiv   pre-print
Structured information extraction from document images usually consists of three steps: text detection, text recognition, and text field labeling.  ...  To alleviate these problems, we proposed a novel deep end-to-end trainable approach for one-shot text field labeling, which makes use of attention mechanism to transfer the layout information between document  ...  Text detection, text recognition, and text field labeling are the three key steps for structured information extraction [17, 23] .  ... 
arXiv:2009.04153v1 fatcat:ocrzc4r4f5cbzcfdtruag7gbua

Low-resource Learning with Knowledge Graphs: A Comprehensive Survey [article]

Jiaoyan Chen and Yuxia Geng and Zhuo Chen and Jeff Z. Pan and Yuan He and Wen Zhang and Ian Horrocks and Huajun Chen
2022 arXiv   pre-print
appeared in training, and few-shot learning (FSL) where new classes for prediction have only a small number of labeled samples that are available.  ...  to reduce the reliance on labeled samples.  ...  ACKNOWLEDGMENTS This work was supported by the SIRIUS Centre for Scalable Data Access (Research Council of Norway, project 237889), eBay, Samsung Research UK, Siemens AG, and the EPSRC projects OASIS (  ... 
arXiv:2112.10006v5 fatcat:vxl5hnqe5jaafgwuznbz556fmm

EmergEventMine: End-to-End Chinese Emergency Event Extraction Using a Deep Adversarial Network

Jianzhuo Yan, Lihong Chen, Yongchuan Yu, Hongxia Xu, Qingcai Gao, Kunpeng Cao, Jianhui Chen
2022 ISPRS International Journal of Geo-Information  
and few-shot Chinese emergency event extraction.  ...  However, current studies on the text mining of emergency information mainly focus on text classification and event recognition, only obtaining a general and conceptual cognition about an emergency event  ...  Acknowledgments: The authors would like to thank the editor and the anonymous reviewers who provided insightful comments on improving this paper.  ... 
doi:10.3390/ijgi11060345 doaj:6e7db2e6baa54d548dbac6eff32a2abf fatcat:axatcoloxvd63h3cbja2fnyonm

A Survey of Content-Aware Video Analysis for Sports

Huang-Chia Shih
2018 IEEE transactions on circuits and systems for video technology (Print)  
We believe that our findings can advance the field of research on content-aware video analysis for broadcast sports.  ...  On the basis of this insight, we provide an overview of the themes particularly relevant to the research on content-aware systems for broadcast sports.  ...  [223] presented a method for text localization and segmentation for images and videos, and for extracting information used for semantic indexing. Noll et al.  ... 
doi:10.1109/tcsvt.2017.2655624 fatcat:rwqzu46sgfb7tpkcav4ysmh6ae

A survey of joint intent detection and slot-filling models in natural language understanding [article]

H. Weld, X. Huang, S. Long, J. Poon, S. C. Han
2021 arXiv   pre-print
We observe three milestones in this research so far: Intent detection to identify the speaker's intention, slot filling to label each word token in the speech/text, and finally, joint intent classification  ...  domain used.  ...  Natural language understanding (NLU) then takes the text and extracts the semantics for use in further processes -information gathering, question answering, dialogue management, request fulfilment, and  ... 
arXiv:2101.08091v3 fatcat:ai6w2imilrfupf4m5fm2rjtzxi

CMU Informedia's TRECVID 2005 Skirmishes

Alexander G. Hauptmann, Robert V. Baron, Michael G. Christel, R. Concescu, Jiang Gao, Qin Jin, Wei-Hao Lin, J.-Y. Pan, Scott M. Stevens, Rong Yan, J. Yang, Y. Zhang
2005 TREC Video Retrieval Evaluation  
At TRECVID 2005, CMU participated in the low-level feature extraction task, the semantic concept feature extraction task, automatic, manual and interactive search tasks and the BBC stock footage challenge  ...  ACKNOWLEDGMENTS This work was supported in part by the Advanced Research and Development Activity under contract numbers H98230-04-C-0406 and NBCHC040037.  ...  These shots are labeled in the field and designated as 'bye' shots, shots the filmmaker thinks are the best.  ... 
dblp:conf/trecvid/HauptmannBCCGJL05 fatcat:qxocospbfvfere34yyacwicrs4

Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems

Evgeniia Razumovskaia, Goran Glavas, Olga Majewska, Edoardo M. Ponti, Anna Korhonen, Ivan Vulic
2022 The Journal of Artificial Intelligence Research  
Finally, we list additional challenges that multilinguality poses for related areas (such as speech, fluency in generated text, and human-centred evaluation), and indicate future directions that hold promise  ...  This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field.  ...  ., 2020) and BigBIRD (Zaheer et al., 2020) propose modifying the self-attention with localised attention and sparse attention, respectively, for long sequence processing.  ... 
doi:10.1613/jair.1.13083 fatcat:54a6w62wxvbvvigh32zkmtwwqq

Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems [article]

Evgeniia Razumovskaia, Goran Glavaš, Olga Majewska, Edoardo M. Ponti, Anna Korhonen, Ivan Vulić
2022 arXiv   pre-print
This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field.  ...  Hence, state-of-the-art approaches to multilingual ToD mostly rely on (zero- or few-shot) cross-lingual transfer from resource-rich languages (almost exclusively English), either by means of machine translation  ...  ., 2020) and BigBIRD (Zaheer et al., 2020) propose modifying the self-attention with localised attention and sparse attention, respectively, for long sequence processing.  ... 
arXiv:2104.08570v3 fatcat:a6vfgcvgqvhllfkgfwcr3mptgq

Sentiment Analysis Using Deep Learning Techniques: A Review

Qurat Tul, Mubashir Ali, Amna Riaz, Amna Noureen, Muhammad Kamranz, Babar Hayat, A. Rehman
2017 International Journal of Advanced Computer Science and Applications  
The unstructured form of data from the social media is needed to be analyzed and well-structured and for this purpose, sentiment analysis has recognized significant attention.  ...  The challenge for sentiment analysis is lack of sufficient labeled data in the field of Natural Language Processing (NLP).  ...  The demand of sentiment analysis is raised due to the requirement of analyzing and structuring hidden information, extracted from social media in form of unstructured data.  ... 
doi:10.14569/ijacsa.2017.080657 fatcat:us4hwclsx5ghtjo4v5vkvfkqqm

Adaptable Conversational Machines

Nurul Lubis, Michael Heck, Carel Van Niekerk, Milica Gasic
2020 The AI Magazine  
This article reviews advancements in dialogue systems research with a focus on the adaptation methods for dialogue modeling, and ventures to have a glance at the future of research on adaptable conversational  ...  Most notably, neural-network–based systems have set the state of the art for difficult tasks such as speech recognition, semantic understanding, dialogue management, language generation, and speech synthesis  ...  Acknowledgments Funding has been provided by the Alexander von Humboldt Foundation within the framework of the Sofja Kovalevskaja Award, endowed by the Federal Ministry of Education and Research.  ... 
doi:10.1609/aimag.v41i3.5322 fatcat:m5grirvy45d7nnqwemcymwq3bq

A Survey on Event Extraction for Natural Language Understanding: Riding the Biomedical Literature Wave

Giacomo Frisoni, Gianluca Moro, Antonella Carbonaro
2021 IEEE Access  
To cope with the everincreasing number of publications, researchers are experiencing a surge of interest in extracting valuable, structured, concise, and unambiguous information from plain texts.  ...  Results: This paper provides a comprehensive and up-to-date survey on the link between event extraction and natural language understanding, focusing on the biomedical domain.  ...  ACKNOWLEDGMENT The authors thank Giulio Carlassare for his contributions during productive discussions and practical experiments on biomedical corpora.  ... 
doi:10.1109/access.2021.3130956 fatcat:wlr7zeikdva77ojuppqx3vmocy

Description Based Text Classification with Reinforcement Learning [article]

Duo Chai, Wei Wu, Qinghong Han, Fei Wu, Jiwei Li
2020 arXiv   pre-print
In this standard formalization categories are merely represented as indexes in the label vocabulary, and the model lacks for explicit instructions on what to classify.  ...  The task of text classification is usually divided into two stages: text feature extraction and classification.  ...  Wang et al. (2018a) proposed a label-embedding attentive model that jointly embeds words and labels in the same latent space, and the text representations are constructed directly using the text-label  ... 
arXiv:2002.03067v3 fatcat:37ny7kxjjbeqxacofm7i2xue7y

Extracting semantics from audio-visual content: the final frontier in multimedia retrieval

M.R. Naphade, T.S. Huang
2002 IEEE Transactions on Neural Networks  
We discuss, how reliance on powerful pattern recognition and machine learning techniques is increasing in the field of multimedia retrieval.  ...  There is tremendous potential for effective use of multimedia content through intelligent analysis. Diverse application areas are increasingly relying on multimedia understanding systems.  ...  Frey for his valuable comments on factor graphs and S. F. Chang and D. Zhong of Columbia University for the blob tracking algorithm.  ... 
doi:10.1109/tnn.2002.1021881 pmid:18244476 fatcat:2joztr4jnbgedmsjvbzvqqe4su

Scene text recognition and tracking to identify athletes in sport videos

Stefano Messelodi, Carla Maria Modena
2011 Multimedia tools and applications  
In some recent approaches, scene text segmentation relies upon graphical models and belief propagation [27] , methods that are also interestingly applied to text recognition [30] .  ...  On the contrary, scene text is inherently embedded within the scene, for example hotel or shop placards, road signs, street names, posters.  ...  The authors would like to thank Paul Chippendale for his careful reading of the manuscript.  ... 
doi:10.1007/s11042-011-0878-y fatcat:5n2qqqp5ljd57k5zddb7qslz3u

Author Index

2010 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition  
Parts by their Context Ulrich, Markus Model Globally, Match Locally: Efficient and Robust 3D Object Recognition Ulusoy, Ali Osman Workshop: Robust One-Shot 3D Scanning Using Loopy Belief Propagation Urschler  ...  Taubin, Gabriel Workshop: REVEAL Intermediate Report Workshop: Robust One-Shot 3D Scanning Using Loopy Belief Propagation Taylor, Geoff Workshop: Rapidly Deployable Video Analysis Sensor Units  ... 
doi:10.1109/cvpr.2010.5539913 fatcat:y6m5knstrzfyfin6jzusc42p54
« Previous Showing results 1 — 15 out of 4,385 results