One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction
[article]
2020
arXiv
pre-print
Structured information extraction from document images usually consists of three steps: text detection, text recognition, and text field labeling. ...
To alleviate these problems, we proposed a novel deep end-to-end trainable approach for one-shot text field labeling, which makes use of an attention mechanism to transfer the layout information between document ...
Text detection, text recognition, and text field labeling are the three key steps for structured information extraction [17, 23]. ...
arXiv:2009.04153v1
fatcat:ocrzc4r4f5cbzcfdtruag7gbua
Low-resource Learning with Knowledge Graphs: A Comprehensive Survey
[article]
2022
arXiv
pre-print
appeared in training, and few-shot learning (FSL) where new classes for prediction have only a small number of labeled samples that are available. ...
to reduce the reliance on labeled samples. ...
ACKNOWLEDGMENTS This work was supported by the SIRIUS Centre for Scalable Data Access (Research Council of Norway, project 237889), eBay, Samsung Research UK, Siemens AG, and the EPSRC projects OASIS ( ...
arXiv:2112.10006v5
fatcat:vxl5hnqe5jaafgwuznbz556fmm
EmergEventMine: End-to-End Chinese Emergency Event Extraction Using a Deep Adversarial Network
2022
ISPRS International Journal of Geo-Information
and few-shot Chinese emergency event extraction. ...
However, current studies on the text mining of emergency information mainly focus on text classification and event recognition, only obtaining a general and conceptual cognition about an emergency event ...
Acknowledgments: The authors would like to thank the editor and the anonymous reviewers who provided insightful comments on improving this paper. ...
doi:10.3390/ijgi11060345
doaj:6e7db2e6baa54d548dbac6eff32a2abf
fatcat:axatcoloxvd63h3cbja2fnyonm
A Survey of Content-Aware Video Analysis for Sports
2018
IEEE Transactions on Circuits and Systems for Video Technology (Print)
We believe that our findings can advance the field of research on content-aware video analysis for broadcast sports. ...
On the basis of this insight, we provide an overview of the themes particularly relevant to the research on content-aware systems for broadcast sports. ...
[223] presented a method for text localization and segmentation for images and videos, and for extracting information used for semantic indexing. Noll et al. ...
doi:10.1109/tcsvt.2017.2655624
fatcat:rwqzu46sgfb7tpkcav4ysmh6ae
A survey of joint intent detection and slot-filling models in natural language understanding
[article]
2021
arXiv
pre-print
We observe three milestones in this research so far: Intent detection to identify the speaker's intention, slot filling to label each word token in the speech/text, and finally, joint intent classification ...
domain used. ...
Natural language understanding (NLU) then takes the text and extracts the semantics for use in further processes: information gathering, question answering, dialogue management, request fulfilment, and ...
arXiv:2101.08091v3
fatcat:ai6w2imilrfupf4m5fm2rjtzxi
CMU Informedia's TRECVID 2005 Skirmishes
2005
TREC Video Retrieval Evaluation
At TRECVID 2005, CMU participated in the low-level feature extraction task, the semantic concept feature extraction task, automatic, manual and interactive search tasks and the BBC stock footage challenge ...
ACKNOWLEDGMENTS This work was supported in part by the Advanced Research and Development Activity under contract numbers H98230-04-C-0406 and NBCHC040037. ...
These shots are labeled in the field and designated as 'bye' shots, shots the filmmaker thinks are the best. ...
dblp:conf/trecvid/HauptmannBCCGJL05
fatcat:qxocospbfvfere34yyacwicrs4
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
2022
The Journal of Artificial Intelligence Research
Finally, we list additional challenges that multilinguality poses for related areas (such as speech, fluency in generated text, and human-centred evaluation), and indicate future directions that hold promise ...
This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field. ...
., 2020) and BigBIRD (Zaheer et al., 2020) propose modifying the self-attention with localised attention and sparse attention, respectively, for long sequence processing. ...
doi:10.1613/jair.1.13083
fatcat:54a6w62wxvbvvigh32zkmtwwqq
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
[article]
2022
arXiv
pre-print
This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field. ...
Hence, state-of-the-art approaches to multilingual ToD mostly rely on (zero- or few-shot) cross-lingual transfer from resource-rich languages (almost exclusively English), either by means of machine translation ...
., 2020) and BigBIRD (Zaheer et al., 2020) propose modifying the self-attention with localised attention and sparse attention, respectively, for long sequence processing. ...
arXiv:2104.08570v3
fatcat:a6vfgcvgqvhllfkgfwcr3mptgq
Sentiment Analysis Using Deep Learning Techniques: A Review
2017
International Journal of Advanced Computer Science and Applications
Unstructured data from social media needs to be analyzed and structured, and for this purpose sentiment analysis has received significant attention. ...
The challenge for sentiment analysis is the lack of sufficient labeled data in the field of Natural Language Processing (NLP). ...
Demand for sentiment analysis has risen because of the need to analyze and structure hidden information extracted from social media in the form of unstructured data. ...
doi:10.14569/ijacsa.2017.080657
fatcat:us4hwclsx5ghtjo4v5vkvfkqqm
Adaptable Conversational Machines
2020
The AI Magazine
This article reviews advancements in dialogue systems research with a focus on the adaptation methods for dialogue modeling, and ventures to have a glance at the future of research on adaptable conversational ...
Most notably, neural-network–based systems have set the state of the art for difficult tasks such as speech recognition, semantic understanding, dialogue management, language generation, and speech synthesis ...
Acknowledgments Funding has been provided by the Alexander von Humboldt Foundation within the framework of the Sofja Kovalevskaja Award, endowed by the Federal Ministry of Education and Research. ...
doi:10.1609/aimag.v41i3.5322
fatcat:m5grirvy45d7nnqwemcymwq3bq
A Survey on Event Extraction for Natural Language Understanding: Riding the Biomedical Literature Wave
2021
IEEE Access
To cope with the ever-increasing number of publications, researchers are experiencing a surge of interest in extracting valuable, structured, concise, and unambiguous information from plain texts. ...
Results: This paper provides a comprehensive and up-to-date survey on the link between event extraction and natural language understanding, focusing on the biomedical domain. ...
ACKNOWLEDGMENT The authors thank Giulio Carlassare for his contributions during productive discussions and practical experiments on biomedical corpora. ...
doi:10.1109/access.2021.3130956
fatcat:wlr7zeikdva77ojuppqx3vmocy
Description Based Text Classification with Reinforcement Learning
[article]
2020
arXiv
pre-print
In this standard formalization, categories are merely represented as indexes in the label vocabulary, and the model lacks explicit instructions on what to classify. ...
The task of text classification is usually divided into two stages: text feature extraction and classification. ...
Wang et al. (2018a) proposed a label-embedding attentive model that jointly embeds words and labels in the same latent space, and the text representations are constructed directly using the text-label ...
arXiv:2002.03067v3
fatcat:37ny7kxjjbeqxacofm7i2xue7y
Extracting semantics from audio-visual content: the final frontier in multimedia retrieval
2002
IEEE Transactions on Neural Networks
We discuss how reliance on powerful pattern recognition and machine learning techniques is increasing in the field of multimedia retrieval. ...
There is tremendous potential for effective use of multimedia content through intelligent analysis. Diverse application areas are increasingly relying on multimedia understanding systems. ...
Frey for his valuable comments on factor graphs and S. F. Chang and D. Zhong of Columbia University for the blob tracking algorithm. ...
doi:10.1109/tnn.2002.1021881
pmid:18244476
fatcat:2joztr4jnbgedmsjvbzvqqe4su
Scene text recognition and tracking to identify athletes in sport videos
2011
Multimedia Tools and Applications
In some recent approaches, scene text segmentation relies upon graphical models and belief propagation [27], methods that are also interestingly applied to text recognition [30]. ...
On the contrary, scene text is inherently embedded within the scene, for example hotel or shop placards, road signs, street names, posters. ...
The authors would like to thank Paul Chippendale for his careful reading of the manuscript. ...
doi:10.1007/s11042-011-0878-y
fatcat:5n2qqqp5ljd57k5zddb7qslz3u
Author Index
2010
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Parts by their Context Ulrich, Markus Model Globally, Match Locally: Efficient and Robust 3D Object Recognition Ulusoy, Ali Osman Workshop: Robust One-Shot 3D Scanning Using Loopy Belief Propagation Urschler ...
Taubin, Gabriel
Workshop: REVEAL Intermediate Report
Workshop: Robust One-Shot 3D Scanning Using Loopy Belief Propagation
Taylor, Geoff
Workshop: Rapidly Deployable Video Analysis Sensor Units ...
doi:10.1109/cvpr.2010.5539913
fatcat:y6m5knstrzfyfin6jzusc42p54