Filters








3,268 Hits in 6.2 sec

A Blended System for Data-Driven Learning of English for Specific Purposes

Hengbin Yan
2022 International Journal of Emerging Technologies in Learning (iJET)  
English for Specific Purposes (ESP) and Data-Driven Learning (DDL) are two constructivist and student-centered approaches to language pedagogy that are well-established in second language acquisition.  ...  Built on a flexible plug-in architecture, the system utilizes state-of-the-art NLP tools for efficient multilayered linguistic annotation and indexing, made query-able through a user-friendly web interface  ...  GD20CWY15), and the Bilingual Cognition and Development Lab, Center for Linguistics and Applied Linguistics, Guangdong University of Foreign Studies (Grant No. BCD20202).  ... 
doi:10.3991/ijet.v17i12.29653 doaj:5c24f1b37a9c45259f6cf6a9286aa5f0 fatcat:baekeqrpcndebbgs22r3hulth4

Identification of Caused Motion Construction

Jena D. Hwang, Martha Palmer
2015 Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics  
We expand on a previous study on the classification CMCs (Hwang et al., 2010) to show that CMCs can be successfully identified in the corpus data.  ...  This research describes the development of a supervised classifier of English Caused Motion Constructions (CMCs) (e.g. The goalie kicked the ball into the field).  ...  Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.  ... 
doi:10.18653/v1/s15-1006 dblp:conf/starsem/HwangP15 fatcat:lskq6jal6bcjxd2ueg3kw3mv6m

Incorporating Coercive Constructions into a Verb Lexicon

Claire Bonial, Susan Windisch Brown, Jena D. Hwang, Christopher Parisien, Martha Palmer, Suzanne Stevenson
2011 Annual Meeting of the Association for Computational Linguistics  
We use unsupervised methods to estimate probabilistic measures from corpus data for predicting usage of the construction across verb classes in the lexicon and evaluate against VerbNet.  ...  We focus on CAUSED-MOTION as an example construction occurring with verbs for which it is a typical usage or for which it must be interpreted as extending the event semantics through coercion, which occurs  ...  We then test our measures over corpus data, manually annotated for use of the CM construction.  ... 
dblp:conf/acl/BonialBHPPS11 fatcat:lei5ygcozjblrdcbjdwksgy2om

Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports

Lu Qin, Xiaowei Xu, Lingling Ding, Zixiao Li, Jiao Li
2020 BMC Medical Informatics and Decision Making  
Furthermore, we developed an annotated corpus via mapping 149 phrases to the 4188 reports. We selected 11 most frequent diagnosis evidence types such as "" (mitral stenosis) for further identifying.  ...  The generated corpus is divided into training set and testing set in the ratio of 8:2, which is used to train and validate a machine learning model to identify the evidence of cardiogenic stroke using  ...  Acknowledgements Not applicable About this supplement This article has been published as part of BMC Medical Informatics and Decision Making Volume 20 Supplement 3, 2020: Health Information Processing  ... 
doi:10.1186/s12911-020-1106-3 pmid:32646410 fatcat:ymbhlm5njbhi3pvyzb7fkvhhu4

Structure and Grammaticalization of Serial Verb Constructions in Sign Language of the Netherlands—A Corpus-Based Study

Sascha Couvee, Roland Pfau
2018 Frontiers in Psychology  
In addition, we identified some novel uses of the verbs GO and GIVE: (i) GO functioning as a future tense marker and (ii) GIVE functioning as a light verb.  ...  In serial verb constructions (SVCs), multiple independent lexical verbs are combined in a mono-clausal construction.  ...  AUTHOR CONTRIBUTIONS SC extracted the data from the corpus and annotated them.  ... 
doi:10.3389/fpsyg.2018.00993 pmid:30065671 pmcid:PMC6056838 fatcat:l4vtioqevbdwflepjo3oiuj6k4

Recent change in the productivity and schematicity of the way-construction: A distributional semantic analysis

Florent Perek
2018 Corpus Linguistics and Linguistic Theory  
This paper presents a corpus-based study of recent change in the English way-construction, drawing on data from the 1830s to the 2000s.  ...  These findings are interpreted in terms of increases in schematicity, either of the verb slot or the motion component contributed by the construction.  ...  For this reason, the corpus is well suited to the study of many syntactic constructions, and the way-construction in particular.  ... 
doi:10.1515/cllt-2016-0014 fatcat:qihforf2wna4rjnwsrvjysz5i4

The Hamburg Metaphor Database project: issues in resource creation

Birte Lönneker-Rodman
2008 Language Resources and Evaluation  
The acquisition of metaphor attestations from electronic corpora is explained, and annotation practices as well as database contents are evaluated.  ...  The paper concludes with an overview of related projects and an outline of possible future work.  ...  Carina Eilts and Astrid Reining annotated the largest part of the HMD entries. -I am grateful to three anonymous reviewers for useful comments.  ... 
doi:10.1007/s10579-008-9073-9 fatcat:bkacfstxunbidj2uhwkxidbmqq

A roadmap towards determining the universal status of semantic frames [chapter]

2020 New Approaches to Contrastive Linguistics  
The Berkeley FrameNet project, founded in 1997, organizes the lexicon of English by semantic frames (Fillmore 1982) , with valence information derived from attested, manually annotated corpus examples.  ...  This paper proposes a systematic method for identifying semantic frames that could be labeled "universal" (based only on data from languages under investigation).  ...  frame=Cotheme]. 10 Fillmore and Atkins (2000: 103) provide a much more detailed corpus study of to crawl, employing corpus data to show that the different senses of motion verbs can be represented in terms  ... 
doi:10.1515/9783110682588-002 fatcat:6ypllfix55bvlm5cgff3glo6si

Linguistics in the digital humanities: (computational) corpus linguistics

Kim Ebensgaard Jensen
2014 MedieKultur: Journal of Media and Communication Research  
Making use of both digitized data in the form of the language corpus and computational methods of analysis involving concordancers and statistics software, corpus linguistics arguably has a place in the  ...  Th is article provides an overview of the main principles of corpus linguistics and the role of computer technology in relation to data and method and also off ers a bird's-eye view of the history of corpus  ...  Introspection often fi gures in the construction of hypotheses to be tested against corpus data.  ... 
doi:10.7146/mediekultur.v30i57.15968 fatcat:eu5xechqrzhwxmfwqwgd2gwmau

PARSEME-It: an Italian corpus annotated with verbal multiword expressions

Johanna Monti, Maria Pia di Buono
2019 Italian Journal of Computational Linguistics  
a comprehensive corpus for the Italian language.  ...  The paper describes the PARSEME-It corpus, developed within the PARSEME-It project which aims at the development of methods, tools and resources for multiword expressions (MWE) processing for the Italian  ...  We are particularly grateful to Federico Sangati who always supported the annotation team and actively took part in the planning and the implementation of the project.  ... 
doi:10.4000/ijcol.483 fatcat:537lh5k3qnh3rmymc3vippijoq

Optimizing Corpus Creation for Training Word Embedding in Low Resource Domains: A Case Study in Autism Spectrum Disorder (ASD)

Yang Gu, Gondy Leroy, Sydney Pettygrove, Maureen Kelly Galindo, Margaret Kurzius-Spencer
2018 AMIA Annual Symposium Proceedings  
We evaluate the importance of corpus specificity versus size and hypothesize that for specific domains small corpora can generate excellent word embeddings.  ...  Due to diversity in its vocabulary, the abstract-based embeddings generated fewer related terms and saw minimal improvement when the size of the corpus increased.  ...  Acknowledgement The data presented in this paper were collected by the Centers for Disease Control (CDC) and Prevention Autism and Developmental Disabilities Monitoring (ADDM) Network supported by CDC  ... 
pmid:30815091 pmcid:PMC6371367 fatcat:3xzfd3hcnvhupeoac5oomfk3c4

Making fine-grained and coarse-grained sense distinctions, both manually and automatically

MARTHA PALMER, HOA TRANG DANG, CHRISTIANE FELLBAUM
2005 Natural Language Engineering  
We compare the system's performance with our human annotator performance in light of both fine-grained and coarse-grained sense distinctions and show that well-defined sense groups can be of value in improving  ...  We investigate sources of human annotator disagreements stemming from the tagging for the English Verb Lexical Sample Task in the Senseval-2 exercise in automatic Word Sense Disambiguation.  ...  We would also like to thank Scott Cotton for system infrastructure support, Joseph Rosenzweig for building the annotation tool, and Lauren Delfs and Susanne Wolff for the manual annotation for Senseval  ... 
doi:10.1017/s135132490500402x fatcat:acn6h2n5rfhthjspvzeu2yebtu

Extracting Body Function from Clinical Text

Guy Divita, Jessica Lo, Chunxiao Zhou, Kathleen Coale, Elizabeth Rasch
2021 International Joint Conference on Artificial Intelligence  
Training and test data utilized the NIH Clinical Center Rehabilitation Medicine Department records.  ...  We have created two extraction systems: a dictionary lookup rule-based version, and a conditional random field (CRF) approach based on training from manual annotations.  ...  corpus and quality assurance statistics.  ... 
dblp:conf/ijcai/DivitaLZCR21 fatcat:wyflxj6fzjdz5iycem2ktqsofy

Structural and temporal inference search (STIS)

Chreston Miller, Louis-Philippe Morency, Francis Quek
2012 Proceedings of the 14th ACM international conference on Multimodal interaction - ICMI '12  
There are a multitude of annotated behavior corpora (manual and automatic annotations) available as research expands in multimodal analysis of human behavior.  ...  Hence, we present Structural and Temporal Inference Search (STIS) to support search for relevant patterns within a multimodal corpus based on the structural and temporal nature of human interactions.  ...  These annotations are events that were extracted from the audio, video, and motion capture data and describe the officers' interactions.  ... 
doi:10.1145/2388676.2388702 dblp:conf/icmi/MillerMQ12 fatcat:nbnq6iep3vfgfihf2xyfs2gefu

A Crowdsourced Frame Disambiguation Corpus with Ambiguity [article]

Anca Dumitrache, Lora Aroyo, Chris Welty
2019 arXiv   pre-print
This is based on the idea that inter-annotator disagreement is at least partly caused by ambiguity that is inherent to the text and frames.  ...  We present a resource for the task of FrameNet semantic frame disambiguation of over 5,000 word-sentence pairs from the Wikipedia corpus.  ...  Acknowledgments We would like to thank Luigi Asprino, Valentina Presutti and Aldo Gangemi for their assistance with using the Framester corpus, as well as their advice in better understanding the task  ... 
arXiv:1904.06101v1 fatcat:xqfnfowudvcqbnpey7vp3ddd2u
« Previous Showing results 1 — 15 out of 3,268 results