Filters








493 Hits in 3.7 sec

Automatic acquisition of subcategorization frames from tagged text

Michael R. Brent, Robert C. Berwick
1991 Proceedings of the workshop on Speech and Natural Language - HLT '91   unpublished
This paper describes an implemented program that takes a tagged text corpus and generates a partial list of the subcategorization frames in wtfich each verb occurs.  ...  Five subeategorization frames are currently detected and we foresee no impediment to detecting many more.  ...  Thanks also to Mark Liberman and the Penn Treebank project at the University of Pennsylvania for supplying tagged text.  ... 
doi:10.3115/112405.112478 fatcat:qc5d6jtv4zgv3fwtljpqayrax4

Automatic acquisition of subcategorization frames from untagged text

Michael R. Brent
1991 Proceedings of the 29th annual meeting on Association for Computational Linguistics -  
This paper describes an implemented program that takes a tagged text corpus and generates a partial list of the subcategorization frames in wtfich each verb occurs.  ...  Five subeategorization frames are currently detected and we foresee no impediment to detecting many more.  ...  Thanks also to Mark Liberman and the Penn Treebank project at the University of Pennsylvania for supplying tagged text.  ... 
doi:10.3115/981344.981371 dblp:conf/acl/Brent91 fatcat:tqnxz2pzxvbudcuunvidrbfsrq

Automatic acquisition of a large subcategorization dictionary from corpora

Christopher D. Manning
1993 Proceedings of the 31st annual meeting on Association for Computational Linguistics -  
This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora.  ...  Further, it is argued that this method can be used to learn all subcategorization frames, whereas previous methods are not extensible to a general solution to the problem.  ...  However, using hand-tagged text is clearly not a solution to the knowledge acquisition problem (as hand-tagging text is more laborious than collecting subcategorization frames), and so, in more recent  ... 
doi:10.3115/981574.981606 dblp:conf/acl/Manning93 fatcat:uiggnwb5dbdudm64gfxf4csyni

Page 334 of Computational Linguistics Vol. 31, Issue 3 [page]

2005 Computational Linguistics  
We will divide more-general approaches to subcategorization frame acquisition into two groups: those which extract information from raw text and those which use preparsed and hand-corrected treebank data  ...  Typically in the approaches based on raw text, a number of subcategorization patterns are predefined, a set of verb subcategorization frame associations are hypothesized from the data, and statistical  ... 

Automatic acquisition of adjectival subcategorization from corpora

Jeremy Yallop, Anna Korhonen, Ted Briscoe
2005 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics - ACL '05  
This paper describes a novel system for acquiring adjectival subcategorization frames (SCFs) and associated frequency information from English corpus data.  ...  A new tool for linguistic annotation of SCFs in corpus data is also introduced which can considerably alleviate the process of obtaining training and test data for subcategorization acquisition.  ...  Introduction Research into automatic acquisition of lexical information from large repositories of unannotated text (such as the web, corpora of published text, etc.) is starting to produce large scale  ... 
doi:10.3115/1219840.1219916 dblp:conf/acl/YallopKB05 fatcat:75qypk47w5blxjfqs7cnsapwwq

Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II and Penn-III Treebanks

Ruth O'Donovan, Michael Burke, Aoife Cahill, Josef van Genabith, Andy Way
2005 Computational Linguistics  
To our knowledge, this is the largest and most complete evaluation of subcategorization frames acquired automatically for English.  ...  In contrast to many other approaches, ours does not predefine the subcategorization frame types extracted, learning them instead from the source data.  ...  We will divide more-general approaches to subcategorization frame acquisition into two groups: those which extract information from raw text and those which use preparsed and hand-corrected treebank data  ... 
doi:10.1162/089120105774321073 fatcat:sluravdlqje47psb3h2fs2sn6q

Bootstrapping a Verb Lexicon for Biomedical Information Extraction [chapter]

Giulia Venturi, Simonetta Montemagni, Simone Marchi, Yutaka Sasaki, Paul Thompson, John McNaught, Sophia Ananiadou
2009 Lecture Notes in Computer Science  
The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units.  ...  The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts  ...  The National Centre for Text Mining is sponsored by the JISC/BBSRC/EPSRC.  ... 
doi:10.1007/978-3-642-00382-0_11 fatcat:dbycmvokhraslivqktnjlo7paq

Acquisition and evaluation of verb subcategorization resources for biomedicine

Laura Rimell, Thomas Lippincott, Karin Verspoor, Helen L. Johnson, Anna Korhonen
2013 Journal of Biomedical Informatics  
There are currently a limited number of biomedical resources containing information about subcategorization frames (SCFs), and these are the result of either labor-intensive manual collation, or automatic  ...  We evaluate the SCF acquisition methodologies for BioCat with respect to the gold standards, and compare the results with the accuracy of the only previously existing automatically-acquired SCF lexicon  ...  Conclusions Our study has provided some insights into the current state of verb subcategorization frame acquisition for biomedicine.  ... 
doi:10.1016/j.jbi.2013.01.001 pmid:23347886 fatcat:agqvza5c3vh5bamhiug7hf2i2y

Approaches to verb subcategorization for biomedicine

Thomas Lippincott, Laura Rimell, Karin Verspoor, Anna Korhonen
2013 Journal of Biomedical Informatics  
Information about verb subcategorization frames (SCFs) is important to many tasks in natural language processing (NLP) and, in turn, text mining.  ...  We then describe the typical interpretation of subcategorization in biomedical text, and how subcategorization information can improve NLP and text mining applications in biomedicine.  ...  While automatic subcategorization acquisition techniques are relatively well-developed for general English text, and several SCF lexicons have been produced [5] [6] [7] , there are few comparable techniques  ... 
doi:10.1016/j.jbi.2012.12.001 pmid:23276747 fatcat:5x2cd375f5gf5hi6jjdk5hkiwa

Page 17 of Computational Linguistics Vol. 27, Issue 3 [page]

2001 Computational Linguistics  
Computational Linguistics Volume 27, Number 3 on a rough collection of texts, and do not require a carefully balanced corpus or time- consuming semantic tagging. 7.  ...  frequency distributions of subcategorization frames within and across classes can disambiguate the usages of a verb with more than one known lexical semantic class.  ... 

Automatic Extraction of Subcategorization Frames for Czech [article]

Anoop Sarkar, Daniel Zeman
2000 arXiv   pre-print
We show how the learning algorithm can be used to discover previously unknown subcategorization frames from the Czech Prague Dependency Treebank.  ...  We present some novel machine learning techniques for the identification of subcategorization information for verbs in Czech.  ...  We need some general method for the automatic extraction of subcategorization information from text corpora.  ... 
arXiv:cs/0009003v1 fatcat:xnaolebamrbcxm2aus57i4o52i

First experiences of using semantic knowledge learned by ASIUM for information extraction task using INTEX

David Faure, Thierry Poibeau
2000 European Conference on Artificial Intelligence  
We describe a first experiment of coupling an information extraction system based and the machine learning system ASIUM.  ...  Our aim in this article is to show how semantic knowledge learned for a specific domain can help the creating of a powerful information extraction system.  ...  Acknowledgment The research from Thierry Poibeau is partially funded by a Cifre grant between the Laboratoire Central de Recherches of Thomson-CSF and the Laboratoire d'Informatique de l'Université de  ... 
dblp:conf/ecai/FaureP00 fatcat:g7gbi6obnfb2xmuhxa7w3ertrm

A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora

Judita Preiss, Ted Briscoe, Anna Korhonen
2007 Annual Meeting of the Association for Computational Linguistics  
This paper describes the first system for large-scale acquisition of subcategorization frames (SCFs) from English corpus data which can be used to acquire comprehensive lexicons for verbs, nouns and adjectives  ...  The system incorporates an extensive rulebased classifier which identifies 168 verbal, 37 adjectival and 31 nominal frames from grammatical relations (GRs) output by a robust parser.  ...  Introduction Research into automatic acquisition of lexical information from large repositories of unannotated text (such as the web, corpora of published text, etc.) is starting to produce large scale  ... 
dblp:conf/acl/PreissBK07 fatcat:wnktrafqtrc7lanw2vxg6ioqru

Application of finite-state transducers to the acquisition of verb subcategorization information

I. ALDEZABAL, M. ARANZABE, K. GOJENOLA, M. ORONOZ, K. SARASOLA, A. ATUTXA
2003 Natural Language Engineering  
This paper presents the design and implementation of a finite-state syntactic grammar of Basque that has been used with the objective of extracting information about verb subcategorization instances from  ...  newspaper texts.  ...  of Education, Universities and Research of the Basque Country, the University of the Basque Country and the Interministerial Commision for Science and Technology (CICYT).  ... 
doi:10.1017/s1351324903003097 fatcat:5avnos6n7vcttibe6bdxu2ypgi

A procedure to automatically enrich verbal lexica with subcategorization frames

I. Castellón, L. Alonso Alemany, N.T. Tincheva
2008 Inteligencia Artificial  
In this paper we introduce a method for automatically assigning subcategorization frames to previously unseen verbs of Spanish, as an aid to syntactical analysis.  ...  Since there is not a consensus on the classes of subcategorization frames, we combine supervised and unsupervised learning.  ...  Acknowledgements This research has been partially funded by project KNOW (TIN2006-1549-C03-02) from the Spanish Ministry of Education and Science, a Beatriu de Pinós Postdoctoral Fellowship granted by  ... 
doi:10.4114/ia.v12i37.957 fatcat:rif6iek7nfbj5l7okkqeypkjgi
« Previous Showing results 1 — 15 out of 493 results