A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is
Proceedings of the workshop on Speech and Natural Language - HLT '91
This paper describes an implemented program that takes a tagged text corpus and generates a partial list of the subcategorization frames in wtfich each verb occurs. ... Five subeategorization frames are currently detected and we foresee no impediment to detecting many more. ... Thanks also to Mark Liberman and the Penn Treebank project at the University of Pennsylvania for supplying tagged text. ...doi:10.3115/112405.112478 fatcat:qc5d6jtv4zgv3fwtljpqayrax4
This paper describes an implemented program that takes a tagged text corpus and generates a partial list of the subcategorization frames in wtfich each verb occurs. ... Five subeategorization frames are currently detected and we foresee no impediment to detecting many more. ... Thanks also to Mark Liberman and the Penn Treebank project at the University of Pennsylvania for supplying tagged text. ...doi:10.3115/981344.981371 dblp:conf/acl/Brent91 fatcat:tqnxz2pzxvbudcuunvidrbfsrq
This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora. ... Further, it is argued that this method can be used to learn all subcategorization frames, whereas previous methods are not extensible to a general solution to the problem. ... However, using hand-tagged text is clearly not a solution to the knowledge acquisition problem (as hand-tagging text is more laborious than collecting subcategorization frames), and so, in more recent ...doi:10.3115/981574.981606 dblp:conf/acl/Manning93 fatcat:uiggnwb5dbdudm64gfxf4csyni
We will divide more-general approaches to subcategorization frame acquisition into two groups: those which extract information from raw text and those which use preparsed and hand-corrected treebank data ... Typically in the approaches based on raw text, a number of subcategorization patterns are predefined, a set of verb subcategorization frame associations are hypothesized from the data, and statistical ...
This paper describes a novel system for acquiring adjectival subcategorization frames (SCFs) and associated frequency information from English corpus data. ... A new tool for linguistic annotation of SCFs in corpus data is also introduced which can considerably alleviate the process of obtaining training and test data for subcategorization acquisition. ... Introduction Research into automatic acquisition of lexical information from large repositories of unannotated text (such as the web, corpora of published text, etc.) is starting to produce large scale ...doi:10.3115/1219840.1219916 dblp:conf/acl/YallopKB05 fatcat:75qypk47w5blxjfqs7cnsapwwq
To our knowledge, this is the largest and most complete evaluation of subcategorization frames acquired automatically for English. ... In contrast to many other approaches, ours does not predefine the subcategorization frame types extracted, learning them instead from the source data. ... We will divide more-general approaches to subcategorization frame acquisition into two groups: those which extract information from raw text and those which use preparsed and hand-corrected treebank data ...doi:10.1162/089120105774321073 fatcat:sluravdlqje47psb3h2fs2sn6q
Lecture Notes in Computer Science
The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. ... The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts ... The National Centre for Text Mining is sponsored by the JISC/BBSRC/EPSRC. ...doi:10.1007/978-3-642-00382-0_11 fatcat:dbycmvokhraslivqktnjlo7paq
There are currently a limited number of biomedical resources containing information about subcategorization frames (SCFs), and these are the result of either labor-intensive manual collation, or automatic ... We evaluate the SCF acquisition methodologies for BioCat with respect to the gold standards, and compare the results with the accuracy of the only previously existing automatically-acquired SCF lexicon ... Conclusions Our study has provided some insights into the current state of verb subcategorization frame acquisition for biomedicine. ...doi:10.1016/j.jbi.2013.01.001 pmid:23347886 fatcat:agqvza5c3vh5bamhiug7hf2i2y
Information about verb subcategorization frames (SCFs) is important to many tasks in natural language processing (NLP) and, in turn, text mining. ... We then describe the typical interpretation of subcategorization in biomedical text, and how subcategorization information can improve NLP and text mining applications in biomedicine. ... While automatic subcategorization acquisition techniques are relatively well-developed for general English text, and several SCF lexicons have been produced    , there are few comparable techniques ...doi:10.1016/j.jbi.2012.12.001 pmid:23276747 fatcat:5x2cd375f5gf5hi6jjdk5hkiwa
Computational Linguistics Volume 27, Number 3 on a rough collection of texts, and do not require a carefully balanced corpus or time- consuming semantic tagging. 7. ... frequency distributions of subcategorization frames within and across classes can disambiguate the usages of a verb with more than one known lexical semantic class. ...
We show how the learning algorithm can be used to discover previously unknown subcategorization frames from the Czech Prague Dependency Treebank. ... We present some novel machine learning techniques for the identification of subcategorization information for verbs in Czech. ... We need some general method for the automatic extraction of subcategorization information from text corpora. ...arXiv:cs/0009003v1 fatcat:xnaolebamrbcxm2aus57i4o52i
We describe a first experiment of coupling an information extraction system based and the machine learning system ASIUM. ... Our aim in this article is to show how semantic knowledge learned for a specific domain can help the creating of a powerful information extraction system. ... Acknowledgment The research from Thierry Poibeau is partially funded by a Cifre grant between the Laboratoire Central de Recherches of Thomson-CSF and the Laboratoire d'Informatique de l'Université de ...dblp:conf/ecai/FaureP00 fatcat:g7gbi6obnfb2xmuhxa7w3ertrm
This paper describes the first system for large-scale acquisition of subcategorization frames (SCFs) from English corpus data which can be used to acquire comprehensive lexicons for verbs, nouns and adjectives ... The system incorporates an extensive rulebased classifier which identifies 168 verbal, 37 adjectival and 31 nominal frames from grammatical relations (GRs) output by a robust parser. ... Introduction Research into automatic acquisition of lexical information from large repositories of unannotated text (such as the web, corpora of published text, etc.) is starting to produce large scale ...dblp:conf/acl/PreissBK07 fatcat:wnktrafqtrc7lanw2vxg6ioqru
This paper presents the design and implementation of a finite-state syntactic grammar of Basque that has been used with the objective of extracting information about verb subcategorization instances from ... newspaper texts. ... of Education, Universities and Research of the Basque Country, the University of the Basque Country and the Interministerial Commision for Science and Technology (CICYT). ...doi:10.1017/s1351324903003097 fatcat:5avnos6n7vcttibe6bdxu2ypgi
In this paper we introduce a method for automatically assigning subcategorization frames to previously unseen verbs of Spanish, as an aid to syntactical analysis. ... Since there is not a consensus on the classes of subcategorization frames, we combine supervised and unsupervised learning. ... Acknowledgements This research has been partially funded by project KNOW (TIN2006-1549-C03-02) from the Spanish Ministry of Education and Science, a Beatriu de Pinós Postdoctoral Fellowship granted by ...doi:10.4114/ia.v12i37.957 fatcat:rif6iek7nfbj5l7okkqeypkjgi
« Previous Showing results 1 — 15 out of 493 results