R2E: Rule-based Event Extractor

Jakub Dutkiewicz, Maciej Nowak, Czeslaw Jedrzejek
2014 International Web Rule Symposium  
In this paper we present a rule-based method of event extraction from the natural language. We use the Stanford dependency parser in order to build a relation graph of elements from input text. This structure along with serialized extraction frames is converted into a set of facts. We describe a process of creation of application of rules, which aims to match elements from the text with corresponding slots in the extraction frames. A possible match is derived by the comparison of verbal phrases
more » ... from the text with lexicalizations of anchors (constituting the most important part of each frame) stored in an ontology. The rest of the extraction frame is filled with other elements of the dependency graph, with regard to their semantic type (determined by lexicalizations of allowed types defined in frames and ontology) and their grammatical properties. We describe conversions required to create a consistent knowledge base of text phrases, classification of semantic types and instantiated slots from the extraction frames. We use the Drools engine in order to extract events from such a knowledge base.
dblp:conf/ruleml/DutkiewiczNJ14 fatcat:gqadokq7ozeufm2hbpia4ocqxq