Filters








48 Hits in 2.9 sec

The SystemT IDE

Laura Chiticariu, Sriram Raghavan, Frederick R. Reiss, Shivakumar Vaithyanathan, Huaiyu Zhu, Vivian Chu, Sajib Dasgupta, Thilo W. Goetz, Howard Ho, Rajasekar Krishnamurthy, Alexander Lang, Yunyao Li (+1 others)
2011 Proceedings of the 2011 international conference on Management of data - SIGMOD '11  
Information Extraction (IE) -the problem of extracting structured information from unstructured text -has become the key enabler for many enterprise applications such as semantic search, business analytics  ...  Our demonstration showcases SystemT IDE, the integrated development environment for SystemT, a state-of-the-art rulebased IE system from IBM Research that has been successfully embedded in multiple IBM  ...  Demonstration Focus. [8]ents, and many more.Figure1illustrates the architecture of SystemT, which consists of AQL[8], a declarative language for expressing IE rules, a cost-based optimizer for compiling  ... 
doi:10.1145/1989323.1989479 dblp:conf/sigmod/ChiticariuCDGHKLLLRRVZ11 fatcat:ka2wiqpse5cjxisalzvp57ofbm

Next generation data analytics at IBM research

Oktie Hassanzadeh, Anastasios Kementsietsidis, Benny Kimelfeld, Rajasekar Krishnamurthy, Fatma Özcan, Ippokratis Pandis
2013 Proceedings of the VLDB Endowment  
We developed Jaql [2] , a declarative scripting language for enterprise data analysis.  ...  One such technology is SystemT [5] that exploits AQL, a declarative rule language for Information Extraction (IE), where an intuitive IE algebra [10] is decoupled from the runtime optimization.  ... 
doi:10.14778/2536222.2536246 fatcat:dvt4wqvbpvajvlw3i25kkmmfc4

INDREX

Torsten Kilias, Alexander Löser, Periklis Andritsos
2013 Proceedings of the sixteenth international workshop on Data warehousing and OLAP - DOLAP '13  
We propose the INDREX system that enables a user for the first time to describe corpus-wide extraction tasks in a declarative language and permits the user to run interactive rule refinement queries.  ...  We store the text corpus and rules in the same RDBMS that already holds domain specific structured data.  ...  Finding the right abstraction level for 'declarative programming' on both, text and relational data, is a difficult task.  ... 
doi:10.1145/2513190.2513196 dblp:conf/dolap/KiliasLA13 fatcat:xyxifmgeyjecnndwsqay36nse4

A System for Extracting Sentiment from Large-Scale Arabic Social Data [article]

Hao Wang, Vijay R. Bommireddipalli, Ayman Hanafy, Mohamed Bahgat, Sara Noeman, Ossama S. Emam
2015 arXiv   pre-print
This paper describes an enterprise system we developed for extracting sentiment from large volumes of social data in Arabic dialects.  ...  Lastly, we demonstrate the value of enriching sentiment results with user profiles in understanding sentiments of a specific user group.  ...  A core concept in SystemT is AQL [21] , a declarative rule language with a SQL-like syntax. AQL replaces multiple obscure languages typically used to build extractors.  ... 
arXiv:1511.04661v1 fatcat:exqxj4w5ujbizi36ibtwwm7pt4

Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases

Gerhard Weikum, Xin Luna Dong, Simon Razniewski, Fabian Suchanek
2021 Foundations and Trends in Databases  
Zhu, "Systemt: Declarative text understanding for enterprise", in North American Chapter of the Association for Computational Linguistics (NAACL), 2018. doi: 10.18653/v1/n18-3010. [70] P. M.  ...  Hovy, "Learning surface text patterns for a question answering system", in Annual Meeting of the Association for Computational Linguistics (ACL), 2002. [Online].  ... 
doi:10.1561/1900000064 fatcat:5pgpa743svgclmkxuvizfnrtxy

Spanners

Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren
2013 Proceedings of the 32nd symposium on Principles of database systems - PODS '13  
This framework is driven by SystemT, an IBM commercial product for text analysis, where the primitive representation is that of regular expressions with capture variables.  ...  An intrinsic part of information extraction is the creation and manipulation of relations extracted from text.  ...  We also thank the SystemT group their intensive work in establishing the system, and for useful input.  ... 
doi:10.1145/2463664.2463665 dblp:conf/pods/FaginKRV13 fatcat:ovwzubpttrdlnofv4l4iaq7g64

Feature Engineering for Knowledge Base Construction [article]

Christopher Ré, Amir Abbas Sadeghian, Zifei Shan, Jaeho Shin, Feiran Wang, Sen Wu, Ce Zhang
2014 arXiv   pre-print
We think of DeepDive as declarative in that one specifies what they want but not how to get it.  ...  For the last several years, our group has been building knowledge bases with scientific collaborators.  ...  declarative queries.  ... 
arXiv:1407.6439v3 fatcat:uwbsndqym5h5vdmscrlipmai2m

INDREX: In-database relation extraction

Torsten Kilias, Alexander Löser, Periklis Andritsos
2015 Information Systems  
The management of text data has a long-standing history in the human mankind. A particular common task is extracting relations from text.  ...  Therefore, end users often desire a single system for both analytical and relation extraction tasks.  ...  Therefore another contribution is the proposal of a suite of business-oriented queries for typical enterprise tasks across text and relational data.  ... 
doi:10.1016/j.is.2014.11.006 fatcat:jrweeola2ffanclmmfw5bpdff4

Exploring Relational Features and Learning under Distant Supervision for Information Extraction Tasks

Ajay Nagesh
2015 Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop  
SystemT and AQL SystemT is a declarative Information Extraction system based on an algebraic framework.  ...  using SystemT were parallelized.  ...  A view is a logical description of a set of tuples in terms of (i) the document text (denoted as a special view called Document), and (ii) the contents of other views, as specified in the from clauses  ... 
doi:10.3115/v1/n15-2006 dblp:conf/naacl/Nagesh15 fatcat:3nhbkrm4vnhjvh37usn72wvb7i

Query-driven on-the-fly knowledge base construction

Dat Ba Nguyen, Abdalghani Abujabal, Nam Khanh Tran, Martin Theobald, Gerhard Weikum
2017 Proceedings of the VLDB Endowment  
., for the task of ad-hoc question answering. PVLDB Reference Format:  ...  Another ground-breaking project in this space is SystemT [11, 12, 45] , which uses declarative rules for IE in a wide range of applications, including enterprise content analytics.  ...  Declarative approaches to IE and KBP, such as DeepDive [42, 48, 57] and SystemT [11, 12, 45] , require specifications of predicates and rules.  ... 
doi:10.14778/3151113.3151119 fatcat:tll2ue5gmraxbfbrc3r5b7sdri

The IBM Research Accelerated Discovery Lab

Laura Haas, Melissa Cefkin, Cheryl Kieliszewski, Wil Plouffe, Mary Roth
2014 SIGMOD record  
ACKNOWLEDGMENTS We thank our many research partners for allowing us to brag about their work. In particular, R. Krishnamurthy, S. Spangler, B. Reinwald and M.  ...  Some platforms, such as SystemT [3] , become indispensable to a set of projects.  ...  We rely on the work of some of these projects, for example, the scalable storage architecture (GPFS-FPO) that provides a robust alternative to HDFS, or the declarative machine learning platform that our  ... 
doi:10.1145/2694413.2694423 fatcat:hjkpk3g7ajbefbciaforhkruyq

A platform for eXtreme Analytics

A. Balmin, K. Beyer, V. Ercegovac, J. McPherson, F. Ozcan, H. Pirahesh, E. Shekita, Y. Sismanis, S. Tata, Y. Tian
2013 IBM Journal of Research and Development  
BALMIN ET AL. 4 : 7 these enormous datasets is invaluable for understanding and boosting business performance.  ...  For this purpose, we developed Jaql modules that harness SystemT [9] so that we can apply sophisticated information extraction rules and libraries in parallel.  ... 
doi:10.1147/jrd.2013.2242693 fatcat:sbdxilynrfhkndjtun4ykdtuui

Development of an Enterprise-Grade Contract Understanding System

Arvind Agarwal, Laura Chiticariu, Poornima Chozhiyath Raman, Marina Danilevsky, Diman Ghazi, Ankush Gupta, Shanmukha Guttula, Yannis Katsis, Rajasekar Krishnamurthy, Yunyao Li, Shubham Mudgal, Vitobha Munigala (+8 others)
2021 Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers   unpublished
In this paper, we describe the Transparent and Expert Contract Understanding System (TECUS): a commercial system designed and deployed for contract understanding and used by a wide range of enterprise  ...  users for the past few years.  ...  Second, it leverages SystemT, a state-of-the-art declarative text understanding engine for the enterprise (Chiticariu et al., 2010 (Chiticariu et al., , 2018)) , towards developing transparent models  ... 
doi:10.18653/v1/2021.naacl-industry.28 fatcat:rqb6iisjnbfkbgodhorzrkrdk4

Augmented Understanding and Automated Adaptation of Curation Rules [article]

Alireza Tabebordbar
2020 arXiv   pre-print
We propose: ~(1) a feature-based and automated technique for curating the raw data. ~(2) We propose an autonomic approach for adapting data curation rules. ~(3) We provide a solution to augment users in  ...  To address these challenges, in this dissertation, we present techniques, algorithms and systems for augmenting analysts in curation tasks.  ...  SystemT has been used in a wide array of enterprise applications and many information extraction systems.  ... 
arXiv:2007.08710v1 fatcat:cw4ka6pzw5ev3hlfidpfllv5sy

Document Spanners

Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren
2015 Journal of the ACM  
This framework is driven by SystemT, an IBM commercial product for text analysis, where the primitive representation is that of regular expressions with capture variables.  ...  An intrinsic part of information extraction is the creation and manipulation of relations extracted from text.  ...  We also thank the SystemT group their intensive work in establishing the system, and for useful input.  ... 
doi:10.1145/2699442 fatcat:5kzwk3k5mzbldbwufa4xb3xqoi
« Previous Showing results 1 — 15 out of 48 results