A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Structured Querying of Web Text Data: A Technical Challenge
2007
Conference on Innovative Data Systems Research
The Web contains a huge amount of text that is currently beyond the reach of structured access tools. This unstructured data often contains a substantial amount of implicit structure, much of which can be captured using information extraction (IE) algorithms. By combining an IE system with an appropriate data model and query language, we could enable structured access to all of the Web's unstructured data. We propose a general-purpose query system called the extraction database, or ExDB, which
dblp:conf/cidr/CafarellaRSE07
fatcat:unbtublzdzcqzhen5sai32rn3y