Filters








142 Hits in 2.4 sec

Web Data Extraction for Business Intelligence: The Lixto Approach

Georg Gottlob
2005 Datenbanksysteme für Business, Technologie und Web  
The extraction from semi-structured information sources is mostly done manually and is therefore very time consuming.  ...  The World Wide Web provides public domain information which can be retrieved for example from Web sites or online shops.  ...  Acknowledgements The authors would like to thank Giacomo del Felice from Pirelli Pneumatici S.p.A. for his continuous and reliable project support.  ... 
dblp:conf/btw/Gottlob05 fatcat:vu42jphwwbbg5psbvkjalpev3e

Supervised Wrapper Generation with Lixto

Robert Baumgartner, Sergio Flesca, Georg Gottlob
2001 Very Large Data Bases Conference  
We illustrate basic features of the Lixto wrapper generator such as the user and system interaction, the capacious visual interface, the marking and selecting procedures, and the extraction tasks by describing  ...  the construction of a simple example program in the current Lixto prototype.  ...  Introduction Lixto is a fully visual and interactive wrapper generation tool whose features are described in [1] .  ... 
dblp:conf/vldb/BaumgartnerFG01a fatcat:jslxlwtlefadnjevij45rrppry

Visual Web Information Extraction with Lixto

Robert Baumgartner, Sergio Flesca, Georg Gottlob
2001 International Conference on Knowledge Capture  
We present new techniques for supervised wrapper generation and automated web information extraction, and a system called Lixto implementing these techniques [6] .  ...  Our system can generate wrappers which translate relevant pieces of HTML pages into XML.  ...  The final two patterns are string patterns. TESTING THE LIXTO TOOL We chose twelve example sites (Table 1 ), some of which were already used for testing purposes by other wrapper generators.  ... 
dblp:conf/kcap/BaumgartnerFG01 fatcat:p43sb2igenewfn4yro4c2gusba

Web Data Extraction System [chapter]

Robert Baumgartner, Wolfgang Gatterbauer, Georg Gottlob
2016 Encyclopedia of Database Systems  
The Lixto Suite comprises the Lixto Visual Developer (VD), a fully visual and interactive wrapper generation framework, and the Java-based Lixto Transformation Server providing a scalable runtime and data  ...  Denodo additionally offers a tool called Aracne for document crawling and indexing. Lixto.  ... 
doi:10.1007/978-1-4899-7993-3_1154-2 fatcat:6ghtb2rgjzgfvmm5kpah7lcm5e

Web Data Extraction System [chapter]

Serguei Mankovskii, Maarten van Steen, Minos Garofalakis, Alan Fekete, Christian S. Jensen, Richard T. Snodgrass, Alex Wun, Vanja Josifovski, Andrei Broder, Dennis Fetterly, Marc Najork, Robert Baumgartner (+55 others)
2009 Encyclopedia of Database Systems  
The Lixto Suite comprises the Lixto Visual Developer (VD), a fully visual and interactive wrapper generation framework, and the Java-based Lixto Transformation Server providing a scalable runtime and data  ...  Denodo additionally offers a tool called Aracne for document crawling and indexing. Lixto.  ... 
doi:10.1007/978-0-387-39940-9_1154 fatcat:zamqe55tt5aupa2vvgdba7wy3u

Scalable web data extraction for online market intelligence

Robert Baumgartner, Georg Gottlob, Marcus Herzog
2009 Proceedings of the VLDB Endowment  
Lixto (www.lixto.com), a company offering data extraction tools and services, has been providing OMI solutions for several customers.  ...  In this paper we show how Lixto has tackled each of the above challenges by improving and extending its original data extraction software.  ...  A survey of various wrapper generation tools, that still covers most existing tools can be found in [15, 20] .  ... 
doi:10.14778/1687553.1687580 fatcat:lo6xmbvu3fheda5xf2imopjymq

The Lixto data extraction project

Georg Gottlob, Christoph Koch, Robert Baumgartner, Marcus Herzog, Sergio Flesca
2004 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS '04  
We describe the visual wrapper specification process in Lixto and various practical aspects of wrapping.  ...  Then we present theoretical results on monadic datalog over trees and on Elog, its close relative which is used as the internal wrapper language in the Lixto system.  ...  Then we introduce the core visual specification procedure used in the Lixto wrapper generator [3, 4] . Finally, the Elog wrapping language is presented.  ... 
doi:10.1145/1055558.1055560 dblp:conf/pods/GottlobKBHF04 fatcat:3ahbmitxyjd2fm447nn6523ica

Declarative Information Extraction, Web Crawling, and Recursive Wrapping with Lixto [chapter]

Robert Baumgartner, Sergio Flesca, Georg Gottlob
2001 Lecture Notes in Computer Science  
Lixto is a system and method for the visual and interactive generation of wrappers for Web pages under the supervision of a human developer, for automatically extracting information from Web pages using  ...  such wrappers, and for translating the extracted content into XML.  ...  Wrapper generators are software tools that generate wrappers via induction (such as e.g.  ... 
doi:10.1007/3-540-45402-0_2 fatcat:jssfkyxvlbdchhdyownkbtbqey

The Personal Publication Reader: Illustrating Web Data Extraction, Personalization and Reasoning for the Semantic Web [chapter]

Robert Baumgartner, Nicola Henze, Marcus Herzog
2005 Lecture Notes in Computer Science  
Our approach consists of two main parts: The web data extraction part, providing the information system with real-time, dynamic data, and the personalization part, which deduces -with the aid of ontological  ...  The prototype of the system has been realized using the Personal Reader Framework for designing, implementing, and maintaining Web content Readers 1 .  ...  The information provision part for the Personal Publication Reader is solved by using the Lixto approach.  ... 
doi:10.1007/11431053_35 fatcat:ylopabnxubg3jc7m2acbnikqna

The INFOMIX system for advanced integration of incomplete and inconsistent data

Nicola Leone, Riccardo Rosati, Domenico Lembo, Maurizio Lenzerini, Marco Ruzzi, Edyta Kalka, Bartosz Nowicki, Witold Staniszkis, Gianluigi Greco, Giovambattista Ianni, Vincenzino Lio, Giorgio Terracina (+4 others)
2005 Proceedings of the 2005 ACM SIGMOD international conference on Management of data - SIGMOD '05  
By LiXto wrappers and tools [1, 8] , powerful data extraction and preprocessing is possible.  ...  Visual wrappers support full interactive development of wrappers at design time. Currently, there is support for developing LiXto wrappers [1] and pipes as well as for Rodan's Data Extractor.  ... 
doi:10.1145/1066157.1066286 dblp:conf/sigmod/LeoneGILTEFFGRLLRKNS05 fatcat:2ni77egqsnf6nidag5ulrlc2lu

A flight meta-search engine with metamorph

Bernhard Kruepl, Wolfgang Holzinger, Yansen Darmaputra, Robert Baumgartner
2009 Proceedings of the 18th international conference on World wide web - WWW '09  
We show how data can be extracted from web forms (rather than the data behind web forms) to generate a graph of flight connections between cities.  ...  Metamorph provides mechanisms to model web forms together with the interactions which are needed to fulfil a request, and can generate interaction sequences that pose queries using these web forms and  ...  Results are piped into wrappers that were interactively generated on a per-website basis using the Lixto Visual Developer tool.  ... 
doi:10.1145/1526709.1526860 dblp:conf/www/KruplHDB09 fatcat:4x34rojhfvhfxix3imkg7bvovq

Logic-based web information extraction

Georg Gottlob, Christoph Koch
2004 SIGMOD record  
Acknowledgments This work was supported by the Austrian Science Fund (FWF) under project No. Z29-N04 and the GAMES Network of Excellence of the European Union.  ...  Visual Wrapper Specification In this section, we introduce the core visual specification procedure used in the Lixto wrapper generator [2, 3] . Lixto uses a wrapping language called Elog.  ...  These are just two reasons for which wrapping tools need to assist humans to render the creation of wrappers a more manageable task.  ... 
doi:10.1145/1024694.1024711 fatcat:6ogbizzxlnak3bbb4rif3irp64

Towards building logical views of websites

Zehua Liu, Wee Keong Ng, Ee-Peng Lim, Feifei Li
2004 Data & Knowledge Engineering  
Using the tool, the time required to construct a logical data model for a given Website is significantly reduced.  ...  To enable easy and rapid creation of such data models, we have implemented a visual tool, called the Mapping Wizard, to facilitate and automate the process of producing Wiccap Data Models.  ...  Although DEByE and Lixto both have an expressive user interface for creating wrappers, the tools available are mostly for wrapper generation operations that are of the lowest granularity, i.e. for specifying  ... 
doi:10.1016/j.datak.2003.10.004 fatcat:hwhw7sd6ibac3ivwlvtqu3y4sy

The Personal Publication Reader [chapter]

Fabian Abel, Robert Baumgartner, Adrian Brooks, Christian Enzi, Georg Gottlob, Nicola Henze, Marcus Herzog, Matthias Kriesell, Wolfgang Nejdl, Kai Tomaschewski
2005 Lecture Notes in Computer Science  
, and the user interface creation step in which the RDF-descriptions resulting from the reasoning step are interpreted and translated into an appropriate, personalized user interface.  ...  The application comprises four steps: The information gathering step, in which information from distributed, heterogenous sources is extracted and enriched with machine-readable semantics, the operation  ...  Availability of the Personal Publication Reader The concept of the Personal Publication Reader and its functionality are summarized in a video, and so are the web data extraction and maintenance tasks.  ... 
doi:10.1007/11574620_75 fatcat:zxkshpjsvbdgzctnazkwvy3mly

Visual Programming of Web Data Aggregation Applications

Robert Baumgartner, Georg Gottlob, Marcus Herzog
2003 International Joint Conference on Artificial Intelligence  
In this paper we will present a fully visual development environment for Web data aggregation applications, the Lixto Transformation Server (TS).  ...  Most of the information needs today can be satisfied by searching and browsing the Web.  ...  Lixto Visual Wrapper allows for very expressive extraction programs [8] , that can be generated by means of a visual builder interface.  ... 
dblp:conf/ijcai/BaumgartnerGH03 fatcat:symguo7gv5aajnfttc3sfosley
« Previous Showing results 1 — 15 out of 142 results