Web Data Extraction for Business Intelligence: The Lixto Approach

Georg Gottlob
2005 Datenbanksysteme für Business, Technologie und Web  
Knowledge of market developments and competitor activities is increasingly a critical success factor for enterprises. The World Wide Web provides publicly available information that can be retrieved, for example, from Web sites or online shops. Extraction from these semi-structured information sources is mostly done manually and is therefore very time-consuming. This paper describes how public information can be extracted automatically from Web sites, transformed into structured data formats, and used for data analysis in Business Intelligence systems.
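
The extract, transform, and analyse steps named in the abstract can be illustrated with a minimal sketch. This is not the Lixto system or its wrapper technology; it is a plain Python example using only the standard library. The HTML fragment, the class names `product`, `name`, and `price`, and the helper `extract_products` are all hypothetical and chosen only for illustration.

```python
from html.parser import HTMLParser

# Hypothetical online-shop fragment; the markup and class names are
# invented for this example and do not come from the paper.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Laptop A</span><span class="price">999.00</span></li>
  <li class="product"><span class="name">Laptop B</span><span class="price">1249.50</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collects one record per <li class="product"> element."""

    def __init__(self):
        super().__init__()
        self.records = []   # one dict per product found so far
        self._field = None  # field currently being read, if any

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if tag == "li" and cls == "product":
            self.records.append({})
        elif tag == "span" and cls in ("name", "price") and self.records:
            self._field = cls

    def handle_data(self, data):
        # Text outside a tracked <span> (e.g. whitespace) is ignored.
        if self._field is not None:
            self.records[-1][self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None


def extract_products(html):
    """Extract semi-structured product data into typed records."""
    parser = ProductParser()
    parser.feed(html)
    parser.close()
    # Transform step: convert the price text into a number.
    return [{"name": r["name"], "price": float(r["price"])} for r in parser.records]


if __name__ == "__main__":
    records = extract_products(SAMPLE_HTML)
    # Analysis step: a simple aggregate over the structured records.
    average_price = sum(r["price"] for r in records) / len(records)
    print(records)
    print(average_price)
```

Once the page content is in a list of typed records, it can be written to CSV or loaded into a database table for further analysis; a real wrapper would also need to cope with layout changes and malformed markup, which this sketch does not attempt.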
dblp:conf/btw/Gottlob05 fatcat:vu42jphwwbbg5psbvkjalpev3e