Filters








589,386 Hits in 2.6 sec

Web Data Extraction System [chapter]

Serguei Mankovskii, Maarten van Steen, Minos Garofalakis, Alan Fekete, Christian S. Jensen, Richard T. Snodgrass, Alex Wun, Vanja Josifovski, Andrei Broder, Dennis Fetterly, Marc Najork, Robert Baumgartner (+55 others)
2009 Encyclopedia of Database Systems  
HISTORICAL BACKGROUND The precursors of web data extraction systems were screen scrapers which are systems for extracting screen formated data from mainframe applications for terminals such as VT100 or  ...  Many web data extraction systems exhibit an XPATH-like path expression that precisely identifies the selected data item.  ... 
doi:10.1007/978-0-387-39940-9_1154 fatcat:zamqe55tt5aupa2vvgdba7wy3u

Web Data Extraction System [chapter]

Robert Baumgartner, Wolfgang Gatterbauer, Georg Gottlob
2016 Encyclopedia of Database Systems  
HISTORICAL BACKGROUND The precursors of web data extraction systems were screen scrapers which are systems for extracting screen formated data from mainframe applications for terminals such as VT100 or  ...  Many web data extraction systems exhibit an XPATH-like path expression that precisely identifies the selected data item.  ... 
doi:10.1007/978-1-4899-7993-3_1154-2 fatcat:6ghtb2rgjzgfvmm5kpah7lcm5e

Web data extraction, applications and techniques: A survey

Emilio Ferrara, Pasquale De Meo, Giacomo Fiumara, Robert Baumgartner
2014 Knowledge-Based Systems  
At the Enterprise level, Web Data Extraction techniques emerge as a key tool to perform data analysis in Business and Competitive Intelligence systems as well as for business process re-engineering.  ...  At the Social Web level, Web Data Extraction techniques allow to gather a large amount of structured data continuously generated and disseminated by Web 2.0, Social Media and Online Social Network users  ...  The popularity of the open linked data initiative prompted some authors to develop systems which support the extraction of Web content and their storage in some Semantic Web format.  ... 
doi:10.1016/j.knosys.2014.07.007 fatcat:cb6zazpx7nfgxkmkiuoxqx5zyq

Towards information system development for data extraction from web

Yulia Mukolaivna Gontar, Kateryna Victorivna Tkach, Bohdan Oleksandrovych Yena, Artem Victorovych Vasylenko
2018 Bulletin of National Technical University KhPI Series System Analysis Control and Information Technologies  
A conceptual model of extracting data was developed taking into account web space as an external data source.  ...  In this paper, we analyze the extraction of information from certain type of web sources that is required by the user. The analysis of the data extraction problem was carried out.  ...  Web data extraction system. In Encyclopedia of Database Systems. 2009. P. 3465-3471. 2. Anupam V., Freire J., Kumar B., Lieuwen D. Automating web navigation with the WebVCR. Computer Networks. 2000.  ... 
doi:10.20998/2079-0023.2018.22.08 fatcat:sqmu2j6lh5cyfdmzcrgtzq4dwm

Data extraction from Web data sources

J. Robinson
2004 Proceedings. 15th International Workshop on Database and Expert Systems Applications, 2004.  
This paper provides an explanation of the basic data structures used in a new page analysis technique to create wrappers (data extractors) for the result pages produced by web sites in response to user  ...  qeries via web page forms.  ...  The Data Extraction Algorithm The tpGrid ( Figure 5 ) provides the information needed by an extractor (wrapper) program to extract data from result pages from this web site.  ... 
doi:10.1109/dexa.2004.1333487 dblp:conf/dexaw/Robinson04 fatcat:t7od7klqy5bklmfvmwmqnnfswe

An XML-enabled data extraction toolkit for web sources

Ling Liu, Calton Pu, Wei Han
2001 Information Systems  
Hence, the web users or applications need a smart way of extracting data from these web sources.  ...  The amount of useful semi-structured data on the web continues to grow at a stunning pace. Often interesting web data are not in database systems but in HTML pages, XML pages, or text files.  ...  Fig. 1 . 1 XWRAP system architecture for data wrapping. Fig. 2 . 2 A screenshot of the hierarchical structure extraction window. subsequent errors.  ... 
doi:10.1016/s0306-4379(01)00040-0 fatcat:5dxghttqgzgnjlu2dcy2bkprxa

OLERA: Semisupervised Web-Data Extraction with Visual Support

Chia-Hui Chang, Shih-Chien Kuo
2004 IEEE Intelligent Systems  
We propose a semisupervised IE system-On-Line Extraction Rule Analysis-that lets users, with minimal effort, train extraction rules from Web pages.  ...  OLERA is a semisupervised information-extraction system that produces extraction rules from semistructured Web documents without requiring detailed annotation of the training documents.  ...  We propose a semisupervised IE system-On-Line Extraction Rule Analysis-that lets users, with minimal effort, train extraction rules from Web pages.  ... 
doi:10.1109/mis.2004.71 fatcat:7kpqyp7mjjaoto3oei3qsikc34

User-Friendly and Extensible Web Data Extraction [chapter]

T. Novella, I. Holubová
2018 Lecture Notes in Information Systems and Organisation  
Creation of web wrappers is a subject of study in the field of web data extraction.  ...  data.  ...  Type System Serrano type system inherits from the Javascript type system.  ... 
doi:10.1007/978-3-319-74817-7_14 fatcat:idadb6mmjrg33mr2qy47qcelqm

NET – A System for Extracting Web Data from Flat and Nested Data Records [chapter]

Bing Liu, Yanhong Zhai
2005 Lecture Notes in Computer Science  
This paper studies automatic extraction of structured data from Web pages. Each of such pages may contain several groups of structured data records.  ...  After the process ends, data records are found and data items in them are aligned and extracted. The method can extract data from both flat and nested data records.  ...  Our previous system DEPTA [13] is able to align and extract data items from data records, but does not handle nested data records.  ... 
doi:10.1007/11581062_39 fatcat:n7cv3a5aqfhlrgb3hgz2eit5nu

Web data extraction based on structural similarity

Zhao Li, Wee Keong Ng, Aixin Sun
2005 Knowledge and Information Systems  
Web data-extraction systems in use today mainly focus on the generation of extraction rules, i.e., wrapper induction.  ...  In this paper, we demonstrate a holistic approach to Web data extraction. The principal component of our proposal is the notion of a document schema.  ...  In recent years, Web data-extraction techniques have been applied in many automatic agent systems, such as price comparison and recommendation systems.  ... 
doi:10.1007/s10115-004-0188-z fatcat:vyqafjj67re57bxiciehhxv2cq

Similarity based Dynamic Web Data Extraction and Integration System from Search Engine Result Pages for Web Content Mining [article]

Srikantaiah K C, Suraj M, Venugopal K R, L M Patnaik
2013 arXiv   pre-print
There is an explosive growth of information in the World Wide Web thus posing a challenge to Web users to extract essential knowledge from the Web.  ...  Web Content Mining is one of the techniques that help users to extract useful information from these SERPs.  ...  ., [11] survey the major web data extraction systems and compare them in three dimensions: the task domain, the automation degree and the techniques used.  ... 
arXiv:1303.5867v1 fatcat:xjja67gp35cc7jgyffooe6kz7y

Extracting knowledge from web communities and linked data for case-based reasoning systems

Christian Severin Sauer, Thomas Roth-Berghofer
2013 Expert systems  
The process of extracting such knowledge from the diverse data types used in web communities, to transform data obtained from Linked Data sources, and then formalising it for CBR, is not an easy task.  ...  We provide details on the abilities of the KEWo to extract vocabularies from Linked Data sources and generate taxonomies from Linked Data as well as from web community data in the form of semi structured  ...  In order to extract data from the Web 2.0, respectively from a web community, and to use in a CBR system, the extracted data needs to be formalised properly to meet the formal needs of the chosen knowledge  ... 
doi:10.1111/exsy.12034 fatcat:lwcg5y742vhunnzpjwfdstwldq

State-of-the-art web data extraction systems for online business intelligence

Tomas Grigalis, Antanas Čenys
2013 Informacijos mokslai  
However, the online business intelligence presents non-trivial challenges to Web data extraction systems that must deal with technologically sophisticated modern Web pages where traditional manual programming  ...  In this paper, we review commercially available state-of-the-art Web data extraction systems and their technological advances in the context of online business intelligence.Keywords: online business intelligence  ...  Web data extraction systems are best suited for extracting data from a limited number of Web sites.  ... 
doi:10.15388/im.2013.0.1595 fatcat:4su7kllvgvg3zbfhgxoxpqzapa

Web-based closed-domain data extraction on online advertisements

Maria S. Pera, Rani Qumsiyeh, Yiu-Kai Ng
2013 Information Systems  
extraction approaches.  ...  To handle these problems, we introduce ADEx, a tool that relies on various machine learning approaches to automate the process of extracting (un-/semi-/fully-structured) data from online ads to create  ...  for extracting data from structured web pages.  ... 
doi:10.1016/j.is.2012.07.006 fatcat:5ltypqhcgrhclm2l2qnxnzca24

Advanced data extraction infrastructure: Web based system for management of time series data

S Chilingaryan, A Beglarian, A Kopmann, S Vöcking
2010 Journal of Physics, Conference Series  
ADEI Web Interface The main view of ADEI web frontend is represented on Figure 3 . The screenshot is taken using real system running at KATRIN.  ...  The higher levels of system are relaying on this abstract interface to get data in a uniform way from arbitrary storage. The ADEI web frontend is inspired by GoogleMaps interface.  ... 
doi:10.1088/1742-6596/219/4/042034 fatcat:jywp4e4u5vc6vc7dbgxybgd2vi
« Previous Showing results 1 — 15 out of 589,386 results