
Personalized Web Services for Web Information Extraction [article]

Zahi Jarir, Mohamed Quafafou, Mahammed Erradi
2011 arXiv   pre-print
The field of information extraction from the Web emerged with the growth of the Web and the multiplication of online data sources. This paper is an analysis of information extraction methods.  ...  It presents a service oriented approach for web information extraction considering both web data management and extraction services.  ...  Web Information Extraction Task In order to build a complete information extraction task it is necessary to coordinate the basic tasks.  ... 
arXiv:1108.5460v1 fatcat:4juwvla3arcapbx2gkbhnmw36u

Identifying comparable entities on the web

Alpa Jain, Patrick Pantel
2009 Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09  
With this in mind, we present an initial step of mining comparable entities from sources of information available to a large-scale Web search engine, namely, search query logs and documents from a Web  ...  Web search engines are often presented with user queries that involve comparisons of real-world entities.  ...  Earlier approaches to building information extraction systems relied on hand-crafted extraction rules [4] .  ... 
doi:10.1145/1645953.1646198 dblp:conf/cikm/JainP09 fatcat:zyb5oysurvdhhbintsoninlfou

Learning domain ontologies for Web service descriptions

Marta Sabou, Chris Wroe, Carole Goble, Gilad Mishne
2005 Proceedings of the 14th international conference on World Wide Web - WWW '05  
However, building such domain ontologies is a time consuming and difficult task.  ...  Based on the evaluation of the extracted ontology in the context of the project, we conclude that the proposed extraction method is a helpful tool to support the process of building domain ontologies for  ...  Despite their importance, few domain ontologies for web service descriptions exist and building them is a challenging task.  ... 
doi:10.1145/1060745.1060776 dblp:conf/www/SabouWGM05 fatcat:d65ihoi7yjcaza4sfdl3dyqa6u

Kenyon-web: Reconfigurable web-based feature extractor

Sunghun Kim, Shivkumar Shivaji, E. James Whitehead
2009 2009 IEEE 17th International Conference on Program Comprehension  
Since reusable feature extraction tools are not available, each MSR research group builds their own extraction tool, a duplication of effort.  ...  Kenyon-web is fully reconfigurable, pluggable, and serves most MSR related tasks. In this report, we show the architecture of Kenyonweb and demonstrate its utility by showcasing a sample MSR task.  ...  After extracting all changes from repository, users can add multiple tasks as plug-ins to polish or process useful information from the extracted data.  ... 
doi:10.1109/icpc.2009.5090061 dblp:conf/iwpc/KimSW09 fatcat:srwdovg2vrg3dh55v5cgqaf2va

Extraction of Meaningful Information from the Web: a Brief Survey

Santosh V. Chobe, Dr. Shirish S. Sane
2018 International Journal of Engineering & Technology  
Relevant information in Web documents can be extracted using information extraction and presented in a structured format. By applying information extraction techniques, information can be extracted from  ...  Therefore, to transform the Web pages into databases, Information Extraction (IE) systems are needed.  ...  Conclusion In this paper we have presented major information extraction tools for building wrappers for information extraction from Web documents.  ... 
doi:10.14419/ijet.v7i4.19.28283 fatcat:c53o6mtukndgtgszney7dyfdiy

Supporting Ideation by Integrating Exploratory Search, Browsing, and Curation

Yin Qu
2016 Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval - CHIIR '16  
To enable working with web semantics, we develop a novel type system that brings together data models, dynamic extraction, and presentation of semantic information.  ...  We use web semantics as a basis for summarizing and representing heterogeneous content from diverse sources involved in ideation tasks.  ...  Use extracted summaries to build integrated exploratory search and browsing interfaces.  ... 
doi:10.1145/2854946.2854948 dblp:conf/chiir/Qu16 fatcat:v3j7i5xchzguthlzw2m2aftsum

An Ontology-based Name Entity Recognition NER and NLP Systems in Arabic Storytelling

Marwa Elgamal, Mohamed Abou-Kreisha, Reda Abo Elezz, Salwa Hamada
2020 Al-Azhar Bulletin of Science  
This paper intends to investigate the problem of automatically constructing and building an Arabic storytelling ontology based on Arabic named entity recognition (NER) from unstructured story text.  ...  The system framework is a combination of five main stages: the first stage determines the requirement analysis; the second is document pre-processing using NLP tasks; the third is conceptualization.  ...  After building our ontology, an OWL file can be generated and uploaded on the web to automate the information extraction and knowledge representation from ontology.  ... 
doi:10.21608/absb.2020.44367.1088 fatcat:al6yof2kozajvot77cta2globe

Ge(o)Lo(cator): Geographic Information Extraction from Unstructured Text Data and Web Documents

Paolo Nesi, Gianni Pantaleo, Marco Tenti
2014 2014 9th International Workshop on Semantic and Social Media Adaptation and Personalization  
Introduction -Geographic Information Extraction: Application Areas  Automatic extraction and retrieval of geographic information from Web Domains and URLs is a field of large and increasing interest.  ...  The necessity arises to associate web domains to geographic information related to the owner/responsible of the web domain.  Extract geographical information potentially related with physical/legal location  ... 
doi:10.1109/smap.2014.27 dblp:conf/smap/NesiPT14 fatcat:bzle5nl5r5hajihge56uhkqtce

WEB SCALE INFORMATION EXTRACTION USING WRAPPER INDUCTION APPROACH

RINA ZAMBAD, JAYANT GADGE
2014 International Journal of Electronics and Electrical Engineering  
Information extraction from unstructured, ungrammatical data such as classified listings is difficult because traditional structural and grammatical extraction methods do not apply.  ...  The obtained post data pages are processed by page parsing, cleansing and data extraction to obtain new reference sets.  ...  To extract the search information, it builds a record database of post data on various web sites.  ... 
doi:10.47893/ijeee.2014.1121 fatcat:jh5qa2w3offqrcnkonkke7mwcm

Learning search tasks in queries and web pages via graph regularization

Ming Ji, Jun Yan, Siyu Gu, Jiawei Han, Xiaofei He, Wei Vivian Zhang, Zheng Chen
2011 Proceedings of the 34th international ACM SIGIR conference on Research and development in Information - SIGIR '11  
-Different feature spaces -Content information + click-through information  Solution -All the information we have => task similarity among queries and web pages => a task-oriented heterogeneous graph  ...  the information on two types of data (queries and web pages)?  ... 
doi:10.1145/2009916.2009928 dblp:conf/sigir/JiYGHHZC11 fatcat:bblwa26gtvbolpjkvzcrerzii4

Survey of Stages of Developing the Information Extraction Systems from the Web

Asmaa Ahmed Hamed Khalil Elsaeidy
2015 International Journal Of Mechanical Engineering And Information Technology  
Extracting useful information from the World Wide Web is an important and challenging problem. The Information Extraction (IE) task is an interesting area concerned with obtaining useful information.  ...  The main contribution is how to help the user to extract relevant information from different and changeable web pages and integrate this extracted information into a single structured file automatically  ...  (5) Has proposed to compose existing web services with information extraction predefined operators in order to build new information extraction web services.  ... 
doi:10.18535/ijmeit/v3i11.01 fatcat:nytac7qcvzelpjblwzltgmkt64

Building Mashups by Demonstration

Rattapoom Tuchinda, Craig A. Knoblock, Pedro Szekely
2011 ACM Transactions on the Web  
Our approach addresses the problems of extracting data from Web sources, cleaning and modeling the extracted data, and integrating the data across sources.  ...  The latest generation of WWW tools and services enables Web users to generate applications that combine content from multiple sources. This type of Web application is referred to as a mashup.  ...  While there exist attempts to facilitate the process of building information integration applications, none is sufficiently easy to use to enable a Web user to build an end-to-end information integration  ... 
doi:10.1145/1993053.1993058 fatcat:dd6i6jfm2bbprhcgstv5n6my7q

Introduction to Information Extraction: Basic Notions and Current Trends

Wolf-Tilo Balke
2012 Datenbank-Spektrum  
This introduction gives a broad overview about the major topics and current trends in information extraction.  ...  This is especially true for the current efforts to turn the World Wide Web being the world's largest collection of information into the world's largest knowledge base.  ...  The task of building a knowledge base generally requires selecting of data sources, extracting the respective entities, classes, and relationships, and then finally integrating and linking the new information  ... 
doi:10.1007/s13222-012-0090-x fatcat:cqfncv2xzvczhnmsnlaprmx3lm

Learning domain ontologies for semantic Web service descriptions

Marta Sabou, Chris Wroe, Carole Goble, Heiner Stuckenschmidt
2005 Journal of Web Semantics  
task.  ...  In this paper we report on the first stage of research that aims to develop (semi-)automatic ontology learning tools in the context of Web services that can support domain experts in the ontology building  ...  This work was carried out in the context of WonderWeb, an EU Information Society Technologies (IST) funded project (EU IST 2001-33052).  ... 
doi:10.1016/j.websem.2005.09.008 fatcat:vieeqt3wirg3feoe23vu5ndfda

Learning Domain Ontologies for Semantic Web Service Descriptions

Marta Sabou, Chris Wroe, Carole Goble, Heiner Stuckenschmidt
2005 Social Science Research Network  
task.  ...  In this paper we report on the first stage of research that aims to develop (semi-)automatic ontology learning tools in the context of Web services that can support domain experts in the ontology building  ...  This work was carried out in the context of WonderWeb, an EU Information Society Technologies (IST) funded project (EU IST 2001-33052).  ... 
doi:10.2139/ssrn.3199264 fatcat:6eamwvic3rce3dvfxh24s3sja4