A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2016; you can also visit the original URL.
The file type is application/pdf
.
Filters
Searching web data: An entity retrieval and high-performance indexing model
2012
Journal of Web Semantics
Towards this goal, we define an entity retrieval model, develop novel methodologies for supporting this model and show how to achieve a high-performance entity retrieval system. ...
This article examines the shift from the traditional web document model to a web data object (entity) model and studies the challenges faced in implementing a scalable and high performance system for searching ...
An Entity Retrieval Model for Web Data * In this section, we introduce an entity retrieval model for semi-structured information found in distributed and heterogeneous data sources. ...
doi:10.1016/j.websem.2011.04.004
fatcat:ts4tyui34nf7ldub2l7tcavdpy
Searching Web Data: An Entity Retrieval and High-Performance Indexing Model
2012
Social Science Research Network
Towards this goal, we define an entity retrieval model, develop novel methodologies for supporting this model and show how to achieve a high-performance entity retrieval system. ...
This article examines the shift from the traditional web document model to a web data object (entity) model and studies the challenges faced in implementing a scalable and high performance system for searching ...
An Entity Retrieval Model for Web Data * In this section, we introduce an entity retrieval model for semi-structured information found in distributed and heterogeneous data sources. ...
doi:10.2139/ssrn.3198931
fatcat:mimvtyqbkbhaniijpxrl3wqu2i
Compressed data structures for annotated web search
2012
Proceedings of the 21st international conference on World Wide Web - WWW '12
Entity relationship search at Web scale depends on adding dozens of entity annotations to each of billions of crawled pages and indexing the annotations at rates comparable to regular text indexing. ...
These data structures cannot be readily built upon standard inverted indices. Here we present a Web scale entity annotator and annotation index. ...
Thanks to Natassa Ailamaki for vertical database references and Sebastiano Vigna for much help with MG4J. ...
doi:10.1145/2187836.2187854
dblp:conf/www/ChakrabartiKBRS12
fatcat:fvsoblhbtzf2lhfmrhmvzky6c4
Entity Synonyms for Structured Web Search
2012
IEEE Transactions on Knowledge and Data Engineering
Therefore, recognizing the alternative ways people use to reference an entity, is crucial for structured web search. ...
In such scenarios, there is often a mismatch between the values of structured data (how content creators describe entities) and the web queries (how different users try to retrieve them). ...
Index Terms-Entity synonym, fuzzy matching, structured data, web query, query log. ...
doi:10.1109/tkde.2011.168
fatcat:roihldkpzzeyje3mxyjbudlsga
Heterogeneous web data search using relevance-based on the fly data integration
2012
Proceedings of the 21st international conference on World Wide Web - WWW '12
For a structured query adhering to the vocabulary of just one source, the so-called seed query, we construct an entity relevance model (ERM), which captures the content and the structure of the seed query ...
Searching over heterogeneous structured data on the Web is challenging due to vocabulary and structure mismatches among different data sources. ...
Also, we thank Julien Gaugaz and the L3S Research Center for providing us their versions of the IMdb and Amazon datasets. ...
doi:10.1145/2187836.2187856
dblp:conf/www/HerzigT12
fatcat:z55wrsy5zbd3lbb2btyxqbrz4a
Semantic and distributed entity search in the web of data
2012
SIGIR Forum
The main contributions are as follows: • We develop a hybrid approach to search in the Web of Data, using elements from traditional information retrieval and structured retrieval alike. • We formalise ...
The Web of Data (WoD) is an extension of the current web, where not only documents are interlinked by means of hyperlinks but also data in terms of predicates. ...
However, current state-of-the-art web search engines crawl the We also performed an experiment with a large network of N P =1,000 peers to study the scalability of hybrid aggregation. ...
doi:10.1145/2492189.2492203
fatcat:vnqc7pfhpffhnmt7xngort6v5u
Searching and browsing Linked Data with SWSE: The Semantic Web Search Engine
2011
Journal of Web Semantics
Following traditional search engine architecture, SWSE consists of crawling, data enhancing, indexing and a user interface for search, browsing and retrieval of information; unlike traditional search engines ...
In so doing, we also give an insight into how current Semantic Web standards can be tailored, in a besteffort manner, for use on Web data. ...
SFI/08/CE/I1380 (Lion-2), and by an IRCSET postgraduate scholarship. ...
doi:10.1016/j.websem.2011.06.004
fatcat:lteloasxhvgbhp3256ehrv5wf4
Searching Web 2.0 Data Through Entity-Based Aggregation
[chapter]
2016
Lecture Notes in Computer Science
Entity searching over Web 2.0 data facilitates the retrieval of relevant information from the plethora of data available in semantic and social web applications. ...
Entity-based searching has been introduced as a way of allowing users and applications to retrieve information about a specific real world object such as a person, an event, or a location. ...
The entity store provides a repository of entities along with an index for efficient entity retrieval. ...
doi:10.1007/978-3-662-49521-6_7
fatcat:sx54fgvumzddnhlyjhx5x2sehm
Neural Networks in Big Data and Web Search
2018
Data
This survey paper presents a review of neural networks in Big Data and web search that covers web search engines, ranking algorithms, citation analysis and recommender systems. ...
As digitalization is gradually transforming reality into Big Data, Web search engines and recommender systems are fundamental user experience interfaces to make the generated Big Data within the Web as ...
from training data in an information retrieval system. ...
doi:10.3390/data4010007
fatcat:2irxpdvtfrclrbndkrubl5jvqq
Searching and Browsing Linked Data with SWSE: The Semantic Web Search Engine
2011
Social Science Research Network
Following traditional search engine architecture, SWSE consists of crawling, data enhancing, indexing and a user interface for search, browsing and retrieval of information; unlike traditional search engines ...
In so doing, we also give an insight into how current Semantic Web standards can be tailored, in a besteffort manner, for use on Web data. ...
SFI/08/CE/I1380 (Lion-2), and by an IRCSET postgraduate scholarship. ...
doi:10.2139/ssrn.3199532
fatcat:ob2ko5yfbzcqpg3fgbrysqstzi
Event Search and Analytics
2016
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining - WSDM '16
Semantic annotations such as named entities, geographic locations, and temporal expressions can help us mine events from the given corpora. ...
I pose three problems that can help unlock this knowledge vault in semantically annotated text corpora: i. identifying important events; ii. semantic search; and iii. event analytics. ...
The IR system is designed to incorporate the time dimension in an index; thus retrieving documents with text and time similarity. ...
doi:10.1145/2835776.2855083
dblp:conf/wsdm/Gupta16
fatcat:enfrvlnza5fpda3q2j4n2elni4
Data-oriented content query system
2010
Proceedings of the third ACM international conference on Web search and data mining - WSDM '10
, typed-entity search, and question answering. ...
To unify and generalize these efforts, this paper proposes a general search system-Data-oriented Content Query System (DoCQS)to search directly into document contents for finding relevant values of desired ...
Typed-Entity Search (TES) As the Web hosts all sorts of data, several efforts (e.g., [6, 5, 7, 4] ) proposed to target search at specific types of entities, such as person names near "invent" and "television ...
doi:10.1145/1718487.1718503
dblp:conf/wsdm/ZhouCC10
fatcat:eguz6sc3wbdjlasvhu4zxapcmi
Web People Search via Connection Analysis
2008
IEEE Transactions on Knowledge and Data Engineering
Our method exploits a variety of semantic information extracted from web pages, such as named entities and hyperlinks, to disambiguate among namesakes referred to on the web pages. ...
Nowadays, searches for the web pages of a person with a given name constitute a notable fraction of queries to Web search engines. ...
ACKNOWLEDGMENTS This research was supported by US National Science Foundation Awards 0331707 and 0331690. A preliminary version of this paper has appeared as a short paper [29] . ...
doi:10.1109/tkde.2008.78
fatcat:yjabtuklhfhftbxxuqvgqmqiji
Semantically driven snippet selection for supporting focused web searches
2009
Data & Knowledge Engineering
During search, the users visit a web search engine and use an interface to specify a query (typically comprising a few keywords) that best describes their information need. ...
In general, text snippets, extracted from the retrieved pages, are an indicator of the pages' usefulness to the query intention and they help the users browse search results and decide on the pages to ...
We applied our snippet selection technique (SemSS) to a number of searches that we have performed using real web data and we compared its performance to the performance of existing passage retrieval algorithms ...
doi:10.1016/j.datak.2008.10.002
fatcat:7kvqvexu7rb7rhtdv6fsehtiwy
Exploiting web search engines to search structured databases
2009
Proceedings of the 18th international conference on World wide web - WWW '09
We establish and exploit the relationships between web search results and the items in structured databases to identify the relevant structured data items for a much wider range of queries. ...
The relevant structured data items are then returned to the user along with web search results. However, each structured database is searched in isolation. ...
Entity Retrieval: The task of entity retrieval is to lookup the DocIndex for a given document identifier and retrieve the entities extracted from the document (at document indexing time) along with their ...
doi:10.1145/1526709.1526777
dblp:conf/www/AgrawalCCGKX09
fatcat:bdl3fqwzfrcrtepffisfp6xjte
« Previous
Showing results 1 — 15 out of 36,738 results