Crowdsourcing for search and data mining

Vitor R. Carvalho, Matthew Lease, Emine Yilmaz
2011 Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11  
This report on the CSDM 2011 workshop describes advances in the state-of-the-art in using crowdsourcing for search and data mining.  ...  Reported results demonstrated the promise of gathering interaction data via crowdsourcing and suggest further research in this vein.  ... 
doi:10.1145/1935826.1935828 dblp:conf/wsdm/CarvalhoLY11 fatcat:ocgdimi7uvgbnie6cmkcogey4m

Search, Filter, Fork, and Link Open Data

Sebastian Neumaier, Lörinc Thurnay, Thomas J. Lampoltshammer, Tomáš Knap
2018 Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW '18  
In the paper, we first describe the requirements of the platform, which are based on focus group interviews and a web-based survey.  ...  The information acquired by the linking and (meta)data improvement steps is then integrated in a semantic search engine.  ...  Track: PROFILES & Data Search: International Workshop on Profiling and Searching Data on the Web WWW 2018, April 23-27, 2018, Lyon, France We already filtered out requirements that are beyond  ... 
doi:10.1145/3184558.3191602 dblp:conf/www/NeumaierTLK18 fatcat:t376wvga4vfhholeylupagbeoa

Usefulness of quality click-through data for training

Craig Macdonald, Iadh Ounis
2009 Proceedings of the 2009 workshop on Web Search Click Data - WSCD '09  
In this work, we examine the usefulness of high-quality clickthrough data for training an IR system, on searching the .gov vertical domain of the Web.  ...  To obtain these parameter settings, quality training is usually required, where assessors have manually labelled the relevance of retrieved items for many queries.  ...  Learning user interaction models for predicting Web search result preferences. In Proceedings of SIGIR 2006, pages 3-10.  ... 
doi:10.1145/1507509.1507521 dblp:conf/wsdm/MacdonaldO09 fatcat:toz2aosyebbsfcomnl7xcwm4cm

Efficient multiple-click models in web search

Fan Guo, Chao Liu, Yi Min Wang
2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09  
Many tasks that leverage web search users' implicit feedback rely on a proper and unbiased interpretation of user clicks.  ...  We systematically evaluate the two models on click logs obtained in July 2008 from a major commercial search engine.  ...  Web search engine evaluation using click-through data and a user model. In Proceeding of the Workshop on Query Log Analysis: Social and Technological Challenges (WWW '07), 2007. [8] G. E.  ... 
doi:10.1145/1498759.1498818 dblp:conf/wsdm/GuoLW09 fatcat:sdafyi2jmvet3mxnln2sivhphi

Impact of search results on user queries

Sofia Stamou, Lefteris Kozanidis
2009 Proceeding of the eleventh international workshop on Web information and data management - WIDM '09  
The application of our model on a search trace of 19,250 queries issued to Google by 18 users over a period of two months reveals that in overall search results influence the specification of 12.79% of  ...  Based on the analysis of the user querying trends and web visits on the query results, we propose a model that tries to capture the results' influence on the specification of the subsequent user queries  ...  Although there exist several works on how people search for information on the web [6] [3] [10] [8] , most of the reported works concentrate on identifying the search goals associated with web queries  ... 
doi:10.1145/1651587.1651591 dblp:conf/widm/StamouK09 fatcat:isp6w7jxcfhfpfhl6uibmejtrq

On improving local website search using web server traffic logs

Qing Cui, Alex Dekhtyar
2005 Proceedings of the seventh ACM international workshop on Web information and data management - WIDM '05  
In this paper we give a preliminary report on our study of the use of web server traffic logs to improve local search.  ...  on the amount of links.  ...  Front End Both the site crawler and the log miner components of the search engine work off-line, preparing the data for on-line use.  ... 
doi:10.1145/1097047.1097060 dblp:conf/widm/CuiD05 fatcat:7o3gbs472rgkhmer6mg5vbqf74

Intentional query suggestion

Markus Strohmaier, Mark Kröll, Christian Körner
2009 Proceedings of the 2009 workshop on Web Search Click Data - WSCD '09  
The degree to which users make their search intent explicit can be assumed to represent an upper bound on the level of service that search engines can provide.  ...  Our preliminary results indicate that intentional query suggestions 1) diversify search result sets (i.e. it reduces result set overlap) and 2) have the potential to yield higher click-through rates than  ...  The search query log data is split into two files, one file containing attributes Time, Query, QueryID and ResultCount, the other one attributes QueryID, Query, Time, URL and Position providing click-through  ... 
doi:10.1145/1507509.1507520 dblp:conf/wsdm/StrohmaierKK09 fatcat:suaj4nugfngznce56uhrhojdsy

Patterns for searching data on the web across different research communities

Timo Borst, Fidan Limani
2020 Liber Quarterly: The Journal of European Research Libraries  
With researchers and academic institutions increasingly publishing their data on the public web, traditional research workflows with respect to data search are subject to empirical analysis, user studies  ...  Being a concept quite familiar in the domain of information retrieval, data search in a web based environment has recently gained attention.  ...  Acknowledgments The authors gratefully acknowledge financial support from the GeRDI project, funded by German Research Foundation (DFG), grants no. BO818/16-1 and HA2038/6-1.  ... 
doi:10.18352/lq.10317 fatcat:4afnftc6wbes3f72gqn54vnbrq

Linked Data Metrics for Flexible Expert Search on the Open Web [chapter]

Milan Stankovic, Jelena Jovanovic, Philippe Laublet
2011 Lecture Notes in Computer Science  
We propose an approach for adapting the expert search process (choosing the right type of trace and the right expertise hypothesis) to the given topic of expertise, by relying on Linked Data metrics.  ...  The existing expert search approaches are mostly limited to one corpus and one particular type of trace, sometimes even to a particular domain.  ...  The Model of User Traces on the Linked Data Web Definition 1.  ... 
doi:10.1007/978-3-642-21034-1_8 fatcat:clspq62yunbkpb2cobycsrau3q

Analysis of long queries in a large scale search log

Michael Bendersky, W. Bruce Croft
2009 Proceedings of the 2009 workshop on Web Search Click Data - WSCD '09  
They are also, however, quite common in web search, as can be seen by looking at the distribution of query lengths in a large scale search log.  ...  In addition, we propose a simple yet effective method for evaluating the performance of the queries in the search log using a combination of the click data in the search log with the existing TREC corpora  ...  corpora [4, 18] and in the web search setting [9] .  ... 
doi:10.1145/1507509.1507511 dblp:conf/wsdm/BenderskyC09a fatcat:kx7edj4ryvgxxdmldto3hgptlu

Automatic web spreadsheet data extraction

Zhe Chen, Michael Cafarella
2013 Proceedings of the 3rd International Workshop on Semantic Search Over the Web - SS@ '13  
When compared to standard techniques for spreadsheet data extraction on a set of 100 random Web spreadsheets, the system reduces the amount of human labor by 72% to 92%.  ...  A large number of data integration tools exist, but they generally can only work on relational data.  ...  Senbazuru focuses on data frame spreadsheets, which are one of the most popular types in the Web.  ... 
doi:10.1145/2509908.2509909 dblp:conf/vldb/ChenC13 fatcat:k5emfwd3gzdb7bqemefb6c7vp4

LambdaMerge

Daniel Sheldon, Milad Shokouhi, Martin Szummer, Nick Craswell
2011 Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11  
Score: the original search engine ranker score. On the Bing data, it is the Bing ranker score.  ...  ORG: The results for the original query with no merg-  ...  In experiments on Bing data, the simple CombSUM merg-  ...  manages to outperform the RAPP(Ω) oracle on P@5.  ... 
doi:10.1145/1935826.1935930 dblp:conf/wsdm/SheldonSSC11 fatcat:y6blr4jzfnge3ipvzu35wbgg6u

Let web spammers expose themselves

Zhicong Cheng, Bin Gao, Congkai Sun, Yanbing Jiang, Tie-Yan Liu
2011 Proceedings of the fourth ACM international conference on Web search and data mining - WSDM '11  
We find that web spammers usually ally with each other, and SEO forum is one of the major means for them to form the alliance.  ...  ., link farm and link exchange) from search engine optimization (SEO) forums. To provide quality services, it is critical for search engines to address web spam.  ...  Web spam refers to the actions that mislead search engines into ranking some pages higher than they should be ranked. It is clear that web spam is a nuisance to both web users and web search engines.  ... 
doi:10.1145/1935826.1935902 dblp:conf/wsdm/ChengGSJL11 fatcat:jutuanqcd5ff7ecg5kn5jehcny

Supporting the automatic construction of entity aware search engines

Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti
2008 Proceeding of the 10th ACM workshop on Web information and data management - WIDM '08  
We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity.  ...  Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the  ...  A new data integration architecture for web data is the subject of the PAYGO project [17] ; the project focuses on the heterogeneity of structured data on the web: it concentrates on explicit structured  ... 
doi:10.1145/1458502.1458526 dblp:conf/widm/BlancoCMP08 fatcat:kxvczk73wfbitj6kux74wmwrva

On improving wikipedia search using article quality

Meiqun Hu, Ee-Peng Lim, Aixin Sun, Hady Wirawan Lauw, Ba-Quy Vuong
2007 Proceedings of the 9th annual ACM international workshop on Web information and data management - WIDM '07  
We develop two quality measurement models, namely Basic and PeerReview, to derive article quality based on co-authoring data gathered from articles' edit history.  ...  While Wikipedia offers full-text search to its users, the accuracy of its relevance-based search can be compromised by poor quality articles edited by non-experts and inexperienced contributors.  ...  Furthermore, many proposed metrics are subjective and require data supplied from sources outside Wikipedia [11, 18] .  ... 
doi:10.1145/1316902.1316926 dblp:conf/widm/HuLSLV07 fatcat:3sqozero3jef3aj4v5l5kkcdq4