9 Hits in 7.5 sec

Finding text reuse on the web

Michael Bendersky, W. Bruce Croft
2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09  
With the overwhelming number of reports on similar events originating from different sources on the web, it is often hard, using existing web search paradigms, to find the original source of "facts", statements  ...  Our experimental results show that the proposed techniques can operate on the scale of the web, are significantly more accurate than standard web search for finding text reuse, and provide a richer representation  ...  WSDM '09, February 9-12, 2009, Barcelona, Spain. Copyright 2009 ACM 978-1-60558-390-7/09/02 ...$5.00.  ... 
doi:10.1145/1498759.1498835 dblp:conf/wsdm/BenderskyC09 fatcat:6ouplforinaurjn2x6vixqnegm

Top-k aggregation using intersections of ranked inputs

Ravi Kumar, Kunal Punera, Torsten Suel, Sergei Vassilvitskii
2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09  
An important instance of this problem is query processing in search engines: One has to combine information from several different posting lists (rankings) of web pages (objects) to obtain the top k web  ...  Using an index of millions of web pages and real-world search engine queries, we empirically characterize the performance gains offered by our new algorithms.  ...  We thank Prabhakar Raghavan, Eva Tardos, and David Williamson for useful pointers.  ... 
doi:10.1145/1498759.1498830 dblp:conf/wsdm/KumarPSV09 fatcat:hb7wislyvfgenogap3l56l3p4m

Query by document

Yin Yang, Nilesh Bansal, Wisam Dakka, Panagiotis Ipeirotis, Nick Koudas, Dimitris Papadias
2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09  
Such abundance of content complements content on web sites and traditional media forums such as news papers, news and financial streams, and so on.  ...  Detailed experiments demonstrate the effectiveness and efficiency of the proposed techniques for the task of automating retrieval of documents related to a query document.  ...  To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. WSDM '09, February 9-12, 2009, Barcelona, Spain.  ... 
doi:10.1145/1498759.1498806 dblp:conf/wsdm/YangBDIKP09 fatcat:h4jxatnju5d4td4eiq2vqgc57q

Wikipedia pages as entry points for book search

Marijn Koolen, Gabriella Kazai, Nick Craswell
2009 Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09  
Thus, we investigate possible ways of using Wikipedia as an intermediary between the user's query and a collection of books being searched.  ...  of relevance that can boost the retrieval score of relevant books in the result ranking of a book search engine.  ...  WSDM '09, Barcelona, Spain Copyright 2009 ACM 978-1-60558-390-7 ...$5.00. in books.  ... 
doi:10.1145/1498759.1498807 dblp:conf/wsdm/KoolenKC09 fatcat:vf4zoujos5ewddftqtaec7fcsy

Algorithmic Fairness Datasets: the Story so Far [article]

Alessandro Fabris, Stefano Messina, Gianmaria Silvello, Gian Antonio Susto
2022 arXiv   pre-print
As a result, a growing community of researchers has been investigating the equity of existing algorithms and proposing novel ones, advancing the understanding of risks and opportunities of automated decision-making  ...  Unfortunately, the algorithmic fairness community suffers from a collective data documentation debt caused by a lack of information on specific resources (opacity) and scatteredness of available information  ...  Acknowledgements The authors would like to thank the following researchers and dataset creators for the useful feedback on the data briefs: Alain Barrat, Luc Behaghel, Asia Biega, Marko Bohanec, Chris  ... 
arXiv:2202.01711v2 fatcat:5hf4a42pubc5vnt7tw3al4m5bq

MetaTS: Meta Teacher-Student Network for Multilingual Sequence Labeling with Minimal Supervision

Zheng Li, Danqing Zhang, Tianyu Cao, Ying Wei, Yiwei Song, Bing Yin
2021 Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing   unpublished
In 8th International Conference on Learning Representations, ICLR 2020, Ad-  ...  on Web Search and Data Mining, Houston, TX, USA, February 3-7, 2020, pages 151–159. ACM.  ...  In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, volume 97 of Proceedings of Machine Learning Research, pages 7045  ... 
doi:10.18653/v1/2021.emnlp-main.255 fatcat:mbanujhsbrbfzp7xxhirzp2nvy

Algorithmic Fairness Datasets: the Story so Far [article]

Alessandro Fabris, Stefano Messina, Gianmaria Silvello, Gian Antonio Susto
2022
As a result, a growing community of algorithmic fairness researchers has been investigating the equity of existing algorithms and proposing novel ones, advancing the understanding of the risks and opportunities  ...  Unfortunately, the algorithmic fairness community, as a whole, suffers from a collective data documentation debt caused by a lack of information on specific resources (opacity) and scatteredness of available  ...  Acknowledgements The authors would like to thank the following researchers and dataset creators for the useful feedback on the data briefs: Alain Barrat, Luc  ... 
doi:10.48550/arxiv.2202.01711 fatcat:mav36x3w5namjhurzpevtsmsju

Web-Site-Biographie – Anforderungsanalyse, Systementwurf, prototypische Realisierung [article]

Maximilian Patzak, Universitätsbibliothek Gießen, Axel Schwickert, Corinna Ewelt-Knauer
2021
A large number of web sites are often incompletely represented in the collections of web archives, or not represented at all.  ...  Companies rarely invest in preserving, processing, and presenting their own web past and instead strive to adapt their web presence to new technologies  ...  .: The web changes everything: understanding the dynamics of web content, in: Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM 2009): Barcelona, Spain, February  ... 
doi:10.22029/jlupub-51 fatcat:okqqftuchzczjbfymbqn62xt74

Leveraging Semantic Annotations for Event-focused Search & Summarization [article]

Arunav Mishra, Universität des Saarlandes
2018
With lack of further information on the event in Wikipedia, and the need to dig deeper, John switches to a Web search engine to find web articles describing the event.  ...  WSDM 2017.  ... 
doi:10.22028/d291-27108 fatcat:kwkdprnrivfdvi3rtazu4gbcfu