Filters








15,426 Hits in 4.7 sec

Efficient filtering and ranking schemes for finding inclusion dependencies on the web

Atsuyuki Morishima, Erika Yumiya, Masami Takahashi, Shigeo Sugimoto, Hiroyuki Kitagawa
2013 Proceedings of the 22nd ACM international conference on Conference on information & knowledge management - CIKM '13  
This paper presents the results of a first comprehensive study on finding inclusion dependencies on the Web.  ...  Finally, we prove that there exist efficient algorithms for the ranking scheme.  ...  Acknowledgments The authors are grateful to Prof. Tetsuo Sakaguchi and Prof. Mitsuharu Nagamori for the discussion in seminars.  ... 
doi:10.1145/2505515.2505722 dblp:conf/cikm/MorishimaYTSK13 fatcat:bwgisnwkerbj7ihelmtaga2gou

Filtering and ranking schemes for finding inclusion dependencies on the web

Erika Yumiya, Atsuyuki Morishima, Masami Takahashi, Shigeo Sugimoto, Hiroyuki Kitagawa
2012 Proceedings of the 21st international conference companion on World Wide Web - WWW '12 Companion  
This paper addresses the problem of finding inclusion dependencies on the Web.  ...  This paper focuses on the challenges in the finding and ranking processes.  ...  ACKNOWLEDGEMENTS The authors are grateful to Prof. Sakaguchi and Prof. Nagamori for the discussion in seminars.  ... 
doi:10.1145/2187980.2188170 dblp:conf/www/YumiyaMTSK12 fatcat:z43wfwcclzgajdjt676ke6a6e4

Web question answering

Susan Dumais, Michele Banko, Eric Brill, Jimmy Lin, Andrew Ng
2002 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02  
Simple passage ranking and n-gram extraction techniques work well in our system making it efficient to use with many backend retrieval engines.  ...  We focus instead on the redundancy available in large corpora as an important resource.  ...  The top-ranked n-grams for the Scrooge query are: Dickens Filter/Reweight N-Grams.  ... 
doi:10.1145/564426.564428 fatcat:f6exx3uadvhgnf62je33bperce

Web question answering

Susan Dumais, Michele Banko, Eric Brill, Jimmy Lin, Andrew Ng
2002 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02  
Simple passage ranking and n-gram extraction techniques work well in our system making it efficient to use with many backend retrieval engines.  ...  We focus instead on the redundancy available in large corpora as an important resource.  ...  The top-ranked n-grams for the Scrooge query are: Dickens Filter/Reweight N-Grams.  ... 
doi:10.1145/564376.564428 dblp:conf/sigir/DumaisBBLN02 fatcat:f32htau4frbd3cqf5foxwy5ydq

Comparison of techniques for measuring research coverage of scientific papers: A case study

Aravind Sesagiri Raamkumar, Schubert Foo, Natalie Pang
2015 2015 Tenth International Conference on Digital Information Management (ICDIM)  
Different methods have been proposed for calculating coverage scores using the references and citations network of papers, based on coccurences techniques and graph ranking algorithms.  ...  In this paper, we propose two techniques for measuring coverage based on author-specified keywords in research papers.  ...  ACKNOWLEDGEMENTS We wish to thank ACM for providing us with an extract of the ACM DL indexed papers.  ... 
doi:10.1109/icdim.2015.7381881 dblp:conf/icdim/RaamkumarFP15 fatcat:xs2blcul6rfbdcnvez3zr35mgi

Intent feature discovery using Q&A corpus and web data

Soungwoong Yoon, Adam Jatowt, Katsumi Tanaka
2010 Proceedings of the 4th International Conference on Uniquitous Information Management and Communication - ICUIMC '10  
We collect candidate intent features using Web Q&A corpus analysis, and suggest the automated judgment method using search engine indexes powered by Click Chain Model to demonstrate the adaptability of  ...  However, users may have different intents even in the same queries. In this paper, we attempt to discover the characteristics of intent through finding its features.  ...  This research was supported in part by the National Institute of Information and Communications Technology, Japan, by Grantsin-Aid for Scientific Research (No. 18049041) from MEXT of Japan, and by the  ... 
doi:10.1145/2108616.2108665 dblp:conf/icuimc/YoonJT10 fatcat:bupyeen6krew7fgkaedscz5qme

A Parametric Layered Approach to Perform Web Page Ranking

Ratika Goel, Anchal Garg
2013 International Journal of Computer Applications  
Web crawling is the foremost step to perform the effective and efficient web content search so that the user will get the specific web pages initially in an indexed form.  ...  Web crawling is not only used for searching a webpage over the web but also to order them according to user interest.  ...  The efficiency and the reliability of a search engine actually depend on the efficiency vector of a web crawler.  ... 
doi:10.5120/11467-7251 fatcat:p4wl56r6vrcydk3qa4pezsb3da

Threshold selection for web-page classification with highly skewed class distribution

Xiaofeng He, Lei Duan, Yiping Zhou, Byron Dom
2009 Proceedings of the 18th international conference on World wide web - WWW '09  
We propose a novel cost-efficient approach to threshold selection for binary web-page classification problems with imbalanced class distributions.  ...  On the other hand, manually labeling examples is expensive and budgetary considerations require that the size of sample sets be limited.  ...  Classifiers provide necessary information for crawler, indexer and relevance ranking functions to select, index and rank high quality web search results.  ... 
doi:10.1145/1526709.1526866 dblp:conf/www/HeDZD09 fatcat:7ccjl6lihngn5f5lzdyjxfxxki

Subject categorization of query terms for exploring Web users' search interests

Hsiao-Tieh Pu, Shui-Lung Chuang, Chyan Yang
2002 Journal of the American Society for Information Science and Technology  
The experimental results demonstrate that the approach is efficient in dealing with large numbers of queries and adaptable to the dynamic Web environment.  ...  Our approach, therefore, combines the search processes of real-world search engines to obtain highly ranked Web documents based on each unknown query term.  ...  The developed categorization process ranks all candidate categories in C using the ranking function R(t,c) to find the most appropriate categories for term t, and to also judge whether the accumulated  ... 
doi:10.1002/asi.10071 fatcat:emcjaesqorc3jg3j23xihmrf2y

An Effective Feature Selection Approach Using the Hybrid Filter Wrapper

Haitao Wang, Shufen Liu
2016 International Journal of Hybrid Information Technology  
The experimental results show that our approach owned the obvious merits in the aspect of classification accuracy ratio and number features selected by extensive comparing with other methods.  ...  Feature selection is an important data preprocessing technique and has been widely studied in data mining, machine learning and granular computing.  ...  Acknowledgement The authors are grate to the editor and anonymous reviewers for their valuable comments on this paper, and the work of this paper is supported by the National Nature Science Foundation  ... 
doi:10.14257/ijhit.2016.9.1.11 fatcat:6ld4ad2fynhf7e4wpxcohi6iye

User profiling for web personalization using multi agent and DBSCAN based approach

Sipra Sahoo, Bikram Kesari Ratha
2018 International Journal of Engineering & Technology  
The user experience is enhanced by the Web Personalization System (WPS), which depends on the User's Interests (UI) and references are stored in the User Profile (UP).  ...  The profiles should be able to adapt and reproduce the change of user's behavior for such system.  ...  The operations like insertion, updating, modification or deletion is processed in the action query and the scheme used the selection query for finding the UI.  ... 
doi:10.14419/ijet.v7i2.10224 fatcat:e3x7x2znmfay3nfstf7s735czq

The Continued Saga of DB-IR Integration [chapter]

R BAEZAYATES, M CONSENS
2004 Proceedings 2004 VLDB Conference  
to be dumb The Notion of Relevance • Data retrieval: semantics tied to syntax • Information retrieval: ambiguous semantics • Relevance: -Depends on the user -Depends on the context (task, time  ...  the search contexts RankingFind how relevant to "usability" are the books • Find the best two books on "usability tests" -Take into account reviewers comments • Return all books with only the sections  ... 
doi:10.1016/b978-012088469-8/50118-2 fatcat:dktiusnpj5hcfbu2fopto7psqq

The Continued Saga of DB-IR Integration [chapter]

Ricardo Baeza-Yates, Mariano Consens
2004 Proceedings 2004 VLDB Conference  
to be dumb The Notion of Relevance • Data retrieval: semantics tied to syntax • Information retrieval: ambiguous semantics • Relevance: -Depends on the user -Depends on the context (task, time  ...  the search contexts RankingFind how relevant to "usability" are the books • Find the best two books on "usability tests" -Take into account reviewers comments • Return all books with only the sections  ... 
doi:10.1016/b978-012088469-8.50118-2 dblp:conf/vldb/Baeza-YatesC04 fatcat:2lzk6qlgurgbdoj6do2qtxy2za

A Comparison of Citation Metrics to Machine Learning Filters for the Identification of High Quality MEDLINE Documents

Y. Aphinyanaphongs, A. Statnikov, C. F. Aliferis
2006 JAMIA Journal of the American Medical Informatics Association  
Conclusions: These experiments provide evidence that when building information retrieval filters focused on a retrieval task and corresponding gold standard, the filter models have to be built specifically  ...  Previous research that claimed better performance of citation metrics than machine learning in one of the corpora examined here is attributed to using machine learning filters built for a different gold  ...  of the target variable and has been shown in prior experiments to approximate well the Markov Blanket in text categorization tasks while being more computationally efficient than finding the latter.  ... 
doi:10.1197/jamia.m2031 pmid:16622165 pmcid:PMC1513679 fatcat:chzf22pti5cz5eag5pdfq2vmzi

Web Services Discovery and Recommendation Based on Information Extraction and Symbolic Reputation

Mustapha Aznag, Mohamed Quafafou, Nicolas Durand, Zahi Jarir
2013 International Journal on Web Service Computing  
The impact of the use of these representations on web service discovery and recommendation is studied and discussed in the experimentation using real world web services.  ...  This paper shows that the problem of web services representation is crucial and analyzes the various factors that influence on it.  ...  The symbolic reputation (SR) is not efficient for web services discovery, except for one category (i.e. 'Weather'), and consequently it will not be used for web services discovery, 3.  ... 
doi:10.5121/ijwsc.2013.4101 fatcat:c2xi4b5qojbizg4o4x5ynm2buu
« Previous Showing results 1 — 15 out of 15,426 results