Efficient filtering and ranking schemes for finding inclusion dependencies on the web
2013
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management - CIKM '13
This paper presents the results of a first comprehensive study on finding inclusion dependencies on the Web. ...
Finally, we prove that there exist efficient algorithms for the ranking scheme. ...
Acknowledgments The authors are grateful to Prof. Tetsuo Sakaguchi and Prof. Mitsuharu Nagamori for the discussion in seminars. ...
doi:10.1145/2505515.2505722
dblp:conf/cikm/MorishimaYTSK13
fatcat:bwgisnwkerbj7ihelmtaga2gou
Filtering and ranking schemes for finding inclusion dependencies on the web
2012
Proceedings of the 21st international conference companion on World Wide Web - WWW '12 Companion
This paper addresses the problem of finding inclusion dependencies on the Web. ...
This paper focuses on the challenges in the finding and ranking processes. ...
ACKNOWLEDGEMENTS The authors are grateful to Prof. Sakaguchi and Prof. Nagamori for the discussion in seminars. ...
doi:10.1145/2187980.2188170
dblp:conf/www/YumiyaMTSK12
fatcat:z43wfwcclzgajdjt676ke6a6e4
Web question answering
2002
Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02
Simple passage ranking and n-gram extraction techniques work well in our system making it efficient to use with many backend retrieval engines. ...
We focus instead on the redundancy available in large corpora as an important resource. ...
The top-ranked n-grams for the Scrooge query are: Dickens Filter/Reweight N-Grams. ...
doi:10.1145/564426.564428
fatcat:f6exx3uadvhgnf62je33bperce
Web question answering
2002
Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02
Simple passage ranking and n-gram extraction techniques work well in our system making it efficient to use with many backend retrieval engines. ...
We focus instead on the redundancy available in large corpora as an important resource. ...
The top-ranked n-grams for the Scrooge query are: Dickens Filter/Reweight N-Grams. ...
doi:10.1145/564376.564428
dblp:conf/sigir/DumaisBBLN02
fatcat:f32htau4frbd3cqf5foxwy5ydq
Comparison of techniques for measuring research coverage of scientific papers: A case study
2015
2015 Tenth International Conference on Digital Information Management (ICDIM)
Different methods have been proposed for calculating coverage scores using the references and citations network of papers, based on co-occurrence techniques and graph ranking algorithms. ...
In this paper, we propose two techniques for measuring coverage based on author-specified keywords in research papers. ...
ACKNOWLEDGEMENTS We wish to thank ACM for providing us with an extract of the ACM DL indexed papers. ...
doi:10.1109/icdim.2015.7381881
dblp:conf/icdim/RaamkumarFP15
fatcat:xs2blcul6rfbdcnvez3zr35mgi
Intent feature discovery using Q&A corpus and web data
2010
Proceedings of the 4th International Conference on Ubiquitous Information Management and Communication - ICUIMC '10
We collect candidate intent features using Web Q&A corpus analysis, and suggest the automated judgment method using search engine indexes powered by Click Chain Model to demonstrate the adaptability of ...
However, users may have different intents even in the same queries. In this paper, we attempt to discover the characteristics of intent through finding its features. ...
This research was supported in part by the National Institute of Information and Communications Technology, Japan, by Grants-in-Aid for Scientific Research (No. 18049041) from MEXT of Japan, and by the ...
doi:10.1145/2108616.2108665
dblp:conf/icuimc/YoonJT10
fatcat:bupyeen6krew7fgkaedscz5qme
A Parametric Layered Approach to Perform Web Page Ranking
2013
International Journal of Computer Applications
Web crawling is the foremost step to perform the effective and efficient web content search so that the user will get the specific web pages initially in an indexed form. ...
Web crawling is not only used for searching a webpage over the web but also to order them according to user interest. ...
The efficiency and the reliability of a search engine actually depend on the efficiency vector of a web crawler. ...
doi:10.5120/11467-7251
fatcat:p4wl56r6vrcydk3qa4pezsb3da
Threshold selection for web-page classification with highly skewed class distribution
2009
Proceedings of the 18th international conference on World wide web - WWW '09
We propose a novel cost-efficient approach to threshold selection for binary web-page classification problems with imbalanced class distributions. ...
On the other hand, manually labeling examples is expensive and budgetary considerations require that the size of sample sets be limited. ...
Classifiers provide necessary information for crawler, indexer and relevance ranking functions to select, index and rank high quality web search results. ...
doi:10.1145/1526709.1526866
dblp:conf/www/HeDZD09
fatcat:7ccjl6lihngn5f5lzdyjxfxxki
Subject categorization of query terms for exploring Web users' search interests
2002
Journal of the American Society for Information Science and Technology
The experimental results demonstrate that the approach is efficient in dealing with large numbers of queries and adaptable to the dynamic Web environment. ...
Our approach, therefore, combines the search processes of real-world search engines to obtain highly ranked Web documents based on each unknown query term. ...
The developed categorization process ranks all candidate categories in C using the ranking function R(t,c) to find the most appropriate categories for term t, and to also judge whether the accumulated ...
doi:10.1002/asi.10071
fatcat:emcjaesqorc3jg3j23xihmrf2y
An Effective Feature Selection Approach Using the Hybrid Filter Wrapper
2016
International Journal of Hybrid Information Technology
The experimental results show that our approach has clear merits in classification accuracy and in the number of features selected, based on extensive comparison with other methods. ...
Feature selection is an important data preprocessing technique and has been widely studied in data mining, machine learning and granular computing. ...
Acknowledgement The authors are grateful to the editor and anonymous reviewers for their valuable comments on this paper, and the work of this paper is supported by the National Nature Science Foundation ...
doi:10.14257/ijhit.2016.9.1.11
fatcat:6ld4ad2fynhf7e4wpxcohi6iye
User profiling for web personalization using multi agent and DBSCAN based approach
2018
International Journal of Engineering & Technology
The user experience is enhanced by the Web Personalization System (WPS), which depends on the User's Interests (UI) and references are stored in the User Profile (UP). ...
The profiles should be able to adapt and reproduce the change of user's behavior for such system. ...
Operations like insertion, updating, modification or deletion are processed in the action query, and the scheme uses the selection query for finding the UI. ...
doi:10.14419/ijet.v7i2.10224
fatcat:e3x7x2znmfay3nfstf7s735czq
The Continued Saga of DB-IR Integration
[chapter]
2004
Proceedings 2004 VLDB Conference
to be
dumb
The Notion of Relevance
• Data retrieval: semantics tied to syntax
• Information retrieval: ambiguous semantics
• Relevance:
  - Depends on the user
  - Depends on the context (task, time ...
the search contexts
Ranking
• Find how relevant to "usability" are the books
• Find the best two books on "usability tests"
  - Take into account reviewers comments
• Return all books with only the sections ...
doi:10.1016/b978-012088469-8/50118-2
fatcat:dktiusnpj5hcfbu2fopto7psqq
The Continued Saga of DB-IR Integration
[chapter]
2004
Proceedings 2004 VLDB Conference
to be
dumb
The Notion of Relevance
• Data retrieval: semantics tied to syntax
• Information retrieval: ambiguous semantics
• Relevance:
  - Depends on the user
  - Depends on the context (task, time ...
the search contexts
Ranking
• Find how relevant to "usability" are the books
• Find the best two books on "usability tests"
  - Take into account reviewers comments
• Return all books with only the sections ...
doi:10.1016/b978-012088469-8.50118-2
dblp:conf/vldb/Baeza-YatesC04
fatcat:2lzk6qlgurgbdoj6do2qtxy2za
A Comparison of Citation Metrics to Machine Learning Filters for the Identification of High Quality MEDLINE Documents
2006
JAMIA Journal of the American Medical Informatics Association
Conclusions: These experiments provide evidence that when building information retrieval filters focused on a retrieval task and corresponding gold standard, the filter models have to be built specifically ...
Previous research that claimed better performance of citation metrics than machine learning in one of the corpora examined here is attributed to using machine learning filters built for a different gold ...
of the target variable and has been shown in prior experiments to approximate well the Markov Blanket in text categorization tasks while being more computationally efficient than finding the latter. ...
doi:10.1197/jamia.m2031
pmid:16622165
pmcid:PMC1513679
fatcat:chzf22pti5cz5eag5pdfq2vmzi
Web Services Discovery and Recommendation Based on Information Extraction and Symbolic Reputation
2013
International Journal on Web Service Computing
The impact of the use of these representations on web service discovery and recommendation is studied and discussed in the experimentation using real world web services. ...
This paper shows that the problem of web services representation is crucial and analyzes the various factors that influence it. ...
The symbolic reputation (SR) is not efficient for web services discovery, except for one category (i.e. 'Weather'), and consequently it will not be used for web services discovery, 3. ...
doi:10.5121/ijwsc.2013.4101
fatcat:c2xi4b5qojbizg4o4x5ynm2buu