Filters








3,645 Hits in 6.3 sec

A study of thresholding strategies for text categorization

Yiming Yang
2001 Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '01  
This paper presents an examination of the e ect of thresholding strategies on the performance of a classi er under various conditions.  ...  Thresholding strategies in automated text categorization are an underexplored area of research.  ...  project for providing the Hoovers data sets.  ... 
doi:10.1145/383952.383975 dblp:conf/sigir/Yang01 fatcat:x7igfebgqja2jpeceaicvmo7ae

Constructing informative prior distributions from domain knowledge in text classification

Aynur Dayanik, David D. Lewis, David Madigan, Vladimir Menkov, Alexander Genkin
2006 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06  
We propose instead combining domain knowledge with training examples in a Bayesian framework.  ...  Domain knowledge is used to specify a prior distribution for parameters of a logistic regression model, and labeled training data is used to produce and find the mode of the posterior distribution.  ...  The views expressed in this article are those of the authors, and do not necessarily represent the views of the sponsoring agency.  ... 
doi:10.1145/1148170.1148255 dblp:conf/sigir/DayanikLMMG06 fatcat:6zipts5ddjct3kwgaiddtrfnme

Pantheon: the training ground for Internet congestion-control research

Francis Y. Yan, Jestin Ma, Greg D. Hill, Deepti Raghavan, Riad S. Wahby, Philip Levis, Keith Winstein
2018 USENIX Annual Technical Conference  
It allows network researchers to benefit from and contribute to a common set of benchmark algorithms, a shared evaluation platform, and a public archive of results.  ...  We present the Pantheon, a system that addresses this by serving as a community "training ground" for research on Internet transport protocols and congestion control (https: //pantheon.stanford.edu).  ...  Acknowledgments We thank the USENIX ATC reviewers for their helpful comments and suggestions.  ... 
dblp:conf/usenix/YanMHRWLW18 fatcat:ckxm4nnftvdvhkxasd3cc2rday

An application of text categorization methods to gene ontology annotation

Kazuhiro Seki, Javed Mostafa
2005 Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '05  
This paper describes an application of IR and text categorization methods to a highly practical problem in biomedicine, specifically, Gene Ontology (GO) annotation.  ...  As a first step toward automatic GO annotation, we aim to assign GO domain codes given a specific gene and an article in which the gene appears, which is one of the task challenges at the TREC 2004 Genomics  ...  number of k neighbors were optimized to maximize F 1 for each term weighting scheme using the training data.  ... 
doi:10.1145/1076034.1076060 dblp:conf/sigir/SekiM05 fatcat:kbomr47p6vfkvkix4majlrzxf4

Towards Automatic Recognition of Scientifically Rigorous Clinical Research Evidence

H. Kilicoglu, D. Demner-Fushman, T. C. Rindflesch, N. L. Wilczynski, R. B. Haynes
2009 JAMIA Journal of the American Medical Informatics Association  
Using a training set of 10,000 manually annotated MEDLINE citations, and a test set of an additional 2,000 citations, we achieve 73.7% precision and 61.5% recall in identifying rigorous, clinically relevant  ...  The gold standard used in the development of PubMed clinical query filters forms the basis of our approach.  ...  Their feature set exploits MeSH indexing terms and publication types assigned by NLM indexers as well as words in the title and abstract of the citation.  ... 
doi:10.1197/jamia.m2996 pmid:18952929 pmcid:PMC2605595 fatcat:mvgjn426qngi3lqvbgbla4aloq

Hierarchical classification of Web content

Susan Dumais, Hao Chen
2000 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '00  
This paper explores the use of hierarchical structure for classifying a large, heterogeneous collection of web content.  ...  The hierarchical structure is initially used to train different second-level classifiers.  ...  ACKNOWLEDGMENTS We are grateful to John Platt for help with the Support Vector Machine code, and to four anonymous reviewers for their comments.  ... 
doi:10.1145/345508.345593 dblp:conf/sigir/DumaisC00 fatcat:aqdoka6ca5bl7kklw5bfstekdy

Towards Effective Research-Paper Recommender Systems and User Modeling based on Mind Maps [article]

Joeran Beel
2017 arXiv   pre-print
The recommender system builds user models based on the mind maps, and recommends research papers based on the user models.  ...  Such systems could create additional value for millions of mind-mapping users.  ...  In MeSH, for instance, terms from a controlled vocabulary are assigned to research papers. Papers with the same MeSH terms are considered similar.  ... 
arXiv:1703.09109v1 fatcat:egcsnop34jbi7p2urz34pxr3vi

Computational Approaches for Translational Clinical Research in Disease Progression

Mary F. McGuire, Madurai Sriram Iyengar, David W. Mercer
2011 Journal of Investigative Medicine  
In this paper we review a selection of published research regarding computational methodologies, primarily from systems biology, that support translational research from the molecular level to the bedside  ...  Trauma is the leading cause of mortality in Americans under 45 years of age, and its rapid progression offers both opportunities and challenges for computational analysis of trends in molecular patterns  ...  Conjugate gradient decent method for optimization during ANN training with training data. Leave-one-out cross-validation of predictive performance with test data.  ... 
doi:10.2310/jim.0b013e318224d8cc pmid:21712727 pmcid:PMC3196807 fatcat:bxnxrhooefb4lgdpf6n6ezpppi

Bridging the Language Gap: Topic Adaptation for Documents with Different Technicality

Shuang-Hong Yang, Steven P. Crain, Hongyuan Zha
2011 Journal of machine learning research  
We present a probabilistic model for this purpose based on joint modeling of topic and technicality.  ...  This paper seeks to close the gap at the thematic level via topic adaptation, i.e., adjusting the topical structures for cross-domain documents according to a domain factor such as technicality.  ...  Acknowledgement The authors would like to thank Yu Jiao (@ORNL) and the anonymous reviewers for helpful comments. Part of this work is supported by NSF #IIS-1049694 and a grant from Hewlett-Packard.  ... 
dblp:journals/jmlr/YangCZ11 fatcat:hduiwfcgm5badoqznvgtm7ov4q

Neural network modelling and prediction of the flotation deinking behaviour of industrial paper recycling processes

Pauck
2014 Nordic Pulp & Paper Research Journal  
for deinking of differing quality materials.  ...  As a developing country, South Africa is still showing growth in the publication paper and hygiene paper markets, for which recycled fibre is an important source of raw material.  ...  The variable or factor responsible for the greatest net effect in each case was assigned the rank of 1, intermediate factors were assigned ranks of 2 to 10 and the factor with the least effect was assigned  ... 
doi:10.3183/npprj-2014-29-03-p521-532 fatcat:utl4of6zsbeyzfi4dglxuukkmy

Unsupervised Machine Learning for Networking: Techniques, Applications and Research Challenges

Muhammad Usama, Junaid Qadir, Aunn Raza, Hunain Arif, Kok-lim Alvin Yau, Yehia Elkhatib, Amir Hussain, Ala Al-Fuqaha
2019 IEEE Access  
anomaly detection, Internet traffic classification, and quality of service optimization.  ...  The focus of this survey paper is to provide an overview of applications of unsupervised learning in the domain of networking.  ...  Learning is the process of assigning optimal activation parameters enabling ANN to perform input to output mapping.  ... 
doi:10.1109/access.2019.2916648 fatcat:xutxh3neynh4bgcsmugxsclkna

Unsupervised Machine Learning for Networking: Techniques, Applications and Research Challenges [article]

Muhammad Usama, Junaid Qadir, Aunn Raza, Hunain Arif, Kok-Lim Alvin Yau, Yehia Elkhatib, Amir Hussain, Ala Al-Fuqaha
2017 arXiv   pre-print
detection, Internet traffic classification, and quality of service optimization.  ...  The focus of this survey paper is to provide an overview of the applications of unsupervised learning in the domain of networking.  ...  Learning is the process of assigning optimal activation parameters enabling ANN to perform input to output mapping.  ... 
arXiv:1709.06599v1 fatcat:llcg6gxgpjahha6bkhsitglrsm

MATRIX FACTORIZATION-BASED DATA FUSION FOR GENE FUNCTION PREDICTION IN BAKER'S YEAST AND SLIME MOLD

MARINKA ŽITNIK, BLAŽ ZUPAN
2013 Biocomputing 2014  
We have previously developed a general matrix factorization-based data fusion approach for gene function prediction.  ...  The development of effective methods for the characterization of gene functions that are able to combine diverse data sources in a sound and easily-extendible way is an important goal in computational  ...  Acknowledgements We thank Gad Shaulsky from Baylor College of Medicine, Houston, TX, for selecting functional terms from Table 2 .  ... 
doi:10.1142/9789814583220_0038 fatcat:jguk2vhrpbbt3ku7djefcvl53q

CoLe and UTAI Participation at the 2014 BioASQ Semantic Indexing Challenge

Francisco J. Ribadas-Pena, Luis M. de Campos Ibañez, Víctor Manuel Darriba Bilbao, Alfonso E. Romero
2014 Conference and Labs of the Evaluation Forum  
one using a Bayesian network built from the MeSH thesaurus structure.  ...  We also have tested different methods for combining the results of ensembles of our classifiers.  ...  Acknowledgements Research reported in this paper has been partially funded by "Ministerio de Economía y Competitividad" and FEDER (project TIN2010-18552-C03-01), by "Xunta de Galicia" (project CN 2012/  ... 
dblp:conf/clef/Ribadas-PenaIBR14 fatcat:ukfvowrx3rhhzmfckoebmskm2e

Abstracts of Working Papers in Economics

1998 Abstracts of Working Papers in Economics  
AB This paper uses Bayesian stochastic frontier methods to measure the productivity gap between Poland and Western countries that existed before the beginning of the main Polish economic reform.  ...  AB This paper assumes that the underlying asset prices are lognormally distributed, and derives necessary and sufficient conditions for the valuation of options using a Black-Scholes type methodology.  ...  (Pcleg and Tijs 1996) to apply to solutions which assign to each game a collection of product sets of strategics.  ... 
doi:10.1017/s0951007900003995 fatcat:siqha7fymfbdfnz2vr74tl4oai
« Previous Showing results 1 — 15 out of 3,645 results