Filters








1,578 Hits in 4.8 sec

A visual tool for bayesian data analysis

Giorgio Maria Di Nunzio, Alessandro Sordoni
2012 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval - SIGIR '12  
Our research question is: can a visualization tool help researchers to quickly assess the goodness of the performance of NB classifiers by setting optimal smoothing parameters?  ...  Among these, the Bayesian approach incorporates smoothing in terms of prior knowledge about the parameters of the model usually called hyper-parameters.  ...  Jian-Yun Nie for the useful discussions. This work has been partially supported by the QONTEXT project under grant agreement N. 247590 (FP7/2007-2013).  ... 
doi:10.1145/2348283.2348427 dblp:conf/sigir/NunzioS12 fatcat:nyr4tspjdzcutnae3g3hhwzc74

Finding Uninformative Features in Binary Data [chapter]

Xin Wang, Ata Kabán
2005 Lecture Notes in Computer Science  
For statistical modelling of multivariate binary data, such as text documents, datum instances are typically represented as vectors over a global vocabulary of attributes.  ...  In this paper we propose and study a relatively simple cluster-based generative model for multivariate binary data, equipped with automated feature weighting capability.  ...  Comparisons with a multinomial version of the model for ad-hoc retrieval and classifications are also underway.  ... 
doi:10.1007/11508069_6 fatcat:644zsjnbnraqlfstjqgtwxih6q

Model-based ordination with constrained latent variables [article]

Bert van der Veen, Francis K.C. Hui, Knut A Hovstad, Robert B O'Hara
2021 bioRxiv   pre-print
We propose a new method for model-based ordination with constrained latent variables in the Generalized Linear Latent Variable Model framework, which incorporates both measured predictors and residual  ...  However, existing constrained ordination methods do not explicitly account for information provided by species responses, so that they have the potential to misrepresent community structure if not all  ...  As a model-based approach, we can leverage conventional methods such as hypothesis testing, information criteria (Burnham & Anderson 2002) , residual diagnostics (Hartig 2021) among others for assessing  ... 
doi:10.1101/2021.10.11.463884 fatcat:qfxke3eqgzg7nlwnxodyxcmbne

Can fallopian tube anatomy predict pregnancy and pregnancy outcomes after tubal reversal surgery? [article]

Rafael S. de Souza, Gary S. Berger
2019 bioRxiv   pre-print
We built a flexible, non-parametric, multivariate model via generalised additive models to assess the effects of these tubal parameters observed during tubal reparative surgery: tubal lengths; differences  ...  For pregnancy outcomes, age is the primary predictor of miscarriage, but tubal length is the most influential predictor of the odds of birth and ectopic pregnancy.  ...  Acknowledgements We thank Christian Iliadis for his encouraging support of this work. RSS acknowledges the support from NASA under the Astrophysics Theory Program Grant 14-ATP14-0007.  ... 
doi:10.1101/616508 fatcat:h3ppewxxlnbgtktuuzhuaafugm

A novel scan statistics approach for clustering identification and comparison in binary genomic data

Danilo Pellin, Clelia Di Serio
2016 BMC Bioinformatics  
Results: We adapt the so called Bernoulli-model scan statistic to the genomic field and we propose a multivariate extension, named Relative Scan Statistics, for the comparison of two series of Bernoulli  ...  Algorithms that require to specify a-priori number/dimension/length of clusters assumed for the data suffer from a high degree of arbitrariness whenever no precise information are available, and this may  ...  Acknowledgements The authors thank all members of the CUSSB for helpful suggestions. Declaration  ... 
doi:10.1186/s12859-016-1173-8 pmid:28185547 pmcid:PMC5046198 fatcat:q332d5mwq5gbtf2lv27fmezxyu

Can fallopian tube anatomy predict pregnancy and pregnancy outcomes after tubal reversal surgery? [article]

Rafael S. de Souza, Gary S. Berger
2019 arXiv   pre-print
We built a flexible, non-parametric, multivariate model via generalised additive models to assess the effects of these tubal parameters observed during tubal reparative surgery: tubal lengths; differences  ...  For pregnancy outcomes, age is the primary predictor of miscarriage, but tubal length is the most influential predictor of the odds of birth and ectopic pregnancy.  ...  Acknowledgements We thank Christian Iliadis for his encouraging support of this work. RSS acknowledges the support from NASA under the Astrophysics Theory Program Grant 14-ATP14-0007.  ... 
arXiv:1904.10398v1 fatcat:2kuyo6paifg4xou6nekg4zjop4

A randomised approach for NARX model identification based on a multivariate Bernoulli distribution

F. Bianchi, A. Falsone, M. Prandini, L. Piroddi
2016 International Journal of Systems Science  
In this research endeavor a more complex multivariate Bernoulli distribution is employed, in which the sampling of a given term is conditioned by the sampling of the others.  ...  Then, a randomized method is used to sample from the model distribution and gather reliable information to update the distribution, until convergence to a specific model is achieved.  ...  This kind of similarity measure is widely used in the Information Retrieval field to assess the relevance of a document to a given query (Berry, Drmac, & Jessup, 2006) .  ... 
doi:10.1080/00207721.2016.1244309 fatcat:lxwsfnbwwvhkjfmzegmnv3vony

Are There New BM25 Expectations?

Emanuele Di Buccio, Giorgio Maria Di Nunzio
2013 Italian Information Retrieval Workshop  
be one of the parameters that can be modelled in a better way.  ...  In particular, we have focused on a full bayesian approach for deriving a smoothed formula that takes into account a-priori knowledge on the probability of terms.  ...  Introduction The relevance weighting model, also known as RSJ by the name of its creators (Roberston and Sparck-Jones), has been one of the most influential model in the history of Information Retrieval  ... 
dblp:conf/iir/BuccioN13 fatcat:ysuqcoozezb2das44sl2nzlaoi

Deep Semantic Hashing Using Pairwise Labels

Richeng Xuan, Junho Shim, Sang-goo Lee
2021 IEEE Access  
In addition, ( ) and ( ) in this method are multivariate standard distributions.  ...  Therefore, it is incredibly difficult for users to search for desired information from the massive collection of textual data.  ... 
doi:10.1109/access.2021.3092150 fatcat:5at4wt7cobfhbi4ngwlv3s6ynm

Analogical Reasoning with Relational Bayesian Sets

Ricardo Bezerra de Andrade e Silva, Katherine A. Heller, Zoubin Ghahramani
2007 Journal of machine learning research  
We develop a generative model for predicting the existence of relationships and extend the framework of Ghahramani and Heller (2005) to provide a Bayesian measure for how analogous a relation is to other  ...  For instance, the analogy between an electron around the nucleus of an atom and a planet around the Sun is hardly justified by isolated, non-relational, comparisons of an electron to a planet, and a nucleus  ...  Evaluation of the significance of retrieved items often relies on subjective assessments (Ghahramani and Heller, 2005) .  ... 
dblp:journals/jmlr/SilvaHG07 fatcat:ntbkw24k35cwhb2kxsjzje44aa

A General Framework for Association Analysis of Heterogeneous Data [article]

Gen Li, Irina Gaynanova
2017 arXiv   pre-print
Multivariate association analysis is of primary interest in many applications.  ...  The application to the CAL500 data sheds light on the relationship between acoustic features and semantic annotations, and provides effective means for automatic music annotation and retrieval.  ...  Acknowledgement The authors would like to thank the Computer Audition Laboratory at the University of California, San Diego, for generating the CAL500 data.  ... 
arXiv:1707.06485v1 fatcat:efftef3aijap5irnjfgfiyem7m

Model‐based ordination for species with unequal niche widths

Bert van der Veen, Francis K.C. Hui, Knut A. Hovstad, Erik B. Solbu, Robert B. O'Hara
2021 Methods in Ecology and Evolution  
The relationship of the species packing model to  ...  The combination of assuming equally spaced optima, equal maxima and equal tolerances gives an early niche model called the species packing model (MacArthur & Levins, 1967) .  ...  In Appendix S2 we provide derivations for the log-likelihood of common response types in community ecology, such as count data | S IMUL ATION S TUDY To assess how well the proposed model retrieves the  ... 
doi:10.1111/2041-210x.13595 fatcat:geubqi42g5aonmqavp7hgferw4

Hypergeometric language models for republished article finding

Manos Tsagkias, Maarten de Rijke, Wouter Weerkamp
2011 Proceedings of the 34th international ACM SIGIR conference on Research and development in Information - SIGIR '11  
We present two retrieval models that build on top of these distributions: a log odds model and a bayesian model where document parameters are estimated using the Dirichlet compound multinomial distribution  ...  We address this task as an ad hoc retrieval problem, using the source article as a query. Our approach is based on language modeling.  ...  Log odds retrieval model with multivariate non- central hypergeometric distribution (16) Table 2 : 2 Relevance assessments for strict and loose interpretations of the RAF task.  ... 
doi:10.1145/2009916.2009983 dblp:conf/sigir/TsagkiasRW11 fatcat:6xiam7p3wbehbcqrfqob4wroyi

On the Convergence of Structured Search, Information Retrieval and Trust Management in Distributed Systems [chapter]

Karl Aberer, Philippe Cudré-Mauroux, Zoran Despotovic
2005 Lecture Notes in Computer Science  
We will provide a first sketch of such a framework and illustrate it with examples from our previous work on information retrieval, structured search and trust assessment.  ...  The database and information retrieval communities have long been recognized as being irreconcilable.  ...  Thus the discipline of information retrieval has developed a rich set of models for assessing the relevance of documents for a given query.  ... 
doi:10.1007/11550648_1 fatcat:nhplwe343jd2vfeeyx2hyjgksu

Dog bite Emergency department presentations in Brisbane metro south: Epidemiology and exploratory medical geography for targeted interventions

Alexander Pekin, Hester Rynhoud, Bradley Brennan, Ricardo J. Soares Magalhães
2020 One Health  
Bernoulli logistic regression models including patient postcode as a random effect.  ...  paediatric vs. adult), probability of dog bites to the head (head injury vs. other injury), and probability of re-presentation to the ED following their initial dog bite (yes or no) by way of univariable then multivariable  ...  The second stage of analysis involved arriving at a final multivariable Bernoulli logistic regression model for each outcome of interest using a backward stepwise regression procedure.  ... 
doi:10.1016/j.onehlt.2020.100204 pmid:33364297 pmcid:PMC7750554 fatcat:lvbk2mb6fnaopirz6sulsu3woq
« Previous Showing results 1 — 15 out of 1,578 results