A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2012; you can also visit the original URL.
The file type is application/pdf
.
When is Chemical Similarity Significant? The Statistical Distribution of Chemical Similarity Scores and Its Extreme Values
2010
Journal of Chemical Information and Modeling
As repositories of chemical molecules continue to expand and become more open, it becomes increasingly important to develop tools to search them efficiently and assess the statistical significance of chemical similarity scores. Here we develop a general framework for understanding, modeling, predicting, and approximating the distribution of chemical similarity scores and its extreme values in large databases. The framework can be applied to different chemical representations and similarity
doi:10.1021/ci100010v
pmid:20540577
pmcid:PMC2914517
fatcat:zm6gnmgjxzdihkg46gpmrytwu4