PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites

D. Cheng, C. Knox, N. Young, P. Stothard, S. Damaraju, D. S. Wishart
<span title="2008-05-19">2008</span> <i title="Oxford University Press (OUP)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/hfp6p6inqbdexbsu4r7usndpte" style="color: black;">Nucleic Acids Research</a> </i> &nbsp;
A particular challenge in biomedical text mining is to find ways of handling 'comprehensive' or 'associative' queries such as 'Find all genes associated with breast cancer'. Given that many queries in genomics, proteomics or metabolomics involve these kind of comprehensive searches we believe that a webbased tool that could support these searches would be quite useful. In response to this need, we have developed the PolySearch web server. PolySearch supports'50 different classes of queries
more &raquo; ... st nearly a dozen different types of text, scientific abstract or bioinformatic databases. The typical query supported by PolySearch is 'Given X, find all Y's' where X or Y can be diseases, tissues, cell compartments, gene/protein names, SNPs, mutations, drugs and metabolites. PolySearch also exploits a variety of techniques in text mining and information retrieval to identify, highlight and rank informative abstracts, paragraphs or sentences. PolySearch's performance has been assessed in tasks such as gene synonym identification, proteinprotein interaction identification and disease gene identification using a variety of manually assembled 'gold standard' text corpuses. Its f-measure on these tasks is 88, 81 and 79%, respectively. These values are between 5 and 50% better than other published tools. The server is freely available at
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1093/nar/gkn296">doi:10.1093/nar/gkn296</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/18487273">pmid:18487273</a> <a target="_blank" rel="external noopener" href="https://pubmed.ncbi.nlm.nih.gov/PMC2447794/">pmcid:PMC2447794</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/4nqjvxjlejdmjlrwrnuesj3voe">fatcat:4nqjvxjlejdmjlrwrnuesj3voe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190225220405/http://pdfs.semanticscholar.org/68e6/86127186c507656243b50dc0c88ac18ee6fb.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/68/e6/68e686127186c507656243b50dc0c88ac18ee6fb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1093/nar/gkn296"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> oup.com </button> </a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447794" title="pubmed link"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> pubmed.gov </button> </a>