Automated DNA Motif Discovery [article]

W. B. Langdon, Olivia Sanchez Graillet, A. P. Harrison
<span title="2010-01-30">2010</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Ensembl's human non-coding and protein coding genes are used to automatically find DNA pattern motifs. The Backus-Naur form (BNF) grammar for regular expressions (RE) is used by genetic programming to ensure the generated strings are legal. The evolved motif suggests the presence of Thymine followed by one or more Adenines etc. early in transcripts indicate a non-protein coding gene. Keywords: pseudogene, short and microRNAs, non-coding transcripts, systems biology, machine learning,
more &raquo; ... ics, motif, regular expression, strongly typed genetic programming, context-free grammar.
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="">arXiv:1002.0065v1</a> <a target="_blank" rel="external noopener" href="">fatcat:scdo4jlcxjafxdeowvp64n5zjm</a> </span>
<a target="_blank" rel="noopener" href="" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> File Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="" title=" access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> </button> </a>