Inference of Gene Regulatory Networks Based on a Universal Minimum Description Length

John Dougherty, Ioan Tabus, Jaakko Astola
2008 EURASIP Journal on Bioinformatics and Systems Biology  
The Boolean network paradigm is a simple and effective way to interpret genomic systems, but discovering the structure of these networks remains a difficult task. The minimum description length (MDL) principle has already been used for inferring genetic regulatory networks from time-series expression data and has proven useful for recovering the directed connections in Boolean networks. However, the existing method uses an ad hoc measure of description length that necessitates a tuning
more » ... for artificially balancing the model and error costs and, as a result, directly conflicts with the MDL principle's implied universality. In order to surpass this difficulty, we propose a novel MDL-based method in which the description length is a theoretical measure derived from a universal normalized maximum likelihood model. The search space is reduced by applying an implementable analogue of Kolmogorov's structure function. The performance of the proposed method is demonstrated on random synthetic networks, for which it is shown to improve upon previously published network inference algorithms with respect to both speed and accuracy. Finally, it is applied to time-series Drosophila gene expression measurements.
doi:10.1155/2008/482090 pmid:18437238 pmcid:PMC3171396 fatcat:oiwswxvasfhbfl74opdfsqpgpa