CEBS object model for systems biology data, SysBio-OM

S. Xirasagar, S. Gustafson, B. A. Merrick, K. B. Tomer, S. Stasiewicz, D. D. Chan, K. J. Yost, J. R. Yates, S. Sumner, N. Xiao, M. D. Waters
2004 Bioinformatics  
Motivation: To promote a systems biology approach to understanding the biological effects of environmental stressors, the Chemical Effects in Biological Systems (CEBS) knowledge base is being developed to house data from multiple complex data streams in a systems friendly manner that will accommodate extensive querying from users. Unified data representation via a single object model will greatly aid in integrating data storage and management, and facilitate reuse of software to analyze and
more » ... lay data resulting from diverse differential expression or differential profile technologies. Data streams include, but are not limited to, gene expression analysis (transcriptomics), protein expression and protein-protein interaction analysis (proteomics) and changes in low molecular weight metabolite levels (metabolomics). Results: To enable the integration of microarray gene expression, proteomics and metabolomics data in the CEBS system, we designed an object model, Systems Biology Object Model (SysBio-OM). The model is comprehensive and leverages other open source efforts, namely the Micro-Array Gene Expression Object Model (MAGE-OM) and the Proteomics Experiment Data Repository (PEDRo) object model. SysBio-OM is designed by extending MAGE-OM to represent protein expression data elements (including those from PEDRo), protein-protein interaction and metabolomics data. SysBio-OM promotes the standardization of data representation and data quality by facilitating the capture of the minimum annotation required for an experiment. Such standardization refines the accuracy of data mining and interpretation. The * To whom correspondence should be addressed. † The authors wish it to be known that, in their opinion, these two authors should be regarded as joint First Authors. open source SysBio-OM model, which can be implemented on varied computing platforms is presented here. Availability: A universal modeling language depiction of the entire SysBio-OM is available at http://cebs.niehs.nih.gov/ SysBioOM/. The Rational Rose object model package is distributed under an open source license that permits unrestricted academic and commercial use and is available at http://cebs.niehs.nih.gov/cebsdownloads. The database and interface are being built to implement the model and will be available for public use at http://cebs.niehs.nih.gov.
doi:10.1093/bioinformatics/bth189 pmid:15044233 fatcat:r5nao3ztmbf27evmhwjyx3uny4