CNSA: a data repository for archiving omics data [article]

Xueqin Guo, Fengzhen Chen, Fei Gao, Ling Li, Ke Liu, Lijin You, Cong Hua, Fan Yang, Wanliang Liu, Chunhua Peng, Lina Wang, Xiaoxia Yang (+14 others)
2020 bioRxiv   pre-print
With the application and development of high-throughput sequencing technology in life and health sciences, massive multi-dimensional biological data brings the problem of efficient management and utilization. Database development and biocuration are the prerequisites for the reuse of these big data. Here, relying on China National GeneBank (CNGB), we present CNGB Sequence Archive (CNSA) for archiving omics data, including raw sequencing data and its analytical data and related metadata which
more » ... organized into six objects, namely Project, Sample, Experiment, Run, Assembly, and Variation at present. Moreover, CNSA has created the correlation model of living samples, sample information, and analytical data on some projects, so that all data can be traced throughout the life cycle from the living sample to the sample information to the analytical data. Complying with the data standards commonly used in the life sciences, CNSA is committed to improving the data standards and building a comprehensive and curated data repository for the storage, management and sharing of omics data, and providing free access to open data resources for worldwide scientific communities to support academic research and the bio-industry. Database URL:
doi:10.1101/2020.04.07.030833 fatcat:vmy7cw2qpzabdbdzak66etlfwq