Learning common and specific patterns from data of multiple interrelated biological scenarios with matrix factorization release_vgv2wpvbfzgj7iasmgulthrlya

by Lihua Zhang, Shihua Zhang

Released as a post by Cold Spring Harbor Laboratory.

2018  

Abstract

High-throughput biological technologies (e.g., ChIP-seq, RNA-seq and single-cell RNA-seq) rapidly accelerate the accumulation of genome-wide omics data in diverse interrelated biological scenarios (e.g., cells, tissues and conditions). Data dimension reduction and differential analysis are two common paradigms for exploring and analyzing such data. However, they are typically used in a separate or/and sequential manner. In this study, we propose a flexible non-negative matrix factorization framework CSMF to combine them into one paradigm to simultaneously reveal common and specific patterns from data generated under interrelated biological scenarios. We demonstrate the effectiveness of CSMF with four applications including pairwise ChIP-seq data describing the chromatin modification map on protein-DNA interactions between K562 and Huvec cell lines; pairwise RNA-seq data representing the expression profiles of two cancers (breast invasive carcinoma and uterine corpus endometrial carcinoma); RNA-seq data of three breast cancer subtypes; and single-cell sequencing data of human embryonic stem cells and differentiated cells at six time points. Extensive analysis yields novel insights into hidden combinatorial patterns embedded in these interrelated multi-modal data. Results demonstrate that CSMF is a powerful tool to uncover common and specific patterns with significant biological implications from data of interrelated biological scenarios.
In application/xml+jats format

Archived Files and Locations

application/pdf   2.7 MB
file_p2pexjsoqjgtxdi6zcd6wrisma
www.biorxiv.org (repository)
web.archive.org (webarchive)
application/pdf   2.7 MB
file_bjwiidvoavbwlpswnodrnlizny
web.archive.org (webarchive)
www.biorxiv.org (web)
application/pdf   2.7 MB
file_4k755aiu5zax7nb6uvvjxnf34u
web.archive.org (webarchive)
www.biorxiv.org (web)
Read Archived PDF
Preserved and Accessible
Type  post
Stage   unknown
Date   2018-02-27
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: dfe065b7-3fbe-4334-b507-37d465565b80
API URL: JSON