Statistical Methods for Deconvolution in Cancer Genomics release_4qyj3fdiqrgtnpckuooeceomju

by Liuqing Yang

Published by The University of North Carolina at Chapel Hill University Libraries.

2019  

Abstract

With the advance of deep sequencing techniques, intratumor heterogeneity becomes a prevalent confounding factor to tumor genomic profiling studies. The heterogeneous composition of a tumor tissue can potentially lead to false positive differential expression conclusions and influence patients' clinical outcomes and therapeutic responses. Many deconvolution methods aiming to separate the subcomponent signals have been developed in the past decades, modeling the tumor genomic profiling as a linear combination of the abundance of the mixing components. In this dissertation, we characterize a two- components (tumor versus non-tumor) model and develop a Fast Tumor Deconvolution (FasTD) pipeline to address the heterogeneity issue. We build a semi-parametric regression- based framework utilizing raw measured gene expression values, and provide mixing proportions and individual component genomic profiles as outputs. We demonstrate our method and show it is more than a thousand times faster than several current probabilistic models. Both simulated data and real data applications are provided to demonstrate the effectiveness of our proposed method. Our method is then extended to deconvolve heterogeneous tumor samples with more than two subcomponents. The extended pipeline (FasTDK) can effectively deconvolve an unknown component in K-subcomponent mixtures provided with K-1 reference profiles.
In text/plain format

Archived Files and Locations

application/pdf   12.1 MB
file_yisgr5yfancg5bkbudtdzxlbq4
cdr.lib.unc.edu (publisher)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  thesis
Stage   published
Year   2019
Language   en ?
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 97d08f40-499d-4730-838e-e353b40e4b18
API URL: JSON