Concepedia

Publication | Open Access

DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data

330

Citations

9

References

2013

Year

TLDR

Gene‑expression measurements from heterogeneous tissues are confounded by varying cell‑type proportions. The authors present DeconRNASeq, an R package that deconvolves heterogeneous tissue gene‑expression data. The package employs a globally optimized non‑negative decomposition algorithm via quadratic programming to estimate tissue‑mixing proportions and offers modular design for custom pipelines on other high‑throughput platforms. DeconRNASeq accurately recovers known cell‑type proportions in silico mixtures and benchmark datasets, showing high correlation with true fractions and enabling rigorous, high‑resolution analysis of mRNA‑Seq data. DeconRNASeq is freely available on Bioconductor (http://bioconductor.org/packages) and supplementary data can be accessed online; contact the authors at tinggong@gmail.com.

Abstract

Abstract Summary: For heterogeneous tissues, measurements of gene expression through mRNA-Seq data are confounded by relative proportions of cell types involved. In this note, we introduce an efficient pipeline: DeconRNASeq, an R package for deconvolution of heterogeneous tissues based on mRNA-Seq data. It adopts a globally optimized non-negative decomposition algorithm through quadratic programming for estimating the mixing proportions of distinctive tissue types in next-generation sequencing data. We demonstrated the feasibility and validity of DeconRNASeq across a range of mixing levels and sources using mRNA-Seq data mixed in silico at known concentrations. We validated our computational approach for various benchmark data, with high correlation between our predicted cell proportions and the real fractions of tissues. Our study provides a rigorous, quantitative and high-resolution tool as a prerequisite to use mRNA-Seq data. The modularity of package design allows an easy deployment of custom analytical pipelines for data from other high-throughput platforms. Availability: DeconRNASeq is written in R, and is freely available at http://bioconductor.org/packages. Contact: tinggong@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.

References

YearCitations

Page 1