Publication | Open Access
Feature Selection and Dimension Reduction for Single Cell RNA-Seq based on a Multinomial Model
56
Citations
44
References
2019
Year
Unknown Venue
EngineeringGeneticsFeature SelectionTranscriptomics TechnologyGenomicsGene Expression ProfilingData ScienceData MiningSingle Cell SequencingSingle Cell Rna-seqLong Non-coding RnaBiostatisticsTranscriptomicsPrincipal Component AnalysisDimension ReductionKnowledge DiscoveryRna SequencingSingle-cell GenomicsGene ExpressionSingle-cell AnalysisBioinformaticsFunctional GenomicsComputational BiologySystems BiologyMedicine
Abstract Single cell RNA-Seq (scRNA-Seq) profiles gene expression of individual cells. Recent scRNA-Seq datasets have incorporated unique molecular identifiers (UMIs). Using negative controls, we show UMI counts follow multinomial sampling with no zero-inflation. Current normalization pro-cedures such as log of counts per million and feature selection by highly variable genes produce false variability in dimension reduction. We pro-pose simple multinomial methods, including generalized principal component analysis (GLM-PCA) for non-normal distributions, and feature selection using deviance. These methods outperform current practice in a downstream clustering assessment using ground-truth datasets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1