Concepedia

TLDR

Ultra‑high‑throughput sequencing is emerging as an attractive alternative to microarrays for genotyping, methylation analysis, and transcription factor binding site identification. The study aims to estimate the technical variance of Illumina sequencing for mRNA expression and compare its ability to detect differentially expressed genes with microarrays, while proposing an empirical protocol and statistical framework. The authors used Illumina sequencing to quantify mRNA expression in liver and kidney samples with replicates, compared the results to Affymetrix arrays, and developed an empirical protocol and statistical framework for analysis. Illumina sequencing shows high reproducibility with minimal technical variation, making a single lane sufficient for many purposes, and its single‑lane data are comparable to a single microarray for detecting differentially expressed genes while also enabling detection of low‑expressed genes, alternative splice variants, and novel transcripts.

Abstract

Ultra-high-throughput sequencing is emerging as an attractive alternative to microarrays for genotyping, analysis of methylation patterns, and identification of transcription factor binding sites. Here, we describe an application of the Illumina sequencing (formerly Solexa sequencing) platform to study mRNA expression levels. Our goals were to estimate technical variance associated with Illumina sequencing in this context and to compare its ability to identify differentially expressed genes with existing array technologies. To do so, we estimated gene expression differences between liver and kidney RNA samples using multiple sequencing replicates, and compared the sequencing data to results obtained from Affymetrix arrays using the same RNA samples. We find that the Illumina sequencing data are highly replicable, with relatively little technical variation, and thus, for many purposes, it may suffice to sequence each mRNA sample only once (i.e., using one lane). The information in a single lane of Illumina sequencing data appears comparable to that in a single array in enabling identification of differentially expressed genes, while allowing for additional analyses such as detection of low-expressed genes, alternative splice variants, and novel transcripts. Based on our observations, we propose an empirical protocol and a statistical framework for the analysis of gene expression using ultra-high-throughput sequencing technology.

References

YearCitations

Page 1