Concepedia

Publication | Open Access

MEME-ChIP: motif analysis of large DNA datasets

1.9K

Citations

10

References

2011

Year

TLDR

Advances in high‑throughput sequencing have produced large, high‑quality datasets, including TF ChIP‑seq data, yet many existing tools cannot directly process such large datasets. MEME‑ChIP analyzes ChIP‑seq peak regions by performing ab initio motif discovery with MEME and DREME, motif enrichment via AME, visualization, binding‑affinity analysis, and motif identification. Unlike the MEME web service, MEME‑ChIP imposes no size or sequence‑number limits, enabling analysis of very large ChIP‑seq datasets and providing users with diverse views of TF binding and potential involvement of other TFs. MEME‑ChIP is available as part of the MEME Suite at http://meme.nbcr.net; contact t.bailey@uq.edu.au; supplementary data are online.

Abstract

Abstract Motivation: Advances in high-throughput sequencing have resulted in rapid growth in large, high-quality datasets including those arising from transcription factor (TF) ChIP-seq experiments. While there are many existing tools for discovering TF binding site motifs in such datasets, most web-based tools cannot directly process such large datasets. Results: The MEME-ChIP web service is designed to analyze ChIP-seq ‘peak regions’—short genomic regions surrounding declared ChIP-seq ‘peaks’. Given a set of genomic regions, it performs (i) ab initio motif discovery, (ii) motif enrichment analysis, (iii) motif visualization, (iv) binding affinity analysis and (v) motif identification. It runs two complementary motif discovery algorithms on the input data—MEME and DREME—and uses the motifs they discover in subsequent visualization, binding affinity and identification steps. MEME-ChIP also performs motif enrichment analysis using the AME algorithm, which can detect very low levels of enrichment of binding sites for TFs with known DNA-binding motifs. Importantly, unlike with the MEME web service, there is no restriction on the size or number of uploaded sequences, allowing very large ChIP-seq datasets to be analyzed. The analyses performed by MEME-ChIP provide the user with a varied view of the binding and regulatory activity of the ChIP-ed TF, as well as the possible involvement of other DNA-binding TFs. Availability: MEME-ChIP is available as part of the MEME Suite at http://meme.nbcr.net. Contact: t.bailey@uq.edu.au Supplementary information: Supplementary data are available at Bioinformatics online.

References

YearCitations

Page 1