Concepedia

TLDR

JASPAR is an open‑access database of curated, non‑redundant transcription factor binding profiles stored as position frequency matrices and flexible models for TFs across six taxonomic groups. The 2018 CORE vertebrate collection of PFMs was used to predict TF‑binding sites in the human genome, and all data can be accessed programmatically via a RESTful API and the JASPAR R/Bioconductor package. The 2018 release expanded the CORE collection by 30 %, adding 322 new PFMs (60 vertebrates, 262 plants), updating 33 PFMs, introducing 316 TFFMs, clustering similar PFMs, and providing predictions through a UCSC Genome Browser track hub and a new interactive web framework.

Abstract

JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs (60 for vertebrates and 262 for plants) and 33 PFMs were updated (24 for vertebrates, 8 for plants and 1 for insects). These new profiles represent a 30% expansion compared to the 2016 release. In addition, we have introduced 316 TFFMs (95 for vertebrates, 218 for plants and 3 for insects). This release incorporates clusters of similar PFMs in each taxon and each TF class per taxon. The JASPAR 2018 CORE vertebrate collection of PFMs was used to predict TF-binding sites in the human genome. The predictions are made available to the scientific community through a UCSC Genome Browser track data hub. Finally, this update comes with a new web framework with an interactive and responsive user-interface, along with new features. All the underlying data can be retrieved programmatically using a RESTful API and through the JASPAR 2018 R/Bioconductor package.

References

YearCitations

Page 1