Publication | Open Access
PubChem Substance and Compound databases
5.3K
Citations
46
References
2015
Year
Chemical BiologyMedicinal ChemistryPubchem SubstanceBiochemistryBiological DatabaseNatural SciencesMedicineChemical AnalysisMass SpectrometryChemometric MethodCompound ClassForensic ChemistryBioassay DatabaseChemistryChromatographyPharmacologyDrug DiscoveryPubchem Data
PubChem is a public repository launched in 2004 that aggregates chemical substance information and biological activity data through its interconnected Substance, Compound, and BioAssay databases, serving the scientific community for over a decade. This paper aims to provide an overview of the PubChem Substance and Compound databases, detailing their data sources, contents, organization, submission processes, chemical structure standardization, and web-based and programmatic search interfaces. The authors also describe PubChem3D, a resource of theoretical 3D structures, and PubChemRDF, an RDF-formatted dataset facilitating data sharing and integration with other databases.
PubChem (https://pubchem.ncbi.nlm.nih.gov) is a public repository for information on chemical substances and their biological activities, launched in 2004 as a component of the Molecular Libraries Roadmap Initiatives of the US National Institutes of Health (NIH). For the past 11 years, PubChem has grown to a sizable system, serving as a chemical information resource for the scientific research community. PubChem consists of three inter-linked databases, Substance, Compound and BioAssay. The Substance database contains chemical information deposited by individual data contributors to PubChem, and the Compound database stores unique chemical structures extracted from the Substance database. Biological activity data of chemical substances tested in assay experiments are contained in the BioAssay database. This paper provides an overview of the PubChem Substance and Compound databases, including data sources and contents, data organization, data submission using PubChem Upload, chemical structure standardization, web-based interfaces for textual and non-textual searches, and programmatic access. It also gives a brief description of PubChem3D, a resource derived from theoretical three-dimensional structures of compounds in PubChem, as well as PubChemRDF, Resource Description Framework (RDF)-formatted PubChem data for data sharing, analysis and integration with information contained in other databases.
| Year | Citations | |
|---|---|---|
Page 1
Page 1