Concepedia

TLDR

Molecular frameworks serve as the leaf nodes of the scaffold hierarchy trees. The method iteratively removes rings—prioritizing peripheral rings first—to build higher‑level scaffolds, and is applied to PubChem datasets of pyruvate kinase binders and pesticides. The deterministic, dataset‑independent classification produces well‑defined scaffolds that scale linearly with compound count and robustly handles both synthetic and natural products.

Abstract

A hierarchical classification of chemical scaffolds (molecular framework, which is obtained by pruning all terminal side chains) has been introduced. The molecular frameworks form the leaf nodes in the hierarchy trees. By an iterative removal of rings, scaffolds forming the higher levels in the hierarchy tree are obtained. Prioritization rules ensure that less characteristic, peripheral rings are removed first. All scaffolds in the hierarchy tree are well-defined chemical entities making the classification chemically intuitive. The classification is deterministic, data-set-independent, and scales linearly with the number of compounds included in the data set. The application of the classification is demonstrated on two data sets extracted from the PubChem database, namely, pyruvate kinase binders and a collection of pesticides. The examples shown demonstrate that the classification procedure handles robustly synthetic structures and natural products.

References

YearCitations

Page 1