Publication | Closed Access
Stigmata: An Algorithm To Determine Structural Commonalities in Diverse Datasets
Citations: 156
References: 17
Year: 1996
Keywords: Structural Commonalities, Engineering, Hit Identification, Similarity Measure, Large-scale Datasets, Multiset Data Analysis, Text Mining, Computational Social Science, Data Science, Data Mining, Data Integration, Biostatistics, Diversity Assessment Tool, Statistics, Molecular Diversity, Knowledge Discovery, Chemometric Method, Neuropharmacology, Omics, Dopamine D2, Pharmacology, Bioinformatics, Dataset Creation, Data Set, Computational Biology, Forensic Toxicology, Data Heterogeneity, Systems Biology, Medicine, Drug Discovery
An algorithm, Stigmata, is described, which extracts structural commonalities from chemical datasets. It is discussed using several illustrative examples and a pharmaceutically interesting set of dopamine D2 agonists. The commonalities are determined using two-dimensional topological chemical descriptions and are incorporated into the key feature of the algorithm, the modal fingerprint. Flexibility is built into the algorithm by means of a user-defined threshold value, which affects the information content of the modal fingerprint. The use of the modal fingerprint as a diversity assessment tool, as a database similarity query, and as a basis for color mapping the determined commonalities back onto the chemical structures is demonstrated.
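The abstract does not spell out how the modal fingerprint is built, so the following is only a minimal sketch of one plausible reading: each molecule is assumed to be represented by a fixed-length binary fingerprint, and a bit is assumed to enter the modal fingerprint when it is set in at least a user-chosen fraction of the dataset. The function names `modal_fingerprint` and `tanimoto`, the toy fingerprints, and the use of the Tanimoto coefficient for the similarity query are illustrative assumptions, not details taken from the paper.

```python
from typing import List, Sequence


def modal_fingerprint(fingerprints: Sequence[Sequence[int]], threshold: float) -> List[int]:
    """Return a consensus bit vector (hypothetical helper, not the paper's code).

    A bit is set in the result when the fraction of input fingerprints
    that have it set is at least `threshold` (assumed to lie in (0, 1]).
    """
    if not fingerprints:
        raise ValueError("at least one fingerprint is required")
    if not 0 < threshold <= 1:
        raise ValueError("threshold must be in the interval (0, 1]")
    n_bits = len(fingerprints[0])
    if any(len(fp) != n_bits for fp in fingerprints):
        raise ValueError("all fingerprints must have the same length")

    n = len(fingerprints)
    # Count how many molecules set each bit position.
    counts = [sum(fp[i] for fp in fingerprints) for i in range(n_bits)]
    return [1 if count / n >= threshold else 0 for count in counts]


def tanimoto(a: Sequence[int], b: Sequence[int]) -> float:
    """Tanimoto coefficient of two bit vectors (an assumed similarity measure)."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


# Toy 6-bit fingerprints for four hypothetical molecules.
fps = [
    [1, 1, 0, 1, 0, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 1, 0, 0, 1, 0],
]

loose = modal_fingerprint(fps, 0.5)    # bits set in at least half of the molecules
strict = modal_fingerprint(fps, 1.0)   # bits set in every molecule

# Using the modal fingerprint as a similarity query against each molecule.
scores = [tanimoto(modal_fingerprint(fps, 0.75), fp) for fp in fps]
```

Under these assumptions, raising the threshold removes bits from the modal fingerprint, which is one way to read the statement that the threshold affects its information content: a strict threshold keeps only features shared by nearly all molecules, while a loose one admits features common to a subset.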