Publication | Closed Access
Latent Semantic Structure Indexing (LaSSI) for Defining Chemical Similarity
30
Citations
16
References
2001
Year
EngineeringSimilarity MeasureMolecular BiologyChemical SimilarityText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningPattern RecognitionProbe MoleculeMolecular RecognitionBiochemistryKnowledge DiscoverySingular Value DecompositionDimensionality ReductionBioinformaticsNatural SciencesMolecular PropertyComputational BiologySimilarity SearchSemantic Similarity
A novel method for computing chemical similarity from chemical substructure descriptors is described. This new method, called LaSSI, uses the singular value decomposition (SVD) of a chemical descriptor-molecule matrix to create a low-dimensional representation of the original descriptor space. Ranking molecules by similarity to a probe molecule in the reduced-dimensional space has several advantages over analogous ranking in the original descriptor space: matching latent structures is more robust than matching discrete descriptors, choosing the number of singular values provides a rational way to vary the "fuzziness" of the search, and the reduction in the dimensionality of the chemical space increases searching speed. LaSSI also allows the calculation of the similarity between two descriptors and between a descriptor and a molecule.
| Year | Citations | |
|---|---|---|
Page 1
Page 1