DD.chem.data | R Documentation |
Chemical structure similarities between 1253 FDA approved drugs obtained from DrugBank 3.0.
data(DD.chem.data)
A 1253 x 1253 named matrix where both rows and columns refer to DrugBank drugs. Drugs names are DrugBank 3.0 identifiers.
This matrix contains the Tanimoto chemical structure similarity scores between 1253 DrugBank drugs. Canonical Simplified molecular-input line-entry specifications (SMILES) of the drugs were extracted from the DrugBank Drugcards. The SMILES were then converted into molecular extended fingerprints (1024 bits) using the rcdk package. The set of fingerprints was finally converted into a Tanimoto similarity matrix using the fp.sim.matrix rcdk function. All the real values contained in this matrix represent the chemical structure similarity between each possible pair of drugs and are comprised between 0 (completely different chimical structures) and 1 (identical chemical structures).
The SMILES representations of the drugs can be obtained from: https://go.drugbank.com/releases/latest
Wishart, D., Knox, C., Guo, A., Shrivastava, S., Hassanali, M., Stothard, P., Chang, Z., Woolsey, J.: DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 34(Jan), D668-D672 (2006)
Guha, R.: Chemical Informatics Functionality in R. Journal of Statistical Software 18(6), (2007)
data(DD.chem.data) DD.chem.data[1:10,1:10]
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.