Description Usage Arguments Value Note Author(s) References Examples
Looks for amino acid and/or nucleotide patterns coordinated to a given prosthetic centre. It also accounts for small molecule ligands. Patterns are aligned, clustered and translated to logo-like sequences to infer coordination motifs.
| 1 | PdPDB(path, metal, n, perc, interactive, dropsReplicate)
 | 
| path | A string containing the path to the PDB directory. | 
| metal | A string containing the PDB chemical symbol of the target prosthetic centre; e.g. SF4 for [4Fe-4S] cluster, ZN for zinc. The PDB chemical symbol is case sensitive for macOS. | 
| n | A numerical value that contains the number or residue in following/preceding n positions from the ligated amino acid or nucleotide; if n=1 PdPDB searches for x(L)x motif-like chains, if n=2 for xx(L)xx. (L)igand. | 
| perc | A numerical value about the minimum percent of letters in a column otherwise residues are dropped. | 
| interactive | A numerical value. 0 interactive, 1 automated (will not cut dendrogram), 2 user decided cut. In mode 1 and 2 ExPASy amino acid frequencies are used as reference. | 
| dropsReplicate | A numerical value. 0 keeps replicated patterns, 1 drops replicated patterns entry by entry, 2 keeps only unique patterns. | 
PdPDB generates a list of ".csv" and ".svg" files that will be stored in the same folder of the analyzed pdb/cif files (see "path"), its output is as follows:
| frequency.csv | PDB-like patterns (i.e. with PDB chem Ids). "-" and "+" are used for residues out of the n inspecting window or from different monomers, respectively. Patterns come along with their frequency. | 
| alignment.csv | Ligand-aligned patterns with dashes, plus signs and gaps ("*"). See 'frequency.csv'. | 
| following_X_enrichment.csv | n files. Each file contains enrichment score, z-score and statistics at up to n following positions. X is the +position from ligated residue. | 
| ligands_enrichment.csv | Enrichment scores and statistics for ligands. | 
| notLigands_enrichment.csv | Enrichment statistics for the whole specimen but ligands. | 
| preceeding_X_enrichment.csv | As for "following" but this is meant for residues preceeding ligands. See "following_X_enrichment.csv." | 
| root_enrichment.csv | Overall enrichment score. | 
| logo_Y.csv | Y files. Each file contains the logo and consensus sequence for a cluster. Y is the cluster number. | 
| dendrogram.svg | The dendrogram along with the user deciced cutoff and clusters. | 
| following_X_proportions.svg | Plot of the enrichment score per each amino acid in following positions. | 
| ligands_proportions.svg | Plot of the enrichment score per each amino acid in ligated position. | 
| notLigands_proportions.svg | Plot of the enrichment score per each amino acid in non ligated position. | 
| preceeding_X_proportions.svg | Plot of the enrichment score per each amino acid in preceeding positions. | 
| root_proportions.svg | Plot of the root enrichment score. | 
| logo_Y.svg | Plot of the logo and consensus sequence of the Yth cluster. The complete aligned cluster is given as homonym '.csv' file. Sequences come along with percentages. If the dendrogram is not cut the root logo is given. | 
| following_X_standardized.svg | Plot of the z-score per each amino acid in following positions. | 
| ligands_standardized.svg | Plot of the z-score per each amino acid in ligated position. | 
| notLigands_standardized.svg | Plot of the z-score per each amino acid in non ligated position. | 
| preceeding_X_standardized.svg | Plot of the z-score per each amino acid in preceeding positions. | 
| root_standardized.svg | Plot of the root z-score. | 
| patterns.csv | PDB like extracted patterns along with the PDB ID and metal IDs. Useful for debbugging. Needed for restore. | 
| PdPDB.log | PdPDB log file. Useful for debbugging. Needed for restore. | 
Files have to be in the local file system and contain the ".pdb" or ".cif" extension. Output files use brackets to highlight ligands and/or 'L' in heading line.
Luca Belmonte, Sheref S. Mansy
Belmonte L, Mansy SS Patterns of Ligands Coordinated to Metallocofactors Extracted from the Protein Data Bank, Journal of Chemical Information and Modeling (accepted)
| 1 2 3 4 5 6 7 8 9 10 11 12 | ################ Defining path to PDBs
path_to_PDB="inst/extdata/PDB" # this is where pdb/cif files are stored
################ Research Parameters
metal="SF4"  # searches for [4fe-4s] coordinating patterns
n=1  # searches for x(L)x patterns, (L) coordinates to SF4
perc=20  # drops residues with less than the 20% of frequency
interactive= 0 # interactive. User decided references and dendrogram cut
dropsReplicate=0 # do not remove replicated patterns 
################ Launch PdPDB
PdPDB(path_to_PDB,metal,n, perc, interactive, dropsReplicate)
 | 
Loading required package: plyr
Loading required package: dendextend
---------------------
Welcome to dendextend version 1.14.0
Type citation('dendextend') for how to cite the package.
Type browseVignettes(package = 'dendextend') for the package vignette.
The github page is: https://github.com/talgalili/dendextend/
Suggestions and bug-reports can be submitted at: https://github.com/talgalili/dendextend/issues
Or contact: <tal.galili@gmail.com>
	To suppress this message use:  suppressPackageStartupMessages(library(dendextend))
---------------------
Attaching package: ‘dendextend’
The following object is masked from ‘package:stats’:
    cutree
Loading required package: tseries
Registered S3 method overwritten by 'quantmod':
  method            from
  as.zoo.data.frame zoo 
[1] "########################################################## "
[1] "          PdPDB: Pattern discovery in PDB files"
[1] "########################################################## "
Do you want PdPDB writes results in  inst/extdata/PDB ? [1= Yes, 0=No] 
[1] "PdPDB now keeps replicated patterns..."
sh: 1: cd: can't cd to inst/extdata/PDB
sh: 1: /usr/lib/R/site-library/PdPDB/exec/fileManager: Permission denied
[1] "PdPDB runs from scratch!"
[1] "Invalid PATH! No PDB entries in this directory!"
[1] "Patterns won't be analyzed!!!"
sh: 1: rm: Permission denied
sh: 1: cannot create inst/extdata/PDBPdPDB.log: Directory nonexistent
sh: 1: cannot create inst/extdata/PDBPdPDB.log: Directory nonexistent
[1] " "
[1] "---------------------------------------------------------- "
[1] "Logos and output files (.csv, .svg, .log) are in  inst/extdata/PDB"
[1] "---------------------------------------------------------- "
[1] " "
[1] "Legend for 'frequencies.csv' file: () = ligand, - = conect residues interleaved by of more than a position in the same chain, ... = conect residues from different chains"
[1] "Legend for logos and 'alignment.csv' file: * = gap introduced by ligands alignment"
[1] " "
[1] "########################################################## "
sh: 1: cannot create inst/extdata/PDBPdPDB.log: Directory nonexistent
sh: 1: cannot create inst/extdata/PDBPdPDB.log: Directory nonexistent
[1] "            PdPDB exited gracefully...!!!"
[1] "########################################################## "
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.