Generates consistent PSSM and PDB files for protein-protein complexes

71 commits | Last update: February 12, 2021

Cite this software

Choose a version:
[[ releases.length > 0 ? releases[selectedIndex].doi : conceptDOI ]]
Copy to clipboard
Choose a reference manager file format:
Download file

What PSSMGen can do for you

  • Easy-to-use tool to calculate PSSM
  • Generate PSSM files for docking models of protein-protein complexes
  • Generate PSSM-matched PDB files

PSSMGen is geared toward computing the pssm files for all models of a particular protein-protein complex. This tool assumes your files have following structure: caseID | pdb | fasta | pssm_raw | pssm |_ pdb_raw caseID is the ID of protein-protein complex, e.g. PDB ID 1AK4 or CAPRI target ID T161. Only the pdb dir must exist at run time, and it must contain the PDB files of the models generated by HADDOCK. Based on these PDB files, the code will extract protein sequences and generate the FASTA files (stored in the fasta subdir). It will then use the BLAST tool psiblast to compute PSSM (stored in pssm_raw subdir). Finally the sequences from PSSM files and PDB files will be aligned and then the consistent PSSM files and/or PDB files will generated. The consistent PSSM files are stored in pssm subdir. If applicable, the consistent PDB files are stored in pdb subdir, while the original non-consistent PDB files are moved to pdb_raw subdir. You can then use PDB files from pdb and PSSM files from pssm for further analysis.

Read more
  • Big data
  • Optimized data handling
  • High performance computing
Programming Language
  • Python
  • Apache-2.0
Source code


  • Nicolas Renaud
    Netherlands eScience Center
  • Cunliang Geng
    Netherlands eScience Center
Contact person
Cunliang Geng
Netherlands eScience Center

Information for page maintainers

OAI-PMH metadata:
citation metadata: