Many studies have used position-specific scoring matrices (PSSM) profiles to characterize residues in protein structures and to predict a broad range of protein features. Moreover, PSSM profiles of Protein Data Bank (PDB) entries have been recalculated in many works for different purposes. Although the computational cost of calculating a single PSSM profile is affordable, many statistical studies or machine learning-based methods used thousands of profiles to achieve their goals, thereby leading to a substantial increase of the computational cost. In this work we present a new database compiling PSSM profiles for the proteins of the PDB. Currently, the database contains 333,532 protein chain profiles involving 123,135 different PDB entries.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6149929PMC
http://dx.doi.org/10.3390/molecules22122230DOI Listing

Publication Analysis

Top Keywords

pssm profiles
12
position-specific scoring
8
scoring matrices
8
protein structures
8
pdb entries
8
computational cost
8
protein
5
profiles
5
3dcons-db database
4
database position-specific
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!