[Bank of samples from the Prof_Pat protein family, assessment of efficacy].

Mol Biol (Mosk)

Institute of Molecular Biology, State Research Center of Virology and Biotechnology VECTOR, Russian Ministry of Health, Koltsovo, Novosibirsk Region, 630559 Russia.

Published: August 2004

The PROF_PAT protein pattern database has been created and maintained so as to comprise the maximal number of the SWISS-PROT + TrEMBL proteins as patterns. The present paper describes some characteristic features of PROF_PAT to assist the potential user. New amino acid sequences (10938) from the SWISS-PROT database have been analyzed to determine the boundary values of the "score" parameter to distinguish random and significant similarities. Analysis through the Internet of 20 amino acid sequences having no descriptions in the TrEMBL database demonstrated that PROF_PAT, being highly competitive with its counterparts in specificity, surpasses them in amplitude and variety of proteins, working several times as fast. The real representation of protein families in the PROF_PAT database (release 1.11), which contains 50,149 patterns of 344,429 proteins, has been estimated at 31,450.

Download full-text PDF

Source

Publication Analysis

Top Keywords

prof_pat protein
8
amino acid
8
acid sequences
8
prof_pat
5
[bank samples
4
samples prof_pat
4
protein family
4
family assessment
4
assessment efficacy]
4
efficacy] prof_pat
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!