Compared with the conventional amino acid (AA) composition, the pseudo-amino acid (PseAA) composition, originally introduced for predicting protein subcellular location, can incorporate much more information from a protein sequence and thereby markedly enhances the power of a discrete model to predict various attributes of a protein. In this study, based on the concept of PseAA composition, the approximate entropy and hydrophobicity pattern of a protein sequence are used to characterize the PseAA components. The immune genetic algorithm (IGA) is applied to search for the optimal weight factors in generating the PseAA composition. Thus, for a given protein sequence sample, a 27-D (dimensional) PseAA composition is generated as its descriptor. The fuzzy K nearest neighbors (FKNN) classifier is adopted as the prediction engine. The results obtained in predicting protein structural classification are quite encouraging, indicating that the current approach may also be used to improve the prediction quality of other protein attributes, or at least can play a complementary role to the existing methods in the relevant areas. Our algorithm is written in Matlab and is available by contacting the corresponding author.
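To make the approximate-entropy ingredient more concrete, the following is a minimal Python sketch of the standard (Pincus) approximate entropy applied to a hydropathy profile of a sequence. It is not the authors' Matlab code. The Kyte-Doolittle scale, the window length m = 2, the tolerance r = 0.2 times the standard deviation, the function names, and the example sequence are all assumptions chosen for illustration; the abstract does not state how the sequence is encoded or which parameters are used.

```python
import math
import statistics

# Kyte-Doolittle hydropathy scale. Assumption: the abstract does not say
# which hydrophobicity encoding the authors use.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence):
    """Map a protein sequence to a list of hydropathy values.

    Residues missing from the table (for example 'X') are skipped.
    """
    return [KYTE_DOOLITTLE[aa] for aa in sequence.upper() if aa in KYTE_DOOLITTLE]


def approximate_entropy(series, m=2, r=None):
    """Approximate entropy ApEn(m, r) of a numeric series (Pincus definition).

    m is the window length and r the tolerance. If r is None it defaults to
    0.2 times the population standard deviation, a common rule of thumb
    (assumed here, not taken from the abstract).
    """
    n = len(series)
    if n < m + 2:
        raise ValueError("series is too short for the chosen window length")
    if r is None:
        r = 0.2 * statistics.pstdev(series)

    def phi(dim):
        # All overlapping windows of length dim.
        count = n - dim + 1
        windows = [series[i:i + dim] for i in range(count)]
        total = 0.0
        for a in windows:
            # Count windows within Chebyshev distance r of a; the self-match
            # is included, so the logarithm is always defined.
            matches = sum(
                1 for b in windows
                if max(abs(x - y) for x, y in zip(a, b)) <= r
            )
            total += math.log(matches / count)
        return total / count

    return phi(m) - phi(m + 1)


if __name__ == "__main__":
    # Made-up sequence, used only to show the call pattern.
    example = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
    profile = hydropathy_profile(example)
    print("ApEn of hydropathy profile:", approximate_entropy(profile))
```

A value computed this way could serve as one extra component appended to the 20 amino acid frequencies; how the paper combines and weights its 27 components is not specified in the abstract, so that step is left out here.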
DOI: http://dx.doi.org/10.1016/j.jtbi.2007.09.014
J Theor Biol
November 2018
School of Internet of Things Engineering, Wuxi City College of Vocational Technology, Wuxi 214153, China.
Venomous animals produce toxins that inhibit ion channels with high affinity. These small peptide inhibitors are used in the characterization of ion channels structurally as well as pharmacologically. So, identification of these toxins is an important task.
J Theor Biol
June 2018
School of Internet of Things Engineering, Wuxi City College of Vocational Technology, Wuxi 214153, China.
Presynaptic neurotoxins and postsynaptic neurotoxins are two important classes of neurotoxins isolated from the venoms of venomous animals and have proven potentially useful in neuroscience and pharmacology. As more toxin sequences appear in the public databases, there is a need for a computational method for fast and accurate identification and classification of novel presynaptic and postsynaptic neurotoxins in large databases. In this study, a Multinomial Naive Bayes Classifier (MNBC) was developed to discriminate presynaptic neurotoxins from postsynaptic neurotoxins based on different kinds of features.
Sci Rep
February 2018
School of Internet of Things Engineering, Wuxi City College of Vocational Technology, Wuxi, 214153, China.
Human immunodeficiency virus (HIV) is the retroviral agent that causes acquired immune deficiency syndrome (AIDS). The number of HIV-caused deaths was about 4 million in 2016 alone, and an estimated 33 million to 46 million people worldwide were living with HIV. HIV disease is especially harmful because the progressive destruction of the immune system impairs the ability to form specific antibodies and to maintain efficacious killer T cell activity.
Sci Rep
July 2017
College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China.
Presynaptic and postsynaptic neurotoxins are two groups of neurotoxins. Identifying which group a newly found toxin belongs to is an important task, given the many toxins now being discovered. Determining this by experimental methods is both costly and time-consuming.
BMC Syst Biol
December 2016
Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100010, China.
Background: Protein-protein interactions (PPIs) are essential to most biological processes. Now that bioscience has entered the era of genomics and proteomics, there is a growing demand for knowledge of PPI networks. High-throughput biological technologies can be used to identify new PPIs, but they are expensive, time-consuming, and tedious.