In computational biology, the Protein Remote homology Detection technique (PRHD) has got undeniable significance. It is mostly important for structure and function identification of a protein sequence. The previous years have seen a challenge that lacks postulating a correlation among the sequences. However, the sequences are of variable length. Thereby, it inhibits the proper derivation of evolutionary information among the sequences. The challenges are the usage of physico-chemical properties as a source to get the evolutionary information and the number of sequences generated every day. This however facilitates a new technique to integrate huge amount of data with a massive feature set. In this article, a new and efficient technique is proposed to predict homology for distantly located sequences of proteins. Deep neural network(CNN-GRU model) is used for the classification of the protein sequences. This is based on different protein families and methods of feature extraction.The efficiency of the proposed model DeepRHD is tested on average 8000 sequences per superfamily taken from SCOP benchmark dataset and the results shows that the proposed model is better than other state of art methods. This model is useful in detecting diseases like sickle cell anemia and influenza and developing a drug thereafter.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2022.107749DOI Listing

Publication Analysis

Top Keywords

protein remote
8
remote homology
8
homology detection
8
proposed model
8
sequences
7
protein
5
deeprhd efficient
4
efficient hybrid
4
hybrid feature
4
feature extraction
4

Similar Publications

Introduction: The 6-minute walk test (6MWT) is used to assess submaximal exercise capacity in clinical trials. Conducting the 6MWT can be challenging when patients cannot visit the clinic due to physical/travel limitations. This pilot study assessed the feasibility of conducting the 6MWT using wearable sensors for patients with transthyretin amyloid cardiomyopathy.

View Article and Find Full Text PDF

Colorectal cancer (CRC), one of the most common tumors in the world, is generally proposed to be generated from intestinal stem cells (ISCs). Leucine-rich repeat-containing G protein-coupled receptor 5 (Lgr5)-positive ISCs are located at the bottom of the crypt and harbor self-renewal and differentiation capacities, serving as the resource of all intestinal epithelial cells and CRC cells as well. Here we review recent progress in ISCs both in non-tumoral and tumoral contexts.

View Article and Find Full Text PDF

Background: Numerous studies have assessed the risk of SARS-CoV-2 exposure and infection among health care workers during the pandemic. However, far fewer studies have investigated the impact of SARS-CoV-2 on essential workers in other sectors. Moreover, guidance for maintaining a safely operating workplace in sectors outside of health care remains limited.

View Article and Find Full Text PDF

ConspectusIn the search for efficient and selective electrocatalysts capable of converting greenhouse gases to value-added products, enzymes found in naturally existing bacteria provide the basis for most approaches toward electrocatalyst design. Ni,Fe-carbon monoxide dehydrogenase (Ni,Fe-CODH) is one such enzyme, with a nickel-iron-sulfur cluster named the C-cluster, where CO binds and is converted to CO at high rates near the thermodynamic potential. In this Account, we divide the enzyme's catalytic contributions into three categories based on location and function.

View Article and Find Full Text PDF

Major advances in protein function assignment by remote homolog detection with protein language models - A review.

Curr Opin Struct Biol

January 2025

Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA 50011, USA; Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA 50011, USA. Electronic address:

There is an ever-increasing need for accurate and efficient methods to identify protein homologs. Traditionally, sequence similarity-based methods have dominated protein homolog identification for function identification, but these struggle when the sequence identity between the pairs is low. Recently, transformer architecture-based deep learning methods have achieved breakthrough performances in many fields.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!