Classification of protein families from their sequences is an enduring task in proteomics and related studies. Numerous deep-learning models have been developed to tackle this challenge, but because of their black-box character, they still fall short in reliability. Here, we present a novel explainability pipeline that explains the pivotal decisions of a deep-learning model in the classification of the eukaryotic kinome.
The tiger, a poster child for conservation, remains an endangered apex predator. Continued survival and recovery will require a comprehensive understanding of genetic diversity and the use of such information for population management. A high-quality tiger genome assembly will be an important tool for conservation genetics, especially for the Indian tiger, the most abundant subspecies in the wild.
Drug-food interactions change the clinical effects of drugs. While favourable interactions bring positive clinical outcomes, unfavourable interactions may lead to toxicity. This article reviews the impact of food intake on drug-food interactions, the clinical effects of drugs, and the effect of drug-food interactions in relation to diet and precision medicine.
Sequence alignments are models that capture the structural, functional and evolutionary relationships between proteins. Structure-guided sequence alignments are helpful for distantly related proteins, whose poor sequence identity renders routine sequence alignment methods ineffective. The Protein Alignment organized as Structural Superfamilies (PASS2) database provides such sequence alignments of protein domains within a superfamily as per the Structural Classification of Proteins extended (SCOPe) database.
The PRKAG2 syndrome is a rare autosomal dominant phenocopy of sarcomeric hypertrophic cardiomyopathy (HCM), characterized by ventricular pre-excitation, progressive conduction system disease and left ventricular hypertrophy. This study describes the phenotype, genotype and clinical outcomes of a South-Asian PRKAG2 cardiomyopathy cohort over a 7-year period. Clinical, electrocardiographic, echocardiographic, and cardiac MRI data from 22 individuals with PRKAG2 variants (68% men; mean age 39.
This protocol describes a stepwise process to identify proteins of interest from a query proteome derived from NGS data. We implemented this protocol on transcriptome to identify proteins involved in secondary metabolite and vitamin biosynthesis and ion transport. This knowledge-driven protocol identifies proteins using an integrated approach involving sensitive sequence search and evolutionary relationships.
In this paper, we present the data acquired during transcriptome analysis of Moringa oleifera [1] from five different tissues (root, stem, leaf, flower and seed) by RNA sequencing. A total of 271 million reads were assembled with an N50 of 2094 bp. The combined transcriptome was assessed for transcript abundance across the five tissues.
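For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is commonly computed: sort the sequence lengths in descending order and report the length at which the running total first reaches half of the total assembled bases. The function name `n50` and the toy lengths are illustrative assumptions and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or transcript lengths.

    Hypothetical helper for illustration; not the pipeline used in the study.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length that pushes the running sum to half the total is the N50.
        if running >= half_total:
            return length


# Toy data (made up): total is 1500, so half is 750; 500 + 400 = 900 reaches it.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))  # prints 400
```

Some tools apply slightly different conventions (for example, filtering short sequences first), so values from different assemblers may not be directly comparable.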
Snakebite envenoming is a serious and neglected tropical disease that kills ~100,000 people annually. High-quality, genome-enabled comprehensive characterization of toxin genes will facilitate development of effective humanized recombinant antivenom. We report a de novo near-chromosomal genome assembly of Naja naja, the Indian cobra, a highly venomous, medically important snake.
We report the draft genome sequence of sp. strain BPSDS2, isolated from freshwater sediments in Northeast India. The draft genome has a size of 8.
Sepsis is a series of clinical syndromes caused by the immunological response to infection. The clinical evidence for sepsis can typically be attributed to bacterial infection or bacterial endotoxins, but infections due to viruses, fungi or parasites can also lead to sepsis. Regardless of the etiology, rapid clinical deterioration, prolonged stay in intensive care units and high risk of mortality correlate with the incidence of sepsis.
strain BPSAC147 is an endophytic actinobacterium isolated from root tissues of in Mizoram, Northeast India. The strain showed potential plant growth-promoting and biocontrol activities. The draft genome consists of 7.
Endophytic strain BPSAC84, which has antimicrobial potential, was isolated from root tissues of in Mizoram, Northeast India. The draft genome consists of 3.58 Mb and 3,444 protein-coding sequences.
Moringa oleifera is a plant well known for its nutritional value, drought resistance and medicinal properties. cDNA libraries from five different tissues (leaf, root, stem, seed and flower) of M. oleifera cultivar Bhagya were generated and sequenced.
The number of protein structures is increasing due to individual initiatives and the rapid development of structure determination techniques. Structure-based sequence alignments of distantly related proteins enable the investigation of structural, evolutionary and functional relationships between proteins and their domains, leading to their common evolutionary origin. Protein Alignments organized as Structural Superfamilies (PASS2) is a database that provides such alignments of members of protein domain superfamilies of known structure and with less than 40% sequence identity.
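As a rough illustration of the "less than 40% sequence identity" criterion mentioned above, the sketch below computes percent identity between two rows of an existing alignment. It assumes one common convention, counting identical residues over columns where neither sequence has a gap; other denominators are also in use, and this is not necessarily the definition applied by PASS2. The function name `percent_identity` and the toy sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity over alignment columns where neither row has a gap.

    Assumption: the inputs are two equal-length rows of one alignment.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns in which both sequences have a residue.
    aligned = [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]
    if not aligned:
        return 0.0
    matches = sum(a == b for a, b in aligned)
    return 100.0 * matches / len(aligned)


# Toy aligned pair (made up): 5 ungapped columns, 4 of them identical.
print(percent_identity("ACDE-FG", "ACQEHF-"))  # prints 80.0
```

Under this convention, a pair of domains would fall under the 40% threshold when the returned value is below 40.0.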
Background: RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies.
Bioinform Biol Insights
August 2016
Protein-protein interactions play a very important role in cellular function. Intricate details about the interactions between the proteins in a macromolecular assembly are important for understanding the function and significance of protein complexes. We report a database of protein-protein interactions in huge macromolecular assemblies (PIMADb) that records the intrinsic details of 189,532 interchain interactions in 40,049 complexes from the Protein Data Bank.
Modern sequencing technologies are generating large volumes of information at the transcriptome and genome level. Translation of this information into biological meaning lags far behind, so a significant portion of the proteins discovered remain proteins of unknown function (PUFs). Attempts to uncover the functional significance of PUFs are limited by the lack of easy, high-throughput functional annotation tools.
Structure-based sequence alignment is an essential step in assessing and analysing the relationship of distantly related proteins. PASS2 is a database that records such alignments for protein domain superfamilies and has been updated periodically. This update of the PASS2 version, named as PASS2.
Background: Krishna Tulsi, a member of the Lamiaceae family, is a herb well known for its spiritual, religious and medicinal importance in India. The common name of this plant is 'Tulsi' (or 'Tulasi' or 'Thulasi') and it is considered sacred by Hindus. We present the draft genome of Ocimum tenuiflorum L. (subtype Krishna Tulsi) in this report.
Background: Proteins interact with a variety of other molecules such as nucleic acids, small molecules and other proteins inside the cell. Structure determination of protein-protein complexes is challenging for several reasons, such as the large molecular weights of these macromolecular complexes, their dynamic nature, and difficulty in purification and sample preparation. Computational docking permits an early understanding of the feasibility and mode of protein-protein interactions.
Biological enrichment analysis using gene ontology (GO) provides a global overview of the functional role of genes or proteins identified from large-scale genomic or proteomic experiments. Phenomic enrichment analysis of gene lists can provide an important layer of information in addition to the cellular components, molecular functions, and biological processes associated with gene lists. Plant phenomic enrichment analysis will be useful for performing new experiments to better understand plant systems and for the interpretation of genes or proteins identified from high-throughput experiments.
Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels.
Understanding the principles of abiotic and biotic stress responses, tolerance and adaptation remains important in plant physiology research to develop better varieties of crop plants. Better understanding of plant stress response mechanisms and application of knowledge derived from integrated experimental and bioinformatics approaches are gaining importance. Earlier, we showed that compiling a database of stress-responsive transcription factors and their corresponding target binding sites in the form of Hidden Markov models at promoter, untranslated and upstream regions of stress-up-regulated genes from expression analysis can help in elucidating various aspects of the stress response in Arabidopsis.