Comput Struct Biotechnol J
February 2023
Immunopeptidomics has made tremendous contributions to our understanding of antigen processing and presentation, by identifying and quantifying antigenic peptides presented on the cell surface by Major Histocompatibility Complex (MHC) molecules. Large and complex immunopeptidomics datasets can now be routinely generated using Liquid Chromatography-Mass Spectrometry techniques. The analysis of this data - often consisting of multiple replicates/conditions - rarely follows a standard data processing pipeline, hindering the reproducibility and depth of analysis of immunopeptidomic data.
View Article and Find Full Text PDFComput Struct Biotechnol J
February 2023
T cells expressing either alpha-beta or gamma-delta T cell receptors (TCR) are critical sentinels of the adaptive immune system, with receptor diversity being essential for protective immunity against a broad array of pathogens and agents. Programs available to profile TCR clonotypic signatures can be limiting for users with no coding expertise. Current analytical pipelines can be inefficient due to manual processing steps, open to data entry errors and have multiple analytical tools with unique inputs that require coding expertise.
View Article and Find Full Text PDFBMC Bioinformatics
February 2022
Background: Gene ontology (GO) enrichment analysis is frequently undertaken during exploration of various -omics data sets. Despite the wide array of tools available to biologists to perform this analysis, meaningful visualisation of the overrepresented GO in a manner which is easy to interpret is still lacking.
Results: Monash Gene Ontology (MonaGO) is a novel web-based visualisation system that provides an intuitive, interactive and responsive interface for performing GO enrichment analysis and visualising the results.
Background: Simple Sequence Repeats (SSRs) are short tandem repeats of nucleotide sequences. It has been shown that SSRs are associated with human diseases and are of medical relevance. Accordingly, a variety of computational methods have been proposed to mine SSRs from genomes.
View Article and Find Full Text PDFComput Struct Biotechnol J
October 2021
Volcano and other analytical plots (e.g., correlation plots, upset plots, and heatmaps) serve as important data visualization methods for transcriptomic and proteomic analyses.
View Article and Find Full Text PDFSARS-CoV-2 has caused a significant ongoing pandemic worldwide. A number of studies have examined the T cell mediated immune responses against SARS-CoV-2, identifying potential T cell epitopes derived from the SARS-CoV-2 proteome. Such studies will aid in identifying targets for vaccination and immune monitoring.
View Article and Find Full Text PDFBoth protease- and reactive oxygen species (ROS)-mediated proteolysis are thought to be key effectors of tissue remodeling. We have previously shown that comparison of amino acid composition can predict the differential susceptibilities of proteins to photo-oxidation. However, predicting protein susceptibility to endogenous proteases remains challenging.
View Article and Find Full Text PDFPurpose: To measure the prevalence of medically actionable pathogenic variants (PVs) among a population of healthy elderly individuals.
Methods: We used targeted sequencing to detect pathogenic or likely pathogenic variants in 55 genes associated with autosomal dominant medically actionable conditions, among a population of 13,131 individuals aged 70 or older (mean age 75 years) enrolled in the ASPirin in Reducing Events in the Elderly (ASPREE) trial. Participants had no previous diagnosis or current symptoms of cardiovascular disease, physical disability or dementia, and no current diagnosis of life-threatening cancer.
Motivation: Proteases are enzymes that cleave target substrate proteins by catalyzing the hydrolysis of peptide bonds between specific amino acids. While the functional proteolysis regulated by proteases plays a central role in the 'life and death' cellular processes, many of the corresponding substrates and their cleavage sites were not found yet. Availability of accurate predictors of the substrates and cleavage sites would facilitate understanding of proteases' functions and physiological roles.
View Article and Find Full Text PDFPost-translational modifications (PTMs) play very important roles in various cell signaling pathways and biological process. Due to PTMs' extremely important roles, many major PTMs have been studied, while the functional and mechanical characterization of major PTMs is well documented in several databases. However, most currently available databases mainly focus on protein sequences, while the real 3D structures of PTMs have been largely ignored.
View Article and Find Full Text PDFWith the explosive growth of biological sequences generated in the post-genomic era, one of the most challenging problems in bioinformatics and computational biology is to computationally characterize sequences, structures and functions in an efficient, accurate and high-throughput manner. A number of online web servers and stand-alone tools have been developed to address this to date; however, all these tools have their limitations and drawbacks in terms of their effectiveness, user-friendliness and capacity. Here, we present iLearn, a comprehensive and versatile Python-based toolkit, integrating the functionality of feature extraction, clustering, normalization, selection, dimensionality reduction, predictor construction, best descriptor/model selection, ensemble learning and results visualization for DNA, RNA and protein sequences.
View Article and Find Full Text PDFThe roles of proteolytic cleavage have been intensively investigated and discussed during the past two decades. This irreversible chemical process has been frequently reported to influence a number of crucial biological processes (BPs), such as cell cycle, protein regulation and inflammation. A number of advanced studies have been published aiming at deciphering the mechanisms of proteolytic cleavage.
View Article and Find Full Text PDFBackground: A strong focus of the post-genomic era is mining of the non-coding regulatory genome in order to unravel the function of regulatory elements that coordinate gene expression (Nat 489:57-74, 2012; Nat 507:462-70, 2014; Nat 507:455-61, 2014; Nat 518:317-30, 2015). Whole-genome approaches based on next-generation sequencing (NGS) have provided insight into the genomic location of regulatory elements throughout different cell types, organs and organisms. These technologies are now widespread and commonly used in laboratories from various fields of research.
View Article and Find Full Text PDFSummary: Evolutionary information in the form of a Position-Specific Scoring Matrix (PSSM) is a widely used and highly informative representation of protein sequences. Accordingly, PSSM-based feature descriptors have been successfully applied to improve the performance of various predictors of protein attributes. Even though a number of algorithms have been proposed in previous studies, there is currently no universal web server or toolkit available for generating this wide variety of descriptors.
View Article and Find Full Text PDFMeasuring the altered gene expression level and identifying differentially expressed genes/proteins during HIV infection, replication and latency is fundamental for broadening our understanding of the mechanisms of HIV infection and T-cell dysfunction. Such studies are crucial for developing effective strategies for virus eradication from the body. Inspired by the availability and enrichment of gene expression data during HIV infection, replication and latency, in this study, we proposed a novel compendium termed HIVed (HIV expression database; http://hivlatency.
View Article and Find Full Text PDFBacteria translocate effector molecules to host cells through highly evolved secretion systems. By definition, the function of these effector proteins is to manipulate host cell biology and the sequence, structural and functional annotations of these effector proteins will provide a better understanding of how bacterial secretion systems promote bacterial survival and virulence. Here we developed a knowledgebase, termed SecretEPDB (Bacterial Secreted Effector Protein DataBase), for effector proteins of type III secretion system (T3SS), type IV secretion system (T4SS) and type VI secretion system (T6SS).
View Article and Find Full Text PDFGlycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMine(http://glycomine.
View Article and Find Full Text PDFThe Bioinformatics Training Platform (BTP) has been developed to provide access to the computational infrastructure required to deliver sophisticated hands-on bioinformatics training courses. The BTP is a cloud-based solution that is in active use for delivering next-generation sequencing training to Australian researchers at geographically dispersed locations. The BTP was built to provide an easy, accessible, consistent and cost-effective approach to delivering workshops at host universities and organizations with a high demand for bioinformatics training but lacking the dedicated bioinformatics training suites required.
View Article and Find Full Text PDFThere is a clear demand for hands-on bioinformatics training. The development of bioinformatics workshop content is both time-consuming and expensive. Therefore, enabling trainers to develop bioinformatics workshops in a way that facilitates reuse is becoming increasingly important.
View Article and Find Full Text PDFThe widespread adoption of high-throughput next-generation sequencing (NGS) technology among the Australian life science research community is highlighting an urgent need to up-skill biologists in tools required for handling and analysing their NGS data. There is currently a shortage of cutting-edge bioinformatics training courses in Australia as a consequence of a scarcity of skilled trainers with time and funding to develop and deliver training courses. To address this, a consortium of Australian research organizations, including Bioplatforms Australia, the Commonwealth Scientific and Industrial Research Organisation and the Australian Bioinformatics Network, have been collaborating with EMBL-EBI training team.
View Article and Find Full Text PDF