Motivation: The versatile binding properties of antibodies have made them an extremely important class of biotherapeutics. However, therapeutic antibody development is a complex, expensive, and time-consuming task, with the final antibody needing to not only have strong and specific binding but also be minimally impacted by developability issues. The success of transformer-based language models in protein sequence space and the availability of vast amounts of antibody sequences have led to the development of many antibody-specific language models to help guide antibody design.
T cell activation is governed through T cell receptors (TCRs), heterodimers of two sequence-variable chains (often an α and β chain) that synergistically recognize antigen fragments presented on cell surfaces. Despite this, there only exist repositories dedicated to collecting single-chain, not paired-chain, TCR sequence data. We addressed this gap by creating the Observed TCR Space (OTS) database, a source of consistently processed and annotated, full-length, paired-chain TCR sequences.
Antibodies are key proteins of the adaptive immune system, and there exists a large body of academic literature and patents dedicated to their study and concomitant conversion into therapeutics, diagnostics, or reagents. These documents often contain extensive functional characterisations of the sets of antibodies they describe. However, leveraging these heterogeneous reports, for example to offer insights into the properties of query antibodies of interest, is currently challenging as there is no central repository through which this wide corpus can be mined by sequence or structure.
Antibodies with similar amino acid sequences, especially across their complementarity-determining regions, often share properties. Finding that an antibody of interest has a similar sequence to naturally expressed antibodies in healthy or diseased repertoires is a powerful approach for the prediction of antibody properties, such as immunogenicity or antigen specificity. However, as the number of available antibody sequences is now in the billions and continuing to grow, repertoire mining for similar sequences has become increasingly computationally expensive.
Bioinformatics tools were used to predict radical scavenging and metal chelating activities of peptides derived from abundant potato, seaweed, microbial, and spinach proteins. The antioxidant activity was evaluated in 5% oil-in-water emulsions (pH 4), and the best-performing peptides were tested in mayonnaise and compared with EDTA. Emulsion physical stability was intact.
Motivation: General protein language models have been shown to summarize the semantics of protein sequences into representations that are useful for state-of-the-art predictive methods. However, for antibody specific problems, such as restoring residues lost due to sequencing errors, a model trained solely on antibodies may be more powerful. Antibodies are one of the few protein types where the volume of sequence data needed for such language models is available, e.
In this study, we used a combination of quantitative proteomics and bioinformatic prediction for identifying novel antioxidant peptides. Thirty-five peptides from potato, seaweed, microbial, and spinach proteins were investigated. Based on high DPPH radical scavenging activity (IC50 ≤ 16 mg/mL), metal chelation activity, isoelectric point, and high relative abundance in the parent protein sources, 11 peptides were selected.
The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B-cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in-depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare.
Global focus on sustainability has accelerated research into alternative non-animal sources of food protein and functional food ingredients. Amphiphilic peptides represent a class of promising biomolecules to replace chemical emulsifiers in food emulsions. In contrast to traditional trial-and-error enzymatic hydrolysis, this study utilizes a bottom-up approach combining quantitative proteomics, bioinformatics prediction, and functional validation to identify novel emulsifier peptides from seaweed, methanotrophic bacteria, and potatoes.
C-reactive protein (CRP) is widely used as a biomarker of infection and inflammation. It has a well-described ability to bind phosphocholine (PC), as well as PC-clusters from compromised and inflamed cell membranes and tissues. The binding of PC-clusters to CRP is of interest as this binding determines subsequent innate immune activity.
Dietary antioxidants are important preservatives in food and have been suggested to help in disease prevention. With consumer demands for less synthetic and safer additives in food products, the food industry is searching for antioxidants that can be marketed as natural. Peptides derived from natural proteins show promise, as they are generally regarded as safe and potentially contain other beneficial bioactivities.
Motivation: Monoclonal antibodies are essential tools in the contemporary therapeutic armory. Understanding how these recognize their antigen is a fundamental step in their rational design and engineering. The rising amount of publicly available data is catalyzing the development of computational approaches able to offer valuable, faster and cheaper alternatives to classical experimental methodologies used for the study of antibody-antigen complexes.
In this work, we developed a novel approach combining bioinformatics, testing of functionality and bottom-up proteomics to obtain peptide emulsifiers from potato side-streams. This is a significant advancement in the process to obtain emulsifier peptides and it is applicable to any type of protein. Our results indicated that structure at the interface is the major determining factor of the emulsifying activity of peptide emulsifiers.
The interaction between the class I major histocompatibility complex (MHC), the peptide presented by the MHC and the T-cell receptor (TCR) is a key determinant of the cellular immune response. Here, we present TCRpMHCmodels, a method for accurate structural modelling of the TCR-peptide-MHC (TCR-pMHC) complex. This TCR-pMHC modelling pipeline takes as input the amino acid sequence and generates models of the TCR-pMHC complex, with a median Cα RMSD of 2.