Immunoglobulins (IGs), critical components of the human immune system, are composed of heavy and light protein chains encoded at three genomic loci. The IG Kappa (IGK) chain locus consists of two large, inverted segmental duplications. The complexity of the IG loci has hindered use of standard high-throughput methods for characterizing genetic variation within these regions.
Summary: Knowledge of immunoglobulin and T cell receptor encoding genes is derived from high-quality genomic sequencing. High-throughput sequencing is delivering large volumes of data, and precise, high-throughput approaches to annotation are needed. Digger is an automated tool that identifies coding and regulatory regions of these genes, with results comparable to those obtained by current expert curational methods.
Introduction: Analysis of an individual's immunoglobulin (IG) gene repertoire requires the use of high-quality germline gene reference sets. When sets only contain alleles supported by strong evidence, AIRR sequencing (AIRR-seq) data analysis is more accurate, and studies of the evolution of IG genes, their allelic variants and the expressed immune repertoire are therefore facilitated.
Methods: The Adaptive Immune Receptor Repertoire Community (AIRR-C) IG Reference Sets have been developed by including only human IG heavy and light chain alleles that have been confirmed by evidence from multiple high-quality sources.
In adaptive immune receptor repertoire analysis, determining the germline variable (V) allele associated with each T- and B-cell receptor sequence is a crucial step. This process is highly impacted by allele annotations. Aligning sequences, assigning them to specific germline alleles, and inferring individual genotypes are challenging when the repertoire is highly mutated, or sequence reads do not cover the whole V region.
Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing.
E1 and E2 (E1E2), the fusion proteins of Hepatitis C Virus (HCV), are unlike those of any other virus yet described, and the detailed molecular mechanisms of HCV entry/fusion remain unknown. Hypervariable region-1 (HVR-1) of E2 is a putative intrinsically disordered protein tail. Here, we demonstrate that HVR-1 has an autoinhibitory function that suppresses the activity of E1E2 on free virions; this is dependent on its conformational entropy.
Adaptive immune receptor repertoires (AIRRs) are rich with information that can be mined for insights into the workings of the immune system. Gene usage, CDR3 properties, clonal lineage structure, and sequence diversity are all capable of revealing the dynamic immune response to perturbation by disease, vaccination, or other interventions. Here we focus on a conceptual introduction to the many aspects of repertoire analysis and orient the reader toward the uses and advantages of each.
High-throughput sequencing of adaptive immune receptor repertoires (AIRR, i.e., IG and TR) has revolutionized the ability to carry out large-scale experiments to study the adaptive immune response.
Immunogenomics studies have been largely limited to individuals of European ancestry, restricting the ability to identify variation in human adaptive immune responses across populations. Inclusion of a greater diversity of individuals in immunogenomics studies will substantially enhance our understanding of human immunology.
The glycoproteins of hepatitis C virus, E1E2, are unlike any other viral fusion machinery yet described and are the current focus of immunogen design in HCV vaccine development, making E1E2 both scientifically and medically important. We used pre-existing, but fragmentary, structures to model a complete ectodomain of the major glycoprotein E2 from three strains of HCV. We then performed molecular dynamics simulations to explore the conformational landscape of E2, revealing a number of important features.
Immunoglobulins or antibodies are the main effector molecules of the B-cell lineage and are encoded by hundreds of variable (V), diversity (D), and joining (J) germline genes, which recombine to generate enormous IG diversity. Recently, high-throughput adaptive immune receptor repertoire sequencing (AIRR-seq) of recombined V-(D)-J genes has offered unprecedented insights into the dynamics of IG repertoires in health and disease. Faithful biological interpretation of AIRR-seq studies depends upon the annotation of raw AIRR-seq data, using reference germline gene databases to identify the germline genes within each rearrangement.
In studying the binding of host antibodies to the surface antigens of pathogens, the structural and functional characterization of antibody-antigen complexes by X-ray crystallography and binding assays is important. However, this characterization requires experiments that are typically time-consuming and expensive; thus, many antibody-antigen complexes are under-characterized. For vaccine development and disease surveillance, it is often vital to assess the impact of amino acid substitutions on antibody binding.
Next-generation sequencing is making it possible to study the antibody repertoire of an organism in unprecedented detail, and, by so doing, to characterize its behavior in the response to infection and in pathological conditions such as autoimmunity and cancer. The polymorphic nature of the repertoire poses unique challenges that rule out the use of many commonly used NGS methods and require tradeoffs to be made when considering experimental design. We outline the main contexts in which antibody repertoire analysis has been used, and summarize the key tools that are available.
There are at present few tools available to assist with the determination and analysis of B-cell lineage trees from next-generation sequencing data. Here we present two utilities that support automated large-scale analysis and the creation of publication-quality results. The tools are available on the web and are also available for download so that they can be integrated into an automated pipeline.
The extent of the role of N-linked glycans (N-glycans) in shielding influenza A hemagglutinin (HA) against host antibodies has proved controversial, with different authors making widely different assumptions. One common assumption is that N-glycans physically shield surface residues that are near to glycosylation sites, thereby preventing antibodies from binding to them. However, it is unclear, from existing experimental evidence, whether antibodies that bind close to N-glycans are a rare or commonplace feature of human herd immune responses to influenza A HA.
Recently, a number of broad-spectrum human antibodies binding to the stalk region of influenza A haemagglutinin (HA) have been isolated. As this region tends to develop substitutions at a slower rate than other regions of HA, a vaccine eliciting such antibodies could have a longer effective life. But this raises a question: is the stalk resistant to change even in the face of evolutionary pressure? In this paper, we analysed the known epitopes in the H3 stalk and, utilizing a collection of 3440 sequences, present a novel approach for detecting putative B-cell epitopes in regions such as this, in which mutations occur infrequently.
In this paper we undertake an analysis of the antigenicity of influenza A virus hemagglutinin. We developed a novel computational approach to the identification of antigenically active regions and showed that the amino acid substitutions between successive predominant seasonal strains form clusters that are consistent, in terms of both their location and their size, with the properties of B-cell epitopes in general and with those epitopes that have been identified experimentally in influenza A virus hemagglutinin to date. Such an interpretation provides a biologically plausible framework for an understanding of the location of antigenically important substitutions that is more specific than the canonical "antigenic site" model and provides an effective basis for deriving models that predict antigenic escape in the H3N2 subtype.
Motivation: Modelling antigenic shift in influenza A H3N2 can help to predict the efficiency of vaccines. The virus is known to exhibit sudden jumps in antigenic distance, and prediction of such novel strains from amino acid sequence differences remains a challenge.
Results: From analysis of 6624 amino acid sequences of wild-type H3, we propose updates to the frequently referenced list of 131 amino acids located at or near the five identified antibody binding regions in haemagglutinin (HA).