Human endogenous retroviruses (HERVs) occupy a large portion of the human genome. Most HERVs are transcriptionally silent, but they can be reactivated during pathological states such as viral infection and certain cancers. The HERV-K HML-2 clade includes elements that recently integrated have in the human germ line and often contain intact open reading frames that possibly support peptide and protein expression.
View Article and Find Full Text PDFImmunoglobulins (IGs), crucial components of the adaptive immune system, are encoded by three genomic loci. However, the complexity of the IG loci severely limits the effective use of short read sequencing, limiting our knowledge of population diversity in these loci. We leveraged existing long read whole-genome sequencing (WGS) data, fosmid technology, and IG targeted single-molecule, real-time (SMRT) long-read sequencing (IG-Cap) to create haplotype-resolved assemblies of the IG Lambda (IGL) locus from 6 ethnically diverse individuals.
View Article and Find Full Text PDFThe discovery of N-methyldeoxyadenine (6mA) across eukaryotes led to a search for additional epigenetic mechanisms. However, some studies have highlighted confounding factors that challenge the prevalence of 6mA in eukaryotes. We developed a metagenomic method to quantitatively deconvolve 6mA events from a genomic DNA sample into species of interest, genomic regions, and sources of contamination.
View Article and Find Full Text PDF