De Novo Genome Assemblies From Two Indigenous Americans from Arizona Identify New Polymorphisms in Non-Reference Sequences.

Genome Biol Evol

Diabetes Molecular Genetics Section, Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Phoenix, AZ 85004, USA.

Published: September 2024

There is a collective push to diversify human genetic studies by including underrepresented populations. However, analyzing DNA sequence reads involves the initial step of aligning the reads to the GRCh38/hg38 reference genome, which is inadequate for non-European ancestries. In this study, using long-read sequencing technology, we constructed de novo genome assemblies from two indigenous Americans from Arizona (IAZ). Each assembly included ∼17 Mb of DNA sequence not present in hg38 [nonreference sequence (NRS)], which consists mostly of repeat elements. Forty NRSs totaling 240 kb were uniquely anchored to the hg38 primary assembly, generating a modified hg38-NRS reference genome. DNA sequence alignment and variant calling were then conducted with whole-genome sequencing (WGS) data from 387 IAZ using both the hg38 and modified hg38-NRS reference maps. Variant calling with the hg38-NRS map identified ∼50,000 single-nucleotide variants present in at least 5% of the WGS samples that were not detected with the hg38 reference map. We also directly assessed the NRSs positioned within genes. Seventeen NRSs anchored to regions including an identical 187 bp NRS found in both de novo assemblies. The NRS is located in HCN2 79 bp downstream of Exon 3 and contains several putative transcriptional regulatory elements. Genotyping of the HCN2-NRS revealed that the insertion is enriched in IAZ (minor allele frequency = 0.45) compared to other reference populations tested. This study shows that inclusion of population-specific NRSs can dramatically change the variant profile in underrepresented ethnic groups and thereby lead to the discovery of previously missed common variations.
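The comparison described in the abstract (variants carried by at least 5% of samples under one reference map but absent from calls made with the other) can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The function names, the data layout (a dictionary from variant to the set of carrier sample IDs), and the assumption that both call sets have already been placed in a common coordinate system are all hypothetical, and the toy numbers are invented for illustration only.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical variant key: (chromosome, position, ref allele, alt allele).
# Assumes both call sets were already lifted to one shared coordinate system.
Variant = Tuple[str, int, str, str]


def novel_common_variants(
    nrs_calls: Dict[Variant, Set[str]],
    hg38_calls: Dict[Variant, Set[str]],
    n_samples: int,
    min_fraction: float = 0.05,
) -> List[Variant]:
    """Return variants called only with the modified reference and carried
    by at least `min_fraction` of the samples."""
    if n_samples <= 0:
        raise ValueError("n_samples must be positive")
    novel = []
    for variant, carriers in nrs_calls.items():
        if variant in hg38_calls:
            continue  # already detected with the standard reference
        if len(carriers) / n_samples >= min_fraction:
            novel.append(variant)
    return sorted(novel)


def minor_allele_frequency(n_hom_ref: int, n_het: int, n_hom_alt: int) -> float:
    """Minor allele frequency of a biallelic site from diploid genotype counts."""
    total_alleles = 2 * (n_hom_ref + n_het + n_hom_alt)
    if total_alleles == 0:
        raise ValueError("no genotyped samples")
    alt_freq = (n_het + 2 * n_hom_alt) / total_alleles
    return min(alt_freq, 1.0 - alt_freq)


if __name__ == "__main__":
    # Toy data: 20 samples, invented for illustration.
    nrs = {
        ("chr1", 100, "A", "G"): {"s1", "s2"},        # 10% of samples, novel
        ("chr1", 200, "C", "T"): {"s1", "s2", "s3"},  # also found with hg38
    }
    hg38 = {("chr1", 200, "C", "T"): {"s1", "s2", "s3"}}
    print(novel_common_variants(nrs, hg38, n_samples=20))
    print(minor_allele_frequency(50, 40, 10))
```

Keying variants by position and alleles only works if the anchored NRS insertions do not shift coordinates between the two maps, or if calls are converted back to a shared coordinate system first. That step is assumed here and is not described in the abstract.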

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11384899
DOI: http://dx.doi.org/10.1093/gbe/evae188

Publication Analysis

Top Keywords

dna sequence: 12
novo genome: 8
genome assemblies: 8
assemblies indigenous: 8
indigenous americans: 8
americans arizona: 8
reference genome: 8
modified hg38-nrs: 8
hg38-nrs reference: 8
variant calling: 8

Similar Publications

Two novel yeast strains, NYNU 236247 and NYNU 23523, were isolated from the leaves of Hance, collected in the Tianchi Mountain National Forest Park, Henan Province, central China. Phylogenetic analysis of the D1/D2 domain of the large subunit rRNA gene and the internal transcribed spacer (ITS) region revealed the closest relatives of the strains are three described species: , and . The novel species differed from the type strains of these three species by 12 to 22 nucleotide substitutions and 1 gap (~2.

Small, obligately anaerobic strains 13CB8C, 13CB11C, 13CB18C and 13GAM1G were isolated from a faecal sample in a patient with Parkinson's disease with a history of duodenal resection. After conducting a comprehensive polyphasic taxonomic analysis including genomic analysis, we propose the establishment of one new genus and four new species. The novel bacteria are sp.

Motivation: Predicting RNA-binding proteins (RBPs) is central to understanding post-transcriptional regulatory mechanisms. Here, we introduce EnrichRBP, an automated and interpretable computational platform specifically designed for the comprehensive analysis of RBP interactions with RNA.

Results: EnrichRBP is a web service that enables researchers to develop original deep learning and machine learning architectures to explore the complex dynamics of RNA-binding proteins.

Helminths infection of Schizothorax niger in Kashmir, India: morphological and molecular characterization.

Mol Biol Rep

January 2025

Division of Animal Biotechnology, Faculty of Veterinary Sciences & Animal Husbandry, SKUAST-K, Srinagar, India.

Background: The identification of helminth parasites in Schizothorax spp. from Kashmir, including Schyzocotyle acheilognathi, Pomphorhynchus kashmirensis, and Adenoscolex oreini, is hindered by morphological limitations and high intraspecific variation. While previous studies have relied on morphological diagnosis, a comprehensive molecular characterization is lacking.

A fluorescent aptasensor was developed based on a target-induced hairpin conformation switch coupled with nicking enzyme-assisted signal amplification (NESA) to detect the oligomeric form of β-amyloid peptide (AβO) in cerebrospinal fluid. The hairpin DNA probe (HP) was specifically designed to recognize AβO. When AβO is present in the sensing system, it induces an HP conformational switch and triggers the NESA reaction.
