Haplotype-resolved or phased genome assembly provides a complete picture of genomes and their complex genetic variations. However, current algorithms for phased assembly either do not generate chromosome-scale phasing or require pedigree information, which limits their application. We present a method named diploid assembly (DipAsm) that uses long, accurate reads and long-range conformation data for single individuals to generate a chromosome-scale phased assembly within 1 day. Applied to four public human genomes, PGP1, HG002, NA12878 and HG00733, DipAsm produced haplotype-resolved assemblies with minimum contig length needed to cover 50% of the known genome (NG50) up to 25 Mb and phased ~99.5% of heterozygous sites at 98-99% accuracy, outperforming other approaches in terms of both contiguity and phasing completeness. We demonstrate the importance of chromosome-scale phased assemblies for the discovery of structural variants (SVs), including thousands of new transposon insertions, and of highly polymorphic and medically important regions such as the human leukocyte antigen (HLA) and killer cell immunoglobulin-like receptor (KIR) regions. DipAsm will facilitate high-quality precision medicine and studies of individual haplotype variation and population diversity.
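
The abstract defines NG50 as the minimum contig length needed to cover 50% of the known genome. As a minimal sketch of that definition (not code from DipAsm), the function below sorts contig lengths from longest to shortest and returns the length at which the running total first reaches half the genome size. The function name `ng50`, the toy contig lengths, and the choice to return 0 when the contigs never reach half the genome are assumptions made for illustration.

```python
from typing import Iterable


def ng50(contig_lengths: Iterable[int], genome_size: int) -> int:
    """Return the NG50 of an assembly.

    NG50 is the length of the shortest contig in the smallest set of
    longest contigs that together cover at least half of genome_size.
    Returns 0 (an assumed convention) if the contigs never reach that half.
    """
    half = genome_size / 2
    covered = 0
    # Walk contigs from longest to shortest, accumulating covered bases.
    for length in sorted(contig_lengths, reverse=True):
        covered += length
        if covered >= half:
            return length
    return 0


if __name__ == "__main__":
    # Toy numbers, not data from the paper.
    toy_contigs = [40, 30, 20, 10]
    print(ng50(toy_contigs, genome_size=120))  # 40 + 30 = 70 >= 60, so NG50 = 30
```

Unlike N50, which uses the total assembly length as the denominator, NG50 uses the known genome size, so values stay comparable across assemblies of the same genome.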

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954703
DOI: http://dx.doi.org/10.1038/s41587-020-0711-0

Similar Publications

This study evaluated influenza A virus (IAV) detection and genetic diversity over time, specifically at the human-swine interface in breeding and nursery farms. Active surveillance was performed monthly in five swine farms in the Midwest United States targeting the employees, the prewean piglets at sow farms, and the same cohort of piglets in downstream nurseries. In addition, information was collected at enrollment for each employee and farm to assess production management practices, IAV vaccination status, diagnostic procedures, and biosecurity.

Kaposi's sarcoma-associated herpesvirus (KSHV) is a double-stranded DNA gamma herpesvirus. Like other herpesviruses, KSHV establishes a latent infection with limited gene expression and occasionally enters the lytic replication phase, which produces progeny virions that infect neighboring cells. The KSHV genome encodes more than 80 open reading frames.

Bats are recognized as natural reservoirs for a diverse array of viruses, particularly coronaviruses, which have been linked to major human diseases such as SARS and MERS. These viruses are believed to have originated in bats, highlighting the role of bats in virus ecology and evolution. Our study focuses on the molecular characterization of bat-derived coronaviruses (CoVs) in Canada.

Rewriting Viral Fate: Epigenetic and Transcriptional Dynamics in KSHV Infection.

Viruses

November 2024

State Key Laboratory of Virology, College of Life Sciences, Wuhan University, Wuhan 430072, China.

Kaposi's sarcoma-associated herpesvirus (KSHV), a γ-herpesvirus, is predominantly associated with Kaposi's sarcoma (KS) as well as two lymphoproliferative disorders: primary effusion lymphoma (PEL) and multicentric Castleman disease (MCD). Like other herpesviruses, KSHV employs two distinct life cycles: latency and lytic replication. To establish a lifelong persistent infection, KSHV has evolved various strategies to manipulate the epigenetic machinery of the host.

The biting midge genus Culicoides Latreille, 1809 (Diptera: Ceratopogonidae) is highly relevant to epidemiology and public health, as it includes species that are potential vectors of human and animal arboviruses. The aim of this study was to investigate the presence of RNA viruses in species of the genus Culicoides collected in the Carajás mining complex in the state of Pará. The biting midges were collected in the municipalities of Canaã dos Carajás, Curionópolis and Marabá and morphologically identified.
