In recent years, improved sequencing technology and computational tools have made genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one or the other of the homologous chromosomes. Yet chromosomal phase information is invaluable for the understanding of phenotypic trait inheritance in the cases of compound heterozygosity, allele-specific expression or -acting variants. Here we use a combination of tools and sequencing technologies to generate a diploid assembly of the human primary cell line WI-38. First, data from PacBio single molecule sequencing and Bionano Genomics optical mapping were combined to generate an unphased assembly. Next, 10x Genomics linked reads were combined with the hybrid assembly to generate a partially phased assembly. Lastly, we developed and optimized methods to use short-read (Illumina) sequencing of flow cytometry-sorted metaphase chromosomes to provide phase information. The final genome assembly was almost fully (94%) phased with the addition of approximately 2.5-fold coverage of Illumina data from the sequenced metaphase chromosomes. The diploid nature of the final genome assembly improved the resolution of structural variants between the WI-38 genome and the human reference genome. The phased WI-38 sequence data are available for browsing and download at wi38.research.calicolabs.com. Our work shows that assembling a completely phased diploid genome from the DNA of a single individual is now readily achievable.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7466960 | PMC |
http://dx.doi.org/10.1534/g3.119.400995 | DOI Listing |
BMC Genom Data
January 2025
Key Laboratory of State Forestry and Grassland Administration Conservation and Utilization of Warm Temperate Zone Forest and Grass Germplasm Resources, Shandong Provincial Center of Forest and Grass Germplasm Resources, Ji'nan, 250103, Shandong, China.
Objectives: Toona sinensis, commonly known as Chinese toon, is a perennial woody plant with significant economic and ecological importance. This study employed whole-genome resequencing of 180 T. sinensis samples collected from Shandong to analyze genetic variation and diversity, ultimately identifying 18,231 high-quality SNPs after rigorous quality control and linkage disequilibrium pruning.
View Article and Find Full Text PDFBMC Plant Biol
January 2025
Institute of Tropical Horticulture Research, Hainan Academy of Agricultural Sciences, Haikou, 571100, China.
Background: Tea-oil Camellia within the genus Camellia is renowned for its premium Camellia oil, often described as "Oriental olive oil". So far, only one partial mitochondrial genomes of Tea-oil Camellia have been published (no main Tea-oil Camellia cultivars), and comparative mitochondrial genomic studies of Camellia remain limited.
Results: In this study, we first reconstructed the entire mitochondrial genome of C.
BMC Genomics
January 2025
Department of Food, Bioprocessing, & Nutrition Sciences, North Carolina State University, Raleigh, NC, USA.
Background: The advent of next generation sequencing technologies has enabled a surge in the number of whole genome sequences in public databases, and our understanding of the composition and evolution of bacterial genomes. Besides model organisms and pathogens, some attention has been dedicated to industrial bacteria, notably members of the Lactobacillaceae family that are commonly studied and formulated as probiotic bacteria. Of particular interest is Lactobacillus acidophilus NCFM, an extensively studied strain that has been widely commercialized for decades and is being used for the delivery of vaccines and therapeutics.
View Article and Find Full Text PDFNat Genet
January 2025
Center for Genomics, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, China.
Modern sugarcane, a highly allo-autopolyploid organism, has a very complex genome. In the present study, the karyotype and genome architecture of modern sugarcane were investigated, resulting in a genome assembly of 97 chromosomes (8.84 Gb).
View Article and Find Full Text PDFNat Commun
January 2025
Institute of Molecular Physiology, Shenzhen Bay Laboratory, Shenzhen, 518132, China.
Nucleosome is the basic structural unit of the genome. During processes like DNA replication and gene transcription, the conformation of nucleosomes undergoes dynamic changes, including DNA unwrapping and rewrapping, as well as histone disassembly and assembly. However, the wrapping characteristics of nucleosomes across the entire genome, including region-specificity and their correlation with higher-order chromatin organization, remains to be studied.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!