198 results match your criteria: "UC Santa Cruz Genomics Institute[Affiliation]"

Article Synopsis
  • * They generated over 427 million long-read sequences and found that longer, more accurate sequences yield better transcript detection, while increased read depth enhances quantification.
  • * The study suggests that using reference-based tools works best for well-annotated genomes and recommends incorporating extra data to better identify rare transcripts, providing a benchmark for improving transcriptome analysis techniques in the future.
View Article and Find Full Text PDF

Phased nanopore assembly with Shasta and modular graph phasing with GFAse.

Genome Res

April 2024

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA;

Reference-free genome phasing is vital for understanding allele inheritance and the impact of single-molecule DNA variation on phenotypes. To achieve thorough phasing across homozygous or repetitive regions of the genome, long-read sequencing technologies are often used to perform phased de novo assembly. As a step toward reducing the cost and complexity of this type of analysis, we describe new methods for accurately phasing Oxford Nanopore Technologies (ONT) sequence data with the Shasta genome assembler and a modular tool for extending phasing to the chromosome scale called GFAse.

View Article and Find Full Text PDF

Towards accurate indel calling for oncopanel sequencing through an international pipeline competition at precisionFDA.

Sci Rep

April 2024

Division of Bioinformatics and Biostatistics, Office of Research, National Center for Toxicological Research, Office of the Chief Scientist, Office of the Commissioner, U.S. Food and Drug Administration, Jefferson, AR, 72079, USA.

Accurately calling indels with next-generation sequencing (NGS) data is critical for clinical application. The precisionFDA team collaborated with the U.S.

View Article and Find Full Text PDF

Most current studies rely on short-read sequencing to detect somatic structural variation (SV) in cancer genomes. Long-read sequencing offers the advantage of better mappability and long-range phasing, which results in substantial improvements in germline SV detection. However, current long-read SV detection methods do not generalize well to the analysis of somatic SVs in tumor genomes with complex rearrangements, heterogeneity, and aneuploidy.

View Article and Find Full Text PDF
Article Synopsis
  • The study explores how transposable elements (TEs) contribute to the evolution of genome architecture across millions of years in eukaryotes.
  • TEs are linked to significant mutations that affect gene expression and lead to structural variations in genomes, showing distinct patterns in both non-pathogenic and pathogenic species.
  • The findings suggest that adaptive selection plays a role in shaping TE-compartmentalized genes, which may lead to a common genomic structure observed across various lineages.
View Article and Find Full Text PDF
Article Synopsis
  • Human centromeres are challenging to sequence due to their large size and repetitive nature, limiting our understanding of their variation and evolutionary function.
  • Using long-read sequencing, researchers completely sequenced and assembled all centromeres from a second human genome, revealing a significant increase in genetic variation and size differences between centromeres.
  • Comparative analysis of centromeric sequences across species, including humans and great apes, highlights the rapid evolution of α-satellite DNA and suggests limited recombination between chromosome arms, aiding in studying centromeric DNA evolution.
View Article and Find Full Text PDF

A Panoramic View of Cell Population Dynamics in Mammalian Aging.

bioRxiv

March 2024

Laboratory of Single Cell Genomics and Population Dynamics, The Rockefeller University, New York, NY, USA.

To elucidate the aging-associated cellular population dynamics throughout the body, here we present PanSci, a single-cell transcriptome atlas profiling over 20 million cells from 623 mouse tissue samples, encompassing a range of organs across different life stages, sexes, and genotypes. This comprehensive dataset allowed us to identify more than 3,000 unique cellular states and catalog over 200 distinct aging-associated cell populations experiencing significant depletion or expansion. Our panoramic analysis uncovered temporally structured, organ- and lineage-specific shifts of cellular dynamics during lifespan progression.

View Article and Find Full Text PDF

Structurally divergent and recurrently mutated regions of primate genomes.

Cell

March 2024

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA. Electronic address:

Article Synopsis
  • * We discovered over 1.3 million lineage-specific structural variants (SVs) that impact thousands of protein-coding genes and regulatory elements, revealing significant genomic differences among primates, especially compared to humans.
  • * Our research identified 1,607 regions with structural variations that are hotspots for gene loss and creation, indicating areas in the genome subject to rapid evolution and natural selection across primate species.
View Article and Find Full Text PDF

Spinal muscular atrophy (SMA) is a genetic disorder that causes progressive degeneration of lower motor neurons and the subsequent loss of muscle function throughout the body. It is the second most common recessive disorder in individuals of European descent and is present in all populations. Accurate tools exist for diagnosing SMA from genome sequencing data.

View Article and Find Full Text PDF

Thalamocortical organoids enable in vitro modeling of 22q11.2 microdeletion associated with neuropsychiatric disorders.

Cell Stem Cell

March 2024

Department of Anatomy, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Psychiatry and Behavioral Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA 94158, USA; Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA; Eli and Edythe Broad Center for Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, CA 94158, USA. Electronic address:

Thalamic dysfunction has been implicated in multiple psychiatric disorders. We sought to study the mechanisms by which abnormalities emerge in the context of the 22q11.2 microdeletion, which confers significant genetic risk for psychiatric disorders.

View Article and Find Full Text PDF

Fluorescent in situ hybridization (FISH) is a powerful method for the targeted visualization of nucleic acids in their native contexts. Recent technological advances have leveraged computationally designed oligonucleotide (oligo) probes to interrogate > 100 distinct targets in the same sample, pushing the boundaries of FISH-based assays. However, even in the most highly multiplexed experiments, repetitive DNA regions are typically not included as targets, as the computational design of specific probes against such regions presents significant technical challenges.

View Article and Find Full Text PDF

We present BATH, a tool for highly sensitive annotation of protein-coding DNA based on direct alignment of that DNA to a database of protein sequences or profile hidden Markov models (pHMMs). BATH is built on top of the HMMER3 code base, and simplifies the annotation workflow for pHMM-based annotation by providing a straightforward input interface and easy-to-interpret output. BATH also introduces novel frameshift-aware algorithms to detect frameshift-inducing nucleotide insertions and deletions (indels).

View Article and Find Full Text PDF

During future space missions, astronauts will be exposed to cosmic radiation and microgravity (μG), which are known to be health risk factors. To examine the differentially expressed genes (DEG) and their prevalent biological processes and pathways as a response to these two risk factors simultaneously, 1BR-hTERT human fibroblast cells were cultured under 1 gravity (1G) or simulated μG for 48 h in total and collected at 0 (sham irradiated), 3 or 24 h after 1 Gy of X-ray or Carbon-ion (C-ion) irradiation. A three-dimensional clinostat was used for the simulation of μG and the simultaneous radiation exposure of the samples.

View Article and Find Full Text PDF

Neuronal firing sequences are thought to be the basic building blocks of neural coding and information broadcasting within the brain. However, when sequences emerge during neurodevelopment remains unknown. We demonstrate that structured firing sequences are present in spontaneous activity of human brain organoids and neonatal brain slices from the murine somatosensory cortex.

View Article and Find Full Text PDF

Personalized Pangenome References.

bioRxiv

December 2023

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA.

Pangenomes, by including genetic diversity, should reduce reference bias by better representing new samples compared to them. Yet when comparing a new sample to a pangenome, variants in the pangenome that are not part of the sample can be misleading, for example, causing false read mappings. These irrelevant variants are generally rarer in terms of allele frequency, and have previously been dealt with using allele frequency filters.

View Article and Find Full Text PDF

Background: Phaeochromocytomas and paragangliomas (PPGLs) are rare neuroendocrine tumours. Pathogenic variants have been identified in more than 15 susceptibility genes; associated tumours are grouped into three Clusters, reinforced by their transcriptional profiles. Cluster 1A PPGLs have pathogenic variants affecting enzymes of the tricarboxylic acid cycle, including succinate dehydrogenase.

View Article and Find Full Text PDF
Article Synopsis
  • Noncoding DNA helps scientists understand how genes work and how they relate to diseases in humans.
  • Researchers studied the DNA of many primates to find specific regulatory parts that are important for gene regulation.
  • They discovered a lot of these regulatory elements in humans that are different from those in other mammals, which can help explain human traits and health issues.
View Article and Find Full Text PDF

Single-cell analysis of prenatal and postnatal human cortical development.

Science

October 2023

Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, CA 94143, USA.

Article Synopsis
  • * By using single-nucleus chromatin accessibility data, we mapped out the gene regulatory networks and transcription factors crucial for the development of different cortical lineages.
  • * Our findings linked certain cortical cell types to genetic risk factors for brain disorders like autism, revealing that female cells show greater susceptibility due to upregulated lineage-specific gene expression related to these risks.
View Article and Find Full Text PDF
Article Synopsis
  • Long-read sequencing technology is enhancing the detection of genetic variants in complex regions of the genome and facilitating quicker genetic diagnoses in clinical settings.
  • Newer third-generation sequencing platforms, such as those from PacBio and Oxford Nanopore, are rapidly advancing, but traditional variant calling methods struggle with increased data complexity.
  • The developed local haplotype approximation method improves variant calling accuracy and allows DeepVariant to work effectively across various long-read sequencing platforms.
View Article and Find Full Text PDF

Long-read sequencing technologies substantially overcome the limitations of short-reads but have not been considered as a feasible replacement for population-scale projects, being a combination of too expensive, not scalable enough or too error-prone. Here we develop an efficient and scalable wet lab and computational protocol, Napu, for Oxford Nanopore Technologies long-read sequencing that seeks to address those limitations. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the National Institutes of Health Center for Alzheimer's and Related Dementias.

View Article and Find Full Text PDF

DNA nanopores as artificial membrane channels for bioprotonics.

Nat Commun

September 2023

Department of Electrical and Computer Engineering, Jack Baskin School of Engineering, University of California, Santa Cruz, Santa Cruz, CA, 95064, USA.

Biological membrane channels mediate information exchange between cells and facilitate molecular recognition. While tuning the shape and function of membrane channels for precision molecular sensing via de-novo routes is complex, an even more significant challenge is interfacing membrane channels with electronic devices for signal readout, which results in low efficiency of information transfer - one of the major barriers to the continued development of high-performance bioelectronic devices. To this end, we integrate membrane spanning DNA nanopores with bioprotonic contacts to create programmable, modular, and efficient artificial ion-channel interfaces.

View Article and Find Full Text PDF

The complete sequence of a human Y chromosome.

Nature

September 2023

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region.

View Article and Find Full Text PDF
Article Synopsis
  • The Long-read RNA-Seq Genome Annotation Assessment Project (LRGASP) Consortium aimed to evaluate long-read sequencing for analyzing transcripts by generating over 427 million sequences from various species.
  • The findings highlighted that longer, accurate sequences yield better transcript identification, while increased read depth enhances quantification accuracy, particularly in well-annotated genomes.
  • The study serves as a benchmark for transcriptome analysis strategies and suggests using additional data for detecting rare transcripts or employing reference-free methods.
View Article and Find Full Text PDF

Indigenous peoples have cultivated biodiverse agroecosystems since time immemorial. The rise of metagenomics and high-throughput sequencing technologies in biodiversity studies has rapidly expanded the scale of data collection from these lands. A respectful approach to the data life cycle grounded in the sovereignty of indigenous communities is imperative to not perpetuate harm.

View Article and Find Full Text PDF

We completely sequenced and assembled all centromeres from a second human genome and used two reference sets to benchmark genetic, epigenetic, and evolutionary variation within centromeres from a diversity panel of humans and apes. We find that centromere single-nucleotide variation can increase by up to 4.1-fold relative to other genomic regions, with the caveat that up to 45.

View Article and Find Full Text PDF