Motivation: Nanopore sequencing current signal data can be 'basecalled' into sequence information or analysed directly, with the capacity to identify diverse molecular features, such as DNA/RNA base modifications and secondary structures. However, raw signal data is large and complex, and there is a need for improved visualization strategies to facilitate signal analysis, exploration and tool development.
Results: Squigualiser (Squiggle visualiser) is a toolkit for intuitive, interactive visualization of sequence-aligned signal data, which currently supports both DNA and RNA sequencing data from Oxford Nanopore Technologies instruments.
Pancreatic ductal adenocarcinoma (PDAC) is characterized by increasing fibrosis, which can enhance tumor progression and spread. Here, we undertook an unbiased temporal assessment of the matrisome of the highly metastatic KPC (, , ) and poorly metastatic KPC (, , ) genetically engineered mouse models of pancreatic cancer using mass spectrometry proteomics. Our assessment at early-, mid-, and late-stage disease reveals an increased abundance of nidogen-2 (NID2) in the KPC model compared to KPC, with further validation showing that NID2 is primarily expressed by cancer-associated fibroblasts (CAFs).
View Article and Find Full Text PDFBackground: Loss-of-function variants in MME (membrane metalloendopeptidase) are a known cause of recessive Charcot-Marie-Tooth Neuropathy (CMT). A deep intronic variant, MME c.1188+428A>G (NM_000902.
View Article and Find Full Text PDFThe hereditary cerebellar ataxias (HCAs) are rare, progressive neurologic disorders caused by variants in many different genes. Inheritance may follow autosomal dominant, autosomal recessive, X-linked or mitochondrial patterns. The list of genes associated with adult-onset cerebellar ataxia is continuously growing, with several new genes discovered in the last few years.
View Article and Find Full Text PDFIn silico simulation of high-throughput sequencing data is a technique used widely in the genomics field. However, there is currently a lack of effective tools for creating simulated data from nanopore sequencing devices, which measure DNA or RNA molecules in the form of time-series current signal data. Here, we introduce Squigulator, a fast and simple tool for simulation of realistic nanopore signal data.
View Article and Find Full Text PDFBackground: As adoption of nanopore sequencing technology continues to advance, the need to maintain large volumes of raw current signal data for reanalysis with updated algorithms is a growing challenge. Here we introduce slow5curl, a software package designed to streamline nanopore data sharing, accessibility, and reanalysis.
Results: Slow5curl allows a user to fetch a specified read or group of reads from a raw nanopore dataset stored on a remote server, such as a public data repository, without downloading the entire file.
The expression of genes encompasses their transcription into mRNA followed by translation into protein. In recent years, next-generation sequencing and mass spectrometry methods have profiled DNA, RNA and protein abundance in cells. However, there are currently no reference standards that are compatible across these genomic, transcriptomic and proteomic methods, and provide an integrated measure of gene expression.
View Article and Find Full Text PDFDNA methylation (5mC) is a repressive gene regulatory mark widespread in vertebrate genomes, yet the developmental dynamics in which 5mC patterns are established vary across species. While mammals undergo two rounds of global 5mC erasure, teleosts, for example, exhibit localized maternal-to-paternal 5mC remodeling. Here, we studied 5mC dynamics during the embryonic development of sea lamprey, a jawless vertebrate which occupies a critical phylogenetic position as the sister group of the jawed vertebrates.
View Article and Find Full Text PDFShort tandem repeats (STRs) are highly polymorphic sequences throughout the human genome that are composed of repeated copies of a 1-6-bp motif. Over 1 million variable STR loci are known, some of which regulate gene expression and influence complex traits, such as height. Moreover, variants in at least 60 STR loci cause genetic disorders, including Huntington disease and fragile X syndrome.
View Article and Find Full Text PDFPurpose: Genome sequencing (GS)-specific diagnostic rates in prospective tightly ascertained exome sequencing (ES)-negative intellectual disability (ID) cohorts have not been reported extensively.
Methods: ES, GS, epigenetic signatures, and long-read sequencing diagnoses were assessed in 74 trios with at least moderate ID.
Results: The ES diagnostic yield was 42 of 74 (57%).
Indigenous Australians harbour rich and unique genomic diversity. However, Aboriginal and Torres Strait Islander ancestries are historically under-represented in genomics research and almost completely missing from reference datasets. Addressing this representation gap is critical, both to advance our understanding of global human genomic diversity and as a prerequisite for ensuring equitable outcomes in genomic medicine.
View Article and Find Full Text PDFBackground: Sea snakes underwent a complete transition from land to sea within the last ~ 15 million years, yet they remain a conspicuous gap in molecular studies of marine adaptation in vertebrates.
Results: Here, we generate four new annotated sea snake genomes, three of these at chromosome-scale (Hydrophis major, H. ornatus and H.
Synthetic chromosome engineering is a complex process due to the need to identify and repair growth defects and deal with combinatorial gene essentiality when rearranging chromosomes. To alleviate these issues, we have demonstrated novel approaches for repairing and rearranging synthetic genomes. We have designed, constructed, and restored wild-type fitness to a synthetic 753,096-bp version of chromosome XIV as part of the Synthetic Yeast Genome project.
View Article and Find Full Text PDFChimeric antigen receptor (CAR) T cell therapy is effective in treating B cell malignancies, but factors influencing the persistence of functional CAR T cells, such as product composition, patients' lymphodepletion, and immune reconstitution, are not well understood. To shed light on this issue, here we conduct a single-cell multi-omics analysis of transcriptional, clonal, and phenotypic profiles from pre- to 1-month post-infusion of CAR and CAR T cells from patients from a CARTELL study (ACTRN12617001579381) who received a donor-derived 4-1BB CAR product targeting CD19. Following infusion, CAR T cells and CAR T cells shows similar differentiation profiles with clonally expanded populations across heterogeneous phenotypes, demonstrating clonal lineages and phenotypic plasticity.
View Article and Find Full Text PDFAutosomal dominant polycystic kidney disease (ADPKD) is the most common monogenic cause of kidney failure and is primarily associated with PKD1 or PKD2. Approximately 10% of patients remain undiagnosed after standard genetic testing. We aimed to utilise short and long-read genome sequencing and RNA studies to investigate undiagnosed families.
View Article and Find Full Text PDFMotivation: Nanopore sequencing is emerging as a key pillar in the genomic technology landscape but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as 'basecalling', is a major friction in any nanopore sequencing workflow. Here, we exploit the advantages of the recently developed signal data format 'SLOW5' to streamline and accelerate nanopore basecalling on high-performance computing (HPC) and cloud environments.
View Article and Find Full Text PDF