In response to the escalating SARS-CoV-2 pandemic, in March 2020 the COVID-19 Genomics UK (COG-UK) consortium was established to enable national-scale genomic surveillance in the UK. By the end of 2020, 49% of all SARS-CoV-2 genome sequences globally had been generated as part of the COG-UK programme, and to date, this system has generated >3 million SARS-CoV-2 genomes. Rapidly and reliably analysing this unprecedented number of genomes was an enormous challenge. To fulfil this need and to inform public health decision-making, we developed a centralized pipeline that performs quality control, alignment, and variant calling and provides the global phylogenetic context of sequences. We present this pipeline and describe how we tailored it as the pandemic progressed to scale with the increasing amounts of data and to provide the most relevant analyses on a daily basis.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529618PMC
http://dx.doi.org/10.1093/ve/veae083DOI Listing

Publication Analysis

Top Keywords

variant calling
8
phylogenetics variant
4
calling pipeline
4
pipeline support
4
sars-cov-2
4
support sars-cov-2
4
sars-cov-2 genomic
4
genomic epidemiology
4
epidemiology response
4
response escalating
4

Similar Publications

Primary ciliary dyskinesia (PCD, OMIM 244400) is a rare genetic disorder that affects motile cilia and is characterised by impaired mucociliary clearance of the airway epithelium, which results in chronic upper and lower airway infections. While short-read next-generation sequencing technology has been used for the genetic testing of PCD, its effectiveness is limited in identifying variants in the gene because of the nearly identical pseudogene As we confirmed that the gene was not expressed in airway cells, we obtained nasal mucosa biopsy specimens for total RNA sequencing (RNA-seq) with library enrichment using exome oligos. Among the 34 nasal samples from patients suspected of having PCD, three aberrant splicing patterns in were identified in two samples.

View Article and Find Full Text PDF

Variant calling using long-read RNA sequencing (lrRNA-seq) can be applied to diverse tasks, such as capturing full-length isoforms and gene expression profiling. It poses challenges, however, due to higher error rates than DNA data, the complexities of transcript diversity, RNA editing events, etc. In this paper, we propose Clair3-RNA, the first deep learning-based variant caller tailored for lrRNA-seq data.

View Article and Find Full Text PDF

Background And Aims: Familial hypercholesterolemia (FH) and other disorders with similar features are common genetic disorders that remain underdiagnosed and undertreated, due in part to the cost of screening. The aim of this study was to design and implement a whole gene targeted NGS panel for the molecular diagnosis of FH and statin intolerance with an emphasis on high quality variant calling, including copy number analysis.

Methods: A whole gene panel for hybridisation-based short read NGS was designed for the dominant FH-genes low density lipoprotein receptor (), apolipoprotein B (APOB), proproteinconvertas subtilisin/kexin type 9 (PCSK9), apolipoprotein E (APOE) and the recessive FH-genes low density lipoprotein receptor adaptor protein 1 (), ATP binding cassette subfamily member 5/8 (ABCG5/8) and lipase A, lysosomal acid type (), as well as solute carrier organic anion transporter family member 1B1 (), not an FH gene but linked to statin intolerance.

View Article and Find Full Text PDF

Genotypic and phenotypic diversity of Mycobacterium tuberculosis strains from eastern India.

Infect Genet Evol

January 2025

Immunogenomics & Systems Biology group, Institute of Life Sciences (ILS), Bhubaneswar, Odisha, India; School of Biotechnology, Kalinga Institute of Industrial Technology (KIIT), Bhubaneswar, Odisha, India. Electronic address:

Whole genome sequencing has been used to investigate the genomic diversity of M. tuberculosis in the northern and southern states of India, but information about the eastern part of the country is still limited. Through a sequencing-based strategy, this study seeks to comprehend the diversity and drug resistance pattern in the eastern region.

View Article and Find Full Text PDF

Applying artificial intelligence to uncover the genetic landscape of coagulation factors.

J Thromb Haemost

January 2025

Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Milan, Italy; IRCCS Humanitas Research Hospital - via Manzoni 56, 20089 Rozzano, Milan, Italy. Electronic address:

Artificial intelligence (AI) is rapidly advancing our ability to identify and interpret genetic variants associated with coagulation factor deficiencies. This review introduces AI, with a specific focus on machine learning (ML) methods, and examines its applications in the field of coagulation genetics over the past decade. We observed a significant increase in AI-related publications, with a focus on hemophilia A and B.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!