Publications by authors named "Conor Walker"

Recent massively-parallel approaches to decipher gene regulatory circuits have focused on the discovery of either -regulatory elements (CREs) or -acting factors. Here, we develop a scalable approach that pairs - and -regulatory CRISPR screens to systematically dissect how the key immune checkpoint is regulated. In human pancreatic ductal adenocarcinoma (PDAC) cells, we tile the locus using ∼25,000 CRISPR perturbations in constitutive and IFNγ-stimulated conditions.

View Article and Find Full Text PDF
Article Synopsis
  • The growth of publicly available single-cell datasets has greatly improved our understanding of biology, but it raises significant privacy issues.
  • Recent studies on data sharing have mainly focused on bulk gene expression data due to noise and a lack of large single-cell datasets.
  • Our research reveals that individuals in single-cell datasets are at risk of linking attacks that expose sensitive information, and we propose a method for predicting genotypes that operates independently of eQTLs, allowing for the discovery of private information across different studies.
View Article and Find Full Text PDF

Sequence simulators are fundamental tools in bioinformatics, as they allow us to test data processing and inference tools, and are an essential component of some inference methods. The ongoing surge in available sequence data is however testing the limits of our bioinformatics software. One example is the large number of SARS-CoV-2 genomes available, which are beyond the processing power of many methods, and simulating such large datasets is also proving difficult.

View Article and Find Full Text PDF

The COVID-19 pandemic has seen an unprecedented response from the sequencing community. Leveraging the sequence data from more than 140,000 SARS-CoV-2 genomes, we study mutation rates and selective pressures affecting the virus. Understanding the processes and effects of mutation and selection has profound implications for the study of viral evolution, for vaccine design, and for the tracking of viral spread.

View Article and Find Full Text PDF

Sequence simulators are fundamental tools in bioinformatics, as they allow us to test data processing and inference tools, as well as being part of some inference methods. The ongoing surge in available sequence data is however testing the limits of our bioinformatics software. One example is the large number of SARS-CoV-2 genomes available, which are beyond the processing power of many methods, and simulating such large datasets is also proving difficult.

View Article and Find Full Text PDF

Many complex genomic rearrangements arise through template switch errors, which occur in DNA replication when there is a transient polymerase switch to an alternate template nearby in three-dimensional space. While typically investigated at kilobase-to-megabase scales, the genomic and evolutionary consequences of this mutational process are not well characterised at smaller scales, where they are often interpreted as clusters of independent substitutions, insertions and deletions. Here we present an improved statistical approach using pair hidden Markov models, and use it to detect and describe short-range template switches underlying clusters of mutations in the multi-way alignment of hominid genomes.

View Article and Find Full Text PDF

The COVID-19 pandemic has seen an unprecedented response from the sequencing community. Leveraging the sequence data from more than 140,000 SARS-CoV-2 genomes, we study mutation rates and selective pressures affecting the virus. Understanding the processes and effects of mutation and selection has profound implications for the study of viral evolution, for vaccine design, and for the tracking of viral spread.

View Article and Find Full Text PDF

Since the start of the COVID-19 pandemic, an unprecedented number of genomic sequences of SARS-CoV-2 have been generated and shared with the scientific community. The unparalleled volume of available genetic data presents a unique opportunity to gain real-time insights into the virus transmission during the pandemic, but also a daunting computational hurdle if analyzed with gold-standard phylogeographic approaches. To tackle this practical limitation, we here describe and apply a rapid analytical pipeline to analyze the spatiotemporal dispersal history and dynamics of SARS-CoV-2 lineages.

View Article and Find Full Text PDF

The SARS-CoV-2 pandemic has led to unprecedented, nearly real-time genetic tracing due to the rapid community sequencing response. Researchers immediately leveraged these data to infer the evolutionary relationships among viral samples and to study key biological questions, including whether host viral genome editing and recombination are features of SARS-CoV-2 evolution. This global sequencing effort is inherently decentralized and must rely on data collected by many labs using a wide variety of molecular and bioinformatic techniques.

View Article and Find Full Text PDF