Publications by Pier Francesco Palamara

Publications by authors named "Pier Francesco Palamara"

Page 1 of 2

Social and genetic diversity in first farmers of central Europe.

Pere Gelabert Penny Bickle Daniela Hofmann Maria Teschler-Nicola Alexandra Anders Pier Francesco Palamara

Nat Hum Behav

November 2024

Article Synopsis

The Linearbandkeramik (LBK) Neolithic communities were pioneers in spreading agriculture across Europe and this study showcases genetic data from 250 individuals to understand their ancestry.
The findings reveal a notable difference in ancestry, with eastern LBK sites having a higher percentage of western hunter-gatherer genetics compared to western sites, indicating separate genetic paths for these groups.
Additionally, the research suggests a patrilocal social structure, featuring more genetic connections among males within sites, and points out that the massacre at Asparn-Schletz likely involved individuals from a large population rather than a small community.

View Article and Find Full Text PDF

A Genomics England haplotype reference panel and imputation of UK Biobank.

Sinan Shi Simone Rubinacci Sile Hu Loukas Moutsianas Alex Stuckey Pier Francesco Palamara

Nat Genet

September 2024

We built a reference panel with 342 million autosomal variants using 78,195 individuals from the Genomics England (GEL) dataset, achieving a phasing switch error rate of 0.18% for European samples and imputation quality of r = 0.75 for variants with minor allele frequencies as low as 2 × 10 in white British samples.

View Article and Find Full Text PDF

The Genetic Origin of the Indo-Europeans.

Iosif Lazaridis Nick Patterson David Anthony Leonid Vyazov Romain Fournier Pier Francesco Palamara

bioRxiv

April 2024

The Yamnaya archaeological complex appeared around 3300BCE across the steppes north of the Black and Caspian Seas, and by 3000BCE reached its maximal extent from Hungary in the west to Kazakhstan in the east. To localize the ancestral and geographical origins of the Yamnaya among the diverse Eneolithic people that preceded them, we studied ancient DNA data from 428 individuals of which 299 are reported for the first time, demonstrating three previously unknown Eneolithic genetic clines. First, a "Caucasus-Lower Volga" (CLV) Cline suffused with Caucasus hunter-gatherer (CHG) ancestry extended between a Caucasus Neolithic southern end in Neolithic Armenia, and a steppe northern end in Berezhnovka in the Lower Volga.

View Article and Find Full Text PDF

Genome-wide classification of epigenetic activity reveals regions of enriched heritability in immune-related traits.

Miriam Stricker Weijiao Zhang Wei-Yi Cheng Steven Gazal Calliope Dendrou Pier Francesco Palamara

Cell Genom

March 2024

View Article and Find Full Text PDF

Genome-wide classification of epigenetic activity reveals regions of enriched heritability in immune-related traits.

Miriam Stricker Weijiao Zhang Wei-Yi Cheng Steven Gazal Calliope Dendrou Pier Francesco Palamara

Cell Genom

January 2024

Epigenetics underpins the regulation of genes known to play a key role in the adaptive and innate immune system (AIIS). We developed a method, EpiNN, that leverages epigenetic data to detect AIIS-relevant genomic regions and used it to detect 2,765 putative AIIS loci. Experimental validation of one of these loci, DNMT1, provided evidence for a novel AIIS-specific transcription start site.

View Article and Find Full Text PDF

Haplotype-based inference of recent effective population size in modern and ancient DNA samples.

Romain Fournier Zoi Tsangalidou David Reich Pier Francesco Palamara

Nat Commun

December 2023

Individuals sharing recent ancestors are likely to co-inherit large identical-by-descent (IBD) genomic regions. The distribution of these IBD segments in a population may be used to reconstruct past demographic events such as effective population size variation, but accurate IBD detection is difficult in ancient DNA data and in underrepresented populations with limited reference data. In this work, we introduce an accurate method for inferring effective population size variation during the past ~2000 years in both modern and ancient DNA data, called HapNe.

View Article and Find Full Text PDF

Inference of Coalescence Times and Variant Ages Using Convolutional Neural Networks.

Juba Nait Saada Zoi Tsangalidou Miriam Stricker Pier Francesco Palamara

Mol Biol Evol

October 2023

Accurate inference of the time to the most recent common ancestor (TMRCA) between pairs of individuals and of the age of genomic variants is key in several population genetic analyses. We developed a likelihood-free approach, called CoalNN, which uses a convolutional neural network to predict pairwise TMRCAs and allele ages from sequencing or SNP array data. CoalNN is trained through simulation and can be adapted to varying parameters, such as demographic history, using transfer learning.

View Article and Find Full Text PDF

HAPNEST: efficient, large-scale generation and evaluation of synthetic datasets for genotypes and phenotypes.

Sophie Wharrie Zhiyu Yang Vishnu Raj Remo Monti Rahul Gupta Pier Francesco Palamara

Bioinformatics

September 2023

Motivation: Existing methods for simulating synthetic genotype and phenotype datasets have limited scalability, constraining their usability for large-scale analyses. Moreover, a systematic approach for evaluating synthetic data quality and a benchmark synthetic dataset for developing and evaluating methods for polygenic risk scores are lacking.

Results: We present HAPNEST, a novel approach for efficiently generating diverse individual-level genotypic and phenotypic data.

View Article and Find Full Text PDF

Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits.

Brian C Zhang Arjun Biddanda Árni Freyr Gunnarsson Fergus Cooper Pier Francesco Palamara

Nat Genet

May 2023

Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping array data, as well as strategies to utilize genealogies to perform association and other complex trait analyses. We use these methods to build genome-wide genealogies using genotyping data for 337,464 UK Biobank individuals and test for association across seven complex traits.

View Article and Find Full Text PDF

Cell-free DNA TAPS provides multimodal information for early cancer detection.

Paulina Siejka-Zielińska Jingfei Cheng Felix Jackson Yibin Liu Zahir Soonawalla Pier Francesco Palamara

Sci Adv

September 2021

Multimodal, genome-wide characterization of epigenetic and genetic information in circulating cell-free DNA (cfDNA) could enable more sensitive early cancer detection, but it is technologically challenging. Recently, we developed TET-assisted pyridine borane sequencing (TAPS), which is a mild, bisulfite-free method for base-resolution direct DNA methylation sequencing. Here, we optimized TAPS for cfDNA (cfTAPS) to provide high-quality and high-depth whole-genome cell-free methylomes.

View Article and Find Full Text PDF

Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations.

Juba Nait Saada Georgios Kalantzis Derek Shyr Fergus Cooper Martin Robinson Pier Francesco Palamara

Nat Commun

November 2020

Detection of Identical-By-Descent (IBD) segments provides a fundamental measure of genetic relatedness and plays a key role in a wide range of analyses. We develop FastSMC, an IBD detection algorithm that combines a fast heuristic search with accurate coalescent-based likelihood calculations. FastSMC enables biobank-scale detection and dating of IBD segments within several thousands of years in the past.

View Article and Find Full Text PDF

Genome-Wide Natural Selection Signatures Are Linked to Genetic Risk of Modern Phenotypes in the Japanese Population.

Yoshiaki Yasumizu Saori Sakaue Takahiro Konuma Ken Suzuki Koichi Matsuda Pier Francesco Palamara

Mol Biol Evol

May 2020

Elucidation of natural selection signatures and relationships with phenotype spectra is important to understand adaptive evolution of modern humans. Here, we conducted a genome-wide scan of selection signatures of the Japanese population by estimating locus-specific time to the most recent common ancestor using the ascertained sequentially Markovian coalescent (ASMC), from the biobank-based large-scale genome-wide association study data of 170,882 subjects. We identified 29 genetic loci with selection signatures satisfying the genome-wide significance.

View Article and Find Full Text PDF

Author Correction: Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection.

Steven Gazal Hilary K Finucane Nicholas A Furlotte Po-Ru Loh Pier Francesco Palamara

Nat Genet

August 2019

In the version of the paper initially published, information on competing interests for author Benjamin M. Neale was missing. The 'Competing interests' statement should have included the sentence 'B.

View Article and Find Full Text PDF

Detecting genome-wide directional effects of transcription factor binding on polygenic disease risk.

Yakir A Reshef Hilary K Finucane David R Kelley Alexander Gusev Dylan Kotliar Pier Francesco Palamara

Nat Genet

October 2018

Biological interpretation of genome-wide association study data frequently involves assessing whether SNPs linked to a biological process, for example, binding of a transcription factor, show unsigned enrichment for disease signal. However, signed annotations quantifying whether each SNP allele promotes or hinders the biological process can enable stronger statements about disease mechanism. We introduce a method, signed linkage disequilibrium profile regression, for detecting genome-wide directional effects of signed functional annotations on disease risk.

View Article and Find Full Text PDF

High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability.

Pier Francesco Palamara Jonathan Terhorst Yun S Song Alkes L Price

Nat Genet

September 2018

Interest in reconstructing demographic histories has motivated the development of methods to estimate locus-specific pairwise coalescence times from whole-genome sequencing data. Here we introduce a powerful new method, ASMC, that can estimate coalescence times using only SNP array data, and is orders of magnitude faster than previous approaches. We applied ASMC to detect recent positive selection in 113,851 phased British samples from the UK Biobank, and detected 12 genome-wide significant signals, including 6 novel loci.

View Article and Find Full Text PDF

Insights into clonal haematopoiesis from 8,342 mosaic chromosomal alterations.

Po-Ru Loh Giulio Genovese Robert E Handsaker Hilary K Finucane Yakir A Reshef Pier Francesco Palamara

Nature

July 2018

The selective pressures that shape clonal evolution in healthy individuals are largely unknown. Here we investigate 8,342 mosaic chromosomal alterations, from 50 kb to 249 Mb long, that we uncovered in blood-derived DNA from 151,202 UK Biobank participants using phase-based computational techniques (estimated false discovery rate, 6-9%). We found six loci at which inherited variants associated strongly with the acquisition of deletions or loss of heterozygosity in cis.

View Article and Find Full Text PDF

Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection.

Steven Gazal Hilary K Finucane Nicholas A Furlotte Po-Ru Loh Pier Francesco Palamara

Nat Genet

October 2017

Recent work has hinted at the linkage disequilibrium (LD)-dependent architecture of human complex traits, where SNPs with low levels of LD (LLD) have larger per-SNP heritability. Here we analyzed summary statistics from 56 complex traits (average N = 101,401) by extending stratified LD score regression to continuous annotations. We determined that SNPs with low LLD have significantly larger per-SNP heritability and that roughly half of this effect can be explained by functional annotations negatively correlated with LLD, such as DNase I hypersensitivity sites (DHSs).

View Article and Find Full Text PDF

Reference-based phasing using the Haplotype Reference Consortium panel.

Po-Ru Loh Petr Danecek Pier Francesco Palamara Christian Fuchsberger Yakir A Reshef

Nat Genet

November 2016

Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing.

View Article and Find Full Text PDF

ARGON: fast, whole-genome simulation of the discrete time Wright-fisher process.

Pier Francesco Palamara

Bioinformatics

October 2016

Motivation: Simulation under the coalescent model is ubiquitous in the analysis of genetic data. The rapid growth of real data sets from multiple human populations led to increasing interest in simulating very large sample sizes at whole-chromosome scales. When the sample size is large, the coalescent model becomes an increasingly inaccurate approximation of the discrete time Wright-Fisher model (DTWF).

View Article and Find Full Text PDF

Fast and accurate long-range phasing in a UK Biobank cohort.

Po-Ru Loh Pier Francesco Palamara Alkes L Price

Nat Genet

July 2016

Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here we develop a fast and accurate LRP method, Eagle, that extends this paradigm to populations with much smaller proportions of genotyped samples by harnessing long (>4-cM) identical-by-descent (IBD) tracts shared among distantly related individuals. We applied Eagle to N ≈ 150,000 samples (0.

View Article and Find Full Text PDF

Leveraging Distant Relatedness to Quantify Human Mutation and Gene-Conversion Rates.

Pier Francesco Palamara Laurent C Francioli Peter R Wilton Giulio Genovese Alexander Gusev

Am J Hum Genet

December 2015

The rate at which human genomes mutate is a central biological parameter that has many implications for our ability to understand demographic and evolutionary phenomena. We present a method for inferring mutation and gene-conversion rates by using the number of sequence differences observed in identical-by-descent (IBD) segments together with a reconstructed model of recent population-size history. This approach is robust to, and can quantify, the presence of substantial genotyping error, as validated in coalescent simulations.

View Article and Find Full Text PDF

Inference of historical migration rates via haplotype sharing.

Pier Francesco Palamara Itsik Pe'er

Bioinformatics

July 2013

Summary: Pairs of individuals from a study cohort will often share long-range haplotypes identical-by-descent. Such haplotypes are transmitted from common ancestors that lived tens to hundreds of generations in the past, and they can now be efficiently detected in high-resolution genomic datasets, providing a novel source of information in several domains of genetic analysis. Recently, haplotype sharing distributions were studied in the context of demographic inference, and they were used to reconstruct recent demographic events in several populations.

View Article and Find Full Text PDF

The variance of identity-by-descent sharing in the Wright-Fisher model.

Shai Carmi Pier Francesco Palamara Vladimir Vacic Todd Lencz Ariel Darvasi

Genetics

March 2013

Widespread sharing of long, identical-by-descent (IBD) genetic segments is a hallmark of populations that have experienced recent genetic drift. Detection of these IBD segments has recently become feasible, enabling a wide range of applications from phasing and imputation to demographic inference. Here, we study the distribution of IBD sharing in the Wright-Fisher model.

View Article and Find Full Text PDF

Length distributions of identity by descent reveal fine-scale demographic history.

Pier Francesco Palamara Todd Lencz Ariel Darvasi Itsik Pe'er

Am J Hum Genet

November 2012

Data-driven studies of identity by descent (IBD) were recently enabled by high-resolution genomic data from large cohorts and scalable algorithms for IBD detection. Yet, haplotype sharing currently represents an underutilized source of information for population-genetics research. We present analytical results on the relationship between haplotype sharing across purportedly unrelated individuals and a population's demographic history.

View Article and Find Full Text PDF

The architecture of long-range haplotypes shared within and across populations.

Alexander Gusev Pier Francesco Palamara Gregory Aponte Zhong Zhuang Ariel Darvasi

Mol Biol Evol

February 2012

Homologous long segments along the genomes of close or remote relatives that are identical by descent (IBD) from a common ancestor provide clues for recent events in human genetics. We set out to extensively map such IBD segments in large cohorts and investigate their distribution within and across different populations. We report analysis of several data sets, demonstrating that IBD is more common than expected by naïve models of population genetics.

View Article and Find Full Text PDF