Publications by Babak Alipanahi

Publications by authors named "Babak Alipanahi"

Page 1 of 1

Deep generative AI models analyzing circulating orphan non-coding RNAs enable detection of early-stage lung cancer.

Mehran Karimzadeh Amir Momen-Roknabadi Taylor B Cavazos Yuqi Fang Nae-Chyun Chen Babak Alipanahi

Nat Commun

November 2024

Article Synopsis

* The study utilizes a generative AI model called Orion to analyze blood samples from 1,050 individuals with non-small cell lung cancer (NSCLC) and matched controls, focusing on orphan non-coding RNAs.
* Orion significantly outperforms traditional methods, achieving 94% sensitivity and 87% specificity in cancer detection, and shows over 30% higher sensitivity on validation datasets compared to other approaches.

View Article and Find Full Text PDF

Inference of chronic obstructive pulmonary disease with deep learning on raw spirograms identifies new genetic loci and improves risk models.

Justin Cosentino Babak Behsaz Babak Alipanahi Zachary R McCaw Davin Hill

Nat Genet

May 2023

Chronic obstructive pulmonary disease (COPD), the third leading cause of death worldwide, is highly heritable. While COPD is clinically defined by applying thresholds to summary measures of lung function, a quantitative liability score has more power to identify genetic signals. Here we train a deep convolutional neural network on noisy self-reported and International Classification of Diseases labels to predict COPD case-control status from high-dimensional raw spirograms and use the model's predictions as a liability score.

View Article and Find Full Text PDF

DeepNull models non-linear covariate effects to improve phenotypic prediction and association power.

Zachary R McCaw Thomas Colthurst Taedong Yun Nicholas A Furlotte Andrew Carroll Babak Alipanahi

Nat Commun

January 2022

Genome-wide association studies (GWASs) examine the association between genotype and phenotype while adjusting for a set of covariates. Although the covariates may have non-linear or interactive effects, due to the challenge of specifying the model, GWAS often neglect such terms. Here we introduce DeepNull, a method that identifies and adjusts for non-linear and interactive covariate effects using a deep neural network.

View Article and Find Full Text PDF

Large-scale machine-learning-based phenotyping significantly improves genomic discovery for optic nerve head morphology.

Babak Alipanahi Farhad Hormozdiari Babak Behsaz Justin Cosentino Zachary R McCaw

Am J Hum Genet

July 2021

Genome-wide association studies (GWASs) require accurate cohort phenotyping, but expert labeling can be costly, time intensive, and variable. Here, we develop a machine learning (ML) model to predict glaucomatous optic nerve head features from color fundus photographs. We used the model to predict vertical cup-to-disc ratio (VCDR), a diagnostic parameter and cardinal endophenotype for glaucoma, in 65,680 Europeans in the UK Biobank (UKB).

View Article and Find Full Text PDF

Genomewide Association Studies of LRRK2 Modifiers of Parkinson's Disease.

Dongbing Lai Babak Alipanahi Pierre Fontanillas Tae-Hwi Schwantes-An Jan Aasly

Ann Neurol

July 2021

Objective: The aim of this study was to search for genes/variants that modify the effect of LRRK2 mutations in terms of penetrance and age-at-onset of Parkinson's disease.

Methods: We performed the first genomewide association study of penetrance and age-at-onset of Parkinson's disease in LRRK2 mutation carriers (776 cases and 1,103 non-cases at their last evaluation). Cox proportional hazard models and linear mixed models were used to identify modifiers of penetrance and age-at-onset of LRRK2 mutations, respectively.

View Article and Find Full Text PDF

Author Correction: The effect of LRRK2 loss-of-function variants in humans.

Nicola Whiffin Irina M Armean Aaron Kleinman Jamie L Marshall Eric V Minikel Babak Alipanahi

Nat Med

February 2021

View Article and Find Full Text PDF

Disease risk scores for skin cancers.

Pierre Fontanillas Babak Alipanahi Nicholas A Furlotte Michaela Johnson Catherine H Wilson

Nat Commun

January 2021

We trained and validated risk prediction models for the three major types of skin cancer- basal cell carcinoma (BCC), squamous cell carcinoma (SCC), and melanoma-on a cross-sectional and longitudinal dataset of 210,000 consented research participants who responded to an online survey covering personal and family history of skin cancer, skin susceptibility, and UV exposure. We developed a primary disease risk score (DRS) that combined all 32 identified genetic and non-genetic risk factors. Top percentile DRS was associated with an up to 13-fold increase (odds ratio per standard deviation increase >2.

View Article and Find Full Text PDF

The effect of LRRK2 loss-of-function variants in humans.

Nicola Whiffin Irina M Armean Aaron Kleinman Jamie L Marshall Eric V Minikel Babak Alipanahi

Nat Med

June 2020

Human genetic variants predicted to cause loss-of-function of protein-coding genes (pLoF variants) provide natural in vivo models of human gene inactivation and can be valuable indicators of gene function and the potential toxicity of therapeutic inhibitors targeting these genes. Gain-of-kinase-function variants in LRRK2 are known to significantly increase the risk of Parkinson's disease, suggesting that inhibition of LRRK2 kinase activity is a promising therapeutic strategy. While preclinical studies in model organisms have raised some on-target toxicity concerns, the biological consequences of LRRK2 inhibition have not been well characterized in humans.

View Article and Find Full Text PDF

The Parkinson's phenome-traits associated with Parkinson's disease in a broadly phenotyped cohort.

Karl Heilbron Alastair J Noyce Pierre Fontanillas Babak Alipanahi Mike A Nalls

NPJ Parkinsons Dis

March 2019

In order to systematically describe the Parkinson's disease phenome, we performed a series of 832 cross-sectional case-control analyses in a large database. Responses to 832 online survey-based phenotypes including diseases, medications, and environmental exposures were analyzed in 23andMe research participants. For each phenotype, survey respondents were used to construct a cohort of Parkinson's disease cases and age-matched and sex-matched controls, and an association test was performed using logistic regression.

View Article and Find Full Text PDF

Correspondence between cerebral glucose metabolism and BOLD reveals relative power and cost in human brain.

Ehsan Shokri-Kojori Dardo Tomasi Babak Alipanahi Corinde E Wiers Gene-Jack Wang

Nat Commun

February 2019

The correspondence between cerebral glucose metabolism (indexing energy utilization) and synchronous fluctuations in blood oxygenation (indexing neuronal activity) is relevant for neuronal specialization and is affected by brain disorders. Here, we define novel measures of relative power (rPWR, extent of concurrent energy utilization and activity) and relative cost (rCST, extent that energy utilization exceeds activity), derived from FDG-PET and fMRI. We show that resting-state networks have distinct energetic signatures and that brain could be classified into major bilateral segments based on rPWR and rCST.

View Article and Find Full Text PDF

Does conservation account for splicing patterns?

Michael Wainberg Babak Alipanahi Brendan Frey

BMC Genomics

October 2016

Background: Alternative mRNA splicing is critical to proteomic diversity and tissue and species differentiation. Exclusion of cassette exons, also called exon skipping, is the most common type of alternative splicing in mammals.

Results: We present a computational model that predicts absolute (though not tissue-differential) percent-spliced-in of cassette exons more accurately than previous models, despite not using any 'hand-crafted' biological features such as motif counts.

View Article and Find Full Text PDF

Genome-wide characteristics of mutations in autism.

Ryan K C Yuen Daniele Merico Hongzhi Cao Giovanna Pellecchia Babak Alipanahi

NPJ Genom Med

August 2016

mutations (DNMs) are important in Autism Spectrum Disorder (ASD), but so far analyses have mainly been on the ~1.5% of the genome encoding genes. Here, we performed whole genome sequencing (WGS) of 200 ASD parent-child trios and characterized germline and somatic DNMs.

View Article and Find Full Text PDF

Whole Genome Sequencing Expands Diagnostic Utility and Improves Clinical Management in Pediatric Medicine.

Dimitri J Stavropoulos Daniele Merico Rebekah Jobling Sarah Bowdin Nasim Monfared Babak Alipanahi

NPJ Genom Med

January 2016

Article Synopsis

* WGS not only detected all rare clinically significant copy number variations (CNVs) found by CMA but also identified additional mutations (indels and missense) in several patients, some with multiple genetic disorders.
* The findings suggest that WGS should be considered as a primary test in clinical settings, as it offers higher diagnostic rates and could streamline the process for obtaining genetic diagnoses and facilitating genetic counseling

View Article and Find Full Text PDF

Whole-Genome Sequencing Suggests Schizophrenia Risk Mechanisms in Humans with 22q11.2 Deletion Syndrome.

Daniele Merico Mehdi Zarrei Gregory Costain Lucas Ogura Babak Alipanahi

G3 (Bethesda)

September 2015

Chromosome 22q11.2 microdeletions impart a high but incomplete risk for schizophrenia. Possible mechanisms include genome-wide effects of DGCR8 haploinsufficiency.

View Article and Find Full Text PDF

Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Babak Alipanahi Andrew Delong Matthew T Weirauch Brendan J Frey

Nat Biotechnol

August 2015

Knowing the sequence specificities of DNA- and RNA-binding proteins is essential for developing models of the regulatory processes in biological systems and for identifying causal disease variants. Here we show that sequence specificities can be ascertained from experimental data with 'deep learning' techniques, which offer a scalable, flexible and unified computational approach for pattern discovery. Using a diverse array of experimental data and evaluation metrics, we find that deep learning outperforms other state-of-the-art methods, even when training on in vitro data and testing on in vivo data.

View Article and Find Full Text PDF

RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease.

Hui Y Xiong Babak Alipanahi Leo J Lee Hannes Bretschneider Daniele Merico

Science

January 2015

To facilitate precision medicine and whole-genome annotation, we developed a machine-learning technique that scores how strongly genetic variants affect RNA splicing, whose alteration contributes to many diseases. Analysis of more than 650,000 intronic and exonic variants revealed widespread patterns of mutation-driven aberrant splicing. Intronic disease mutations that are more than 30 nucleotides from any splice site alter splicing nine times as often as common variants, and missense exonic disease mutations that have the least impact on protein function are five times as likely as others to alter splicing.

View Article and Find Full Text PDF

Widespread intron retention in mammals functionally tunes transcriptomes.

Ulrich Braunschweig Nuno L Barbosa-Morais Qun Pan Emil N Nachman Babak Alipanahi

Genome Res

November 2014

Alternative splicing (AS) of precursor RNAs is responsible for greatly expanding the regulatory and functional capacity of eukaryotic genomes. Of the different classes of AS, intron retention (IR) is the least well understood. In plants and unicellular eukaryotes, IR is the most common form of AS, whereas in animals, it is thought to represent the least prevalent form.

View Article and Find Full Text PDF

Brain-expressed exons under purifying selection are enriched for de novo mutations in autism spectrum disorder.

Mohammed Uddin Kristiina Tammimies Giovanna Pellecchia Babak Alipanahi Pingzhao Hu

Nat Genet

July 2014

A universal challenge in genetic studies of autism spectrum disorders (ASDs) is determining whether a given DNA sequence alteration will manifest as disease. Among different population controls, we observed, for specific exons, an inverse correlation between exon expression level in brain and burden of rare missense mutations. For genes that harbor de novo mutations predicted to be deleterious, we found that specific critical exons were significantly enriched in individuals with ASD relative to their siblings without ASD (P < 1.

View Article and Find Full Text PDF

Network cleanup.

Babak Alipanahi Brendan J Frey

Nat Biotechnol

August 2013

View Article and Find Full Text PDF

MBNL proteins repress ES-cell-specific alternative splicing and reprogramming.

Hong Han Manuel Irimia P Joel Ross Hoon-Ki Sung Babak Alipanahi

Nature

June 2013

Previous investigations of the core gene regulatory circuitry that controls the pluripotency of embryonic stem (ES) cells have largely focused on the roles of transcription, chromatin and non-coding RNA regulators. Alternative splicing represents a widely acting mode of gene regulation, yet its role in regulating ES-cell pluripotency and differentiation is poorly understood. Here we identify the muscleblind-like RNA binding proteins, MBNL1 and MBNL2, as conserved and direct negative regulators of a large program of cassette exon alternative splicing events that are differentially regulated between ES cells and other cell types.

View Article and Find Full Text PDF

Protein Structure Idealization: How accurately is it possible to model protein structures with dihedral angles?

Xuefeng Cui Shuai Cheng Li Dongbo Bu Babak Alipanahi Ming Li

Algorithms Mol Biol

February 2013

: Previous studies show that the same type of bond lengths and angles fit Gaussian distributions well with small standard deviations on high resolution protein structure data. The mean values of these Gaussian distributions have been widely used as ideal bond lengths and angles in bioinformatics. However, we are not aware of any research done to evaluate how accurately we can model protein structures with dihedral angles and ideal bond lengths and angles.

View Article and Find Full Text PDF

Determining protein structures from NOESY distance constraints by semidefinite programming.

Babak Alipanahi Nathan Krislock Ali Ghodsi Henry Wolkowicz Logan Donaldson

J Comput Biol

April 2013

Contemporary practical methods for protein nuclear magnetic resonance (NMR) structure determination use molecular dynamics coupled with a simulated annealing schedule. The objective of these methods is to minimize the error of deviating from the nuclear overhauser effect (NOE) distance constraints. However, the corresponding objective function is highly nonconvex and, consequently, difficult to optimize.

View Article and Find Full Text PDF

Error tolerant NMR backbone resonance assignment and automated structure generation.

Babak Alipanahi Xin Gao Emre Karakoc Shuai Cheng Li Frank Balbach

J Bioinform Comput Biol

February 2011

Error tolerant backbone resonance assignment is the cornerstone of the NMR structure determination process. Although a variety of assignment approaches have been developed, none works sufficiently well on noisy fully automatically picked peaks to enable the subsequent automatic structure determination steps. We have designed an integer linear programming (ILP) based assignment system (IPASS) that has enabled fully automatic protein structure determination for four test proteins.

View Article and Find Full Text PDF

Protein secondary structure prediction using NMR chemical shift data.

Yuzhong Zhao Babak Alipanahi Shuai Cheng Li Ming Li

J Bioinform Comput Biol

October 2010

Accurate determination of protein secondary structure from the chemical shift information is a key step for NMR tertiary structure determination. Relatively few work has been done on this subject. There needs to be a systematic investigation of algorithms that are (a) robust for large datasets; (b) easily extendable to (the dynamic) new databases; and (c) approaching to the limit of accuracy.

View Article and Find Full Text PDF

PICKY: a novel SVD-based NMR spectra peak picking method.

Babak Alipanahi Xin Gao Emre Karakoc Logan Donaldson Ming Li

Bioinformatics

June 2009

Motivation: Picking peaks from experimental NMR spectra is a key unsolved problem for automated NMR protein structure determination. Such a process is a prerequisite for resonance assignment, nuclear overhauser enhancement (NOE) distance restraint assignment, and structure calculation tasks. Manual or semi-automatic peak picking, which is currently the prominent way used in NMR labs, is tedious, time consuming and costly.

View Article and Find Full Text PDF