Publications by Brandon Jew

Publications by authors named "Brandon Jew"

Page 1 of 1

The Effect of Model Directionality on Cell-Type-Specific Differential DNA Methylation Analysis.

Front Bioinform

January 2022

Calling differential methylation at a cell-type level from tissue-level bulk data is a fundamental challenge in genomics that has recently received more attention. These studies most often aim at identifying statistical associations rather than causal effects. However, existing methods typically make an implicit assumption about the direction of effects, and thus far, little to no attention has been given to the fact that this directionality assumption may not hold and can consequently affect statistical power and control for false positives.

View Article and Find Full Text PDF

Rare variants in the endocytic pathway are associated with Alzheimer's disease, its related phenotypes, and functional consequences.

Lingyu Zhan Jiajin Li Brandon Jew Jae Hoon Sul

PLoS Genet

September 2021

Late-onset Alzheimer's disease (LOAD) is the most common type of dementia causing irreversible brain damage to the elderly and presents a major public health challenge. Clinical research and genome-wide association studies have suggested a potential contribution of the endocytic pathway to AD, with an emphasis on common loci. However, the contribution of rare variants in this pathway to AD has not been thoroughly investigated.

View Article and Find Full Text PDF

xGAP: a python based efficient, modular, extensible and fault tolerant genomic analysis pipeline for variant discovery.

Aditya Gorla Brandon Jew Luke Zhang Jae Hoon Sul

Bioinformatics

April 2021

Motivation: Since the first human genome was sequenced in 2001, there has been a rapid growth in the number of bioinformatic methods to process and analyze next-generation sequencing (NGS) data for research and clinical studies that aim to identify genetic variants influencing diseases and traits. To achieve this goal, one first needs to call genetic variants from NGS data, which requires multiple computationally intensive analysis steps. Unfortunately, there is a lack of an open-source pipeline that can perform all these steps on NGS data in a manner, which is fully automated, efficient, rapid, scalable, modular, user-friendly and fault tolerant.

View Article and Find Full Text PDF

A machine learning algorithm to increase COVID-19 inpatient diagnostic capacity.

David Goodman-Meza Akos Rudas Jeffrey N Chiang Paul C Adamson Joseph Ebinger Brandon Jew

PLoS One

October 2020

Worldwide, testing capacity for SARS-CoV-2 is limited and bottlenecks in the scale up of polymerase chain reaction (PCR-based testing exist. Our aim was to develop and evaluate a machine learning algorithm to diagnose COVID-19 in the inpatient setting. The algorithm was based on basic demographic and laboratory features to serve as a screening tool at hospitals where testing is scarce or unavailable.

View Article and Find Full Text PDF

The causal effect of obesity on prediabetes and insulin resistance reveals the important role of adipose tissue in insulin resistance.

Zong Miao Marcus Alvarez Arthur Ko Yash Bhagat Elior Rahmani Brandon Jew

PLoS Genet

September 2020

Reverse causality has made it difficult to establish the causal directions between obesity and prediabetes and obesity and insulin resistance. To disentangle whether obesity causally drives prediabetes and insulin resistance already in non-diabetic individuals, we utilized the UK Biobank and METSIM cohort to perform a Mendelian randomization (MR) analyses in the non-diabetic individuals. Our results suggest that both prediabetes and systemic insulin resistance are caused by obesity (p = 1.

View Article and Find Full Text PDF

Enhancing droplet-based single-nucleus RNA-seq resolution using the semi-supervised machine learning classifier DIEM.

Marcus Alvarez Elior Rahmani Brandon Jew Kristina M Garske Zong Miao

Sci Rep

July 2020

Single-nucleus RNA sequencing (snRNA-seq) measures gene expression in individual nuclei instead of cells, allowing for unbiased cell type characterization in solid tissues. We observe that snRNA-seq is commonly subject to contamination by high amounts of ambient RNA, which can lead to biased downstream analyses, such as identification of spurious cell types if overlooked. We present a novel approach to quantify contamination and filter droplets in snRNA-seq experiments, called Debris Identification using Expectation Maximization (DIEM).

View Article and Find Full Text PDF

Publisher Correction: Accurate estimation of cell composition in bulk expression through robust integration of single-cell information.

Brandon Jew Marcus Alvarez Elior Rahmani Zong Miao Arthur Ko

Nat Commun

June 2020

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

View Article and Find Full Text PDF

Accurate estimation of cell composition in bulk expression through robust integration of single-cell information.

Brandon Jew Marcus Alvarez Elior Rahmani Zong Miao Arthur Ko

Nat Commun

April 2020

We present Bisque, a tool for estimating cell type proportions in bulk expression. Bisque implements a regression-based approach that utilizes single-cell RNA-seq (scRNA-seq) or single-nucleus RNA-seq (snRNA-seq) data to generate a reference expression profile and learn gene-specific bulk expression transformations to robustly decompose RNA-seq data. These transformations significantly improve decomposition performance compared to existing methods when there is significant technical variation in the generation of the reference profile and observed bulk expression.

View Article and Find Full Text PDF

ForestQC: Quality control on genetic variants from next-generation sequencing data using random forest.

Jiajin Li Brandon Jew Lingyu Zhan Sungoo Hwang Giovanni Coppola

PLoS Comput Biol

December 2019

Next-generation sequencing technology (NGS) enables the discovery of nearly all genetic variants present in a genome. A subset of these variants, however, may have poor sequencing quality due to limitations in NGS or variant callers. In genetic studies that analyze a large number of sequenced individuals, it is critical to detect and remove those variants with poor quality as they may cause spurious findings.

View Article and Find Full Text PDF

Leveraging allelic imbalance to refine fine-mapping for eQTL studies.

Jennifer Zou Farhad Hormozdiari Brandon Jew Stephane E Castel Tuuli Lappalainen

PLoS Genet

December 2019

Many disease risk loci identified in genome-wide association studies are present in non-coding regions of the genome. Previous studies have found enrichment of expression quantitative trait loci (eQTLs) in disease risk loci, indicating that identifying causal variants for gene expression is important for elucidating the genetic basis of not only gene expression but also complex traits. However, detecting causal variants is challenging due to complex genetic correlation among variants known as linkage disequilibrium (LD) and the presence of multiple causal variants within a locus.

View Article and Find Full Text PDF

An automated machine learning-based model predicts postoperative mortality using readily-extractable preoperative electronic health record data.

Brian L Hill Robert Brown Eilon Gabel Nadav Rakocz Christine Lee Brandon Jew

Br J Anaesth

December 2019

Background: Rapid, preoperative identification of patients with the highest risk for medical complications is necessary to ensure that limited infrastructure and human resources are directed towards those most likely to benefit. Existing risk scores either lack specificity at the patient level or utilise the American Society of Anesthesiologists (ASA) physical status classification, which requires a clinician to review the chart.

Methods: We report on the use of machine learning algorithms, specifically random forests, to create a fully automated score that predicts postoperative in-hospital mortality based solely on structured data available at the time of surgery.

View Article and Find Full Text PDF

Variant calling and quality control of large-scale human genome sequencing data.

Brandon Jew Jae Hoon Sul

Emerg Top Life Sci

August 2019

Next-generation sequencing has allowed genetic studies to collect genome sequencing data from a large number of individuals. However, raw sequencing data are not usually interpretable due to fragmentation of the genome and technical biases; therefore, analysis of these data requires many computational approaches. First, for each sequenced individual, sequencing data are aligned and further processed to account for technical biases.

View Article and Find Full Text PDF

An efficient linear mixed model framework for meta-analytic association studies across multiple contexts.

Brandon Jew Jiajin Li Sriram Sankararaman Jae Hoon Sul

Lebniz Int Proc Inform

December 2016

Linear mixed models (LMMs) can be applied in the meta-analyses of responses from individuals across multiple contexts, increasing power to detect associations while accounting for confounding effects arising from within-individual variation. However, traditional approaches to fitting these models can be computationally intractable. Here, we describe an efficient and exact method for fitting a multiple-context linear mixed model.

View Article and Find Full Text PDF