Publications by Jack Lanchantin

FastSK: fast sequence analysis with gapped string kernels.

Derrick Blakely, Eamon Collins, Ritambhara Singh, Andrew Norton, Jack Lanchantin

Bioinformatics

December 2020

Motivation: Gapped k-mer kernels with support vector machines (gkm-SVMs) have achieved strong predictive performance on regulatory DNA sequences using modestly sized training sets. However, existing gkm-SVM algorithms suffer from slow kernel computation, because their running time grows exponentially with the sub-sequence feature length, the number of mismatch positions, and the task's alphabet size.

Results: In this work, we introduce a fast and scalable algorithm for calculating gapped k-mer string kernels.

Graph convolutional networks for epigenetic state prediction using both sequence and 3D genome data.

Jack Lanchantin, Yanjun Qi

Bioinformatics

December 2020

Motivation: Predictive models of DNA chromatin profile (i.e. epigenetic state), such as transcription factor binding, are essential for understanding regulatory processes and developing gene therapies.

Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin.

Ritambhara Singh, Jack Lanchantin, Arshdeep Sekhon, Yanjun Qi

Adv Neural Inf Process Syst

December 2017

The past decade has seen a revolution in genomic technologies that has enabled a flood of genome-wide profiling of chromatin marks. Recent work has tried to understand gene regulation by predicting gene expression from large-scale chromatin measurements. Two fundamental challenges exist for such learning tasks: (1) genome-wide chromatin signals are spatially structured, high-dimensional and highly modular; and (2) the core aim is to understand what the relevant factors are and how they work together.

Opportunities and obstacles for deep learning in biology and medicine.

Travers Ching, Daniel S Himmelstein, Brett K Beaulieu-Jones, Alexandr A Kalinin, Brian T Do, Jack Lanchantin

J R Soc Interface

April 2018

Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood.

Deep Motif Dashboard: Visualizing and Understanding Genomic Sequences Using Deep Neural Networks.

Jack Lanchantin, Ritambhara Singh, Beilun Wang, Yanjun Qi

Pac Symp Biocomput

June 2017

Deep neural network (DNN) models have recently obtained state-of-the-art prediction accuracy for the transcription factor binding site (TFBS) classification task. However, it remains unclear how these approaches identify meaningful DNA sequence signals and give insights as to why TFs bind to certain locations. In this paper, we propose a toolkit called the Deep Motif Dashboard (DeMo Dashboard), which provides a suite of visualization strategies to extract motifs, or sequence patterns, from deep neural network models for TFBS classification.

Transfer String Kernel for Cross-Context DNA-Protein Binding Prediction.

Ritambhara Singh, Jack Lanchantin, Gabriel Robins, Yanjun Qi

IEEE/ACM Trans Comput Biol Bioinform

March 2020

Through sequence-based classification, this paper tries to accurately predict the DNA binding sites of transcription factors (TFs) in an unannotated cellular context. Related methods in the literature fail to perform such predictions accurately, since they do not consider the sample distribution shift of sequence segments from an annotated (source) context to an unannotated (target) context. We therefore propose a method called "Transfer String Kernel" (TSK) that achieves improved prediction of transcription factor binding sites (TFBSs) using knowledge transfer via cross-context sample adaptation.

DeepChrome: deep-learning for predicting gene expression from histone modifications.

Ritambhara Singh, Jack Lanchantin, Gabriel Robins, Yanjun Qi

Bioinformatics

September 2016

Motivation: Histone modifications are among the most important factors that control gene regulation. Computational methods that predict gene expression from histone modification signals are highly desirable for understanding their combinatorial effects in gene regulation. This knowledge can help in developing 'epigenetic drugs' for diseases like cancer.
