Publications by Eamon Collins

Publications by authors named "Eamon Collins"

Page 1 of 1

FastSK: fast sequence analysis with gapped string kernels.

Derrick Blakely Eamon Collins Ritambhara Singh Andrew Norton Jack Lanchantin

Bioinformatics

December 2020

Motivation: Gapped k-mer kernels with support vector machines (gkm-SVMs) have achieved strong predictive performance on regulatory DNA sequences on modestly sized training sets. However, existing gkm-SVM algorithms suffer from slow kernel computation time, as they depend exponentially on the sub-sequence feature length, number of mismatch positions, and the task's alphabet size.

Results: In this work, we introduce a fast and scalable algorithm for calculating gapped k-mer string kernels.

View Article and Find Full Text PDF