Publications by Katrin Sophie Bohnsack

Publications by authors named "Katrin Sophie Bohnsack"

Page 1 of 1

Alignment-Free Sequence Comparison: A Systematic Survey From a Machine Learning Perspective.

Katrin Sophie Bohnsack Marika Kaden Julia Abel Thomas Villmann

IEEE/ACM Trans Comput Biol Bioinform

April 2023

The encounter of large amounts of biological sequence data generated during the last decades and the algorithmic and hardware improvements have offered the possibility to apply machine learning techniques in bioinformatics. While the machine learning community is aware of the necessity to rigorously distinguish data transformation from data comparison and adopt reasonable combinations thereof, this awareness is often lacking in the field of comparative sequence analysis. With realization of the disadvantages of alignments for sequence comparison, some typical applications use more and more so-called alignment-free approaches.

View Article and Find Full Text PDF

The Resolved Mutual Information Function as a Structural Fingerprint of Biomolecular Sequences for Interpretable Machine Learning Classifiers.

Katrin Sophie Bohnsack Marika Kaden Julia Abel Sascha Saralajew Thomas Villmann

Entropy (Basel)

October 2021

In the present article we propose the application of variants of the mutual information function as characteristic fingerprints of biomolecular sequences for classification analysis. In particular, we consider the resolved mutual information functions based on Shannon-, Rényi-, and Tsallis-entropy. In combination with interpretable machine learning classifier models based on generalized learning vector quantization, a powerful methodology for sequence classification is achieved which allows substantial knowledge extraction in addition to the high classification ability due to the model-inherent robustness.

View Article and Find Full Text PDF

Learning vector quantization as an interpretable classifier for the detection of SARS-CoV-2 types based on their RNA sequences.

Marika Kaden Katrin Sophie Bohnsack Mirko Weber Mateusz Kudła Kaja Gutowska

Neural Comput Appl

April 2021

Unlabelled: We present an approach to discriminate SARS-CoV-2 virus types based on their RNA sequence descriptions avoiding a sequence alignment. For that purpose, sequences are preprocessed by feature extraction and the resulting feature vectors are analyzed by prototype-based classification to remain interpretable. In particular, we propose to use variants of learning vector quantization (LVQ) based on dissimilarity measures for RNA sequence data.

View Article and Find Full Text PDF

Virxicon: a lexicon of viral sequences.

Mateusz Kudla Kaja Gutowska Jaroslaw Synak Mirko Weber Katrin Sophie Bohnsack

Bioinformatics

April 2021

Motivation: Viruses are the most abundant biological entities and constitute a large reservoir of genetic diversity. In recent years, knowledge about them has increased significantly as a result of dynamic development in life sciences and rapid technological progress. This knowledge is scattered across various data repositories, making a comprehensive analysis of viral data difficult.

View Article and Find Full Text PDF

Publications by authors named "Katrin Sophie Bohnsack"

Alignment-Free Sequence Comparison: A Systematic Survey From a Machine Learning Perspective.

The Resolved Mutual Information Function as a Structural Fingerprint of Biomolecular Sequences for Interpretable Machine Learning Classifiers.

Learning vector quantization as an interpretable classifier for the detection of SARS-CoV-2 types based on their RNA sequences.

Virxicon: a lexicon of viral sequences.

A PHP Error was encountered

A PHP Error was encountered