Publications by Helena Andres-Terre

Publications by authors named "Helena Andres-Terre"

Page 1 of 1

A novel interpretable machine learning algorithm to identify optimal parameter space for cancer growth.

Helena Coggan Helena Andres Terre Pietro Liò

Front Big Data

September 2022

Recent years have seen an increase in the application of machine learning to the analysis of physical and biological systems, including cancer progression. A fundamental downside to these tools is that their complexity and nonlinearity makes it almost impossible to establish a deterministic, relationship between their input and output, and thus their predictions are not wholly accountable. We begin with a series of proofs establishing that this holds even for the simplest possible model of a neural network; the effects of specific loss functions are explored more fully in Appendices.

View Article and Find Full Text PDF

Unsupervised construction of computational graphs for gene expression data with explicit structural inductive biases.

Paul Scherer Maja Trębacz Nikola Simidjievski Ramon Viñas Zohreh Shams

Bioinformatics

February 2022

Motivation: Gene expression data are commonly used at the intersection of cancer research and machine learning for better understanding of the molecular status of tumour tissue. Deep learning predictive models have been employed for gene expression data due to their ability to scale and remove the need for manual feature engineering. However, gene expression data are often very high dimensional, noisy and presented with a low number of samples.

View Article and Find Full Text PDF

CellVGAE: an unsupervised scRNA-seq analysis workflow with graph attention networks.

David Buterez Ioana Bica Ifrah Tariq Helena Andrés-Terré Pietro Liò

Bioinformatics

February 2022

Motivation: Single-cell RNA sequencing allows high-resolution views of individual cells for libraries of up to millions of samples, thus motivating the use of deep learning for analysis. In this study, we introduce the use of graph neural networks for the unsupervised exploration of scRNA-seq data by developing a variational graph autoencoder architecture with graph attention layers that operates directly on the connectivity between cells, focusing on dimensionality reduction and clustering. With the help of several case studies, we show that our model, named CellVGAE, can be effectively used for exploratory analysis even on challenging datasets, by extracting meaningful features from the data and providing the means to visualize and interpret different aspects of the model.

View Article and Find Full Text PDF

Adversarial generation of gene expression data.

Ramon Viñas Helena Andrés-Terré Pietro Liò Kevin Bryson

Bioinformatics

January 2022

Motivation: High-throughput gene expression can be used to address a wide range of fundamental biological problems, but datasets of an appropriate size are often unavailable. Moreover, existing transcriptomics simulators have been criticized because they fail to emulate key properties of gene expression data. In this article, we develop a method based on a conditional generative adversarial network to generate realistic transcriptomics data for Escherichia coli and humans.

View Article and Find Full Text PDF

Unsupervised generative and graph representation learning for modelling cell differentiation.

Ioana Bica Helena Andrés-Terré Ana Cvejic Pietro Liò

Sci Rep

June 2020

Using machine learning techniques to build representations from biomedical data can help us understand the latent biological mechanism of action and lead to important discoveries. Recent developments in single-cell RNA-sequencing protocols have allowed measuring gene expression for individual cells in a population, thus opening up the possibility of finding answers to biomedical questions about cell differentiation. In this paper, we explore unsupervised generative neural methods, based on the variational autoencoder, that can model cell differentiation by building meaningful representations from the high dimensional and complex gene expression data.

View Article and Find Full Text PDF

Variational Autoencoders for Cancer Data Integration: Design Principles and Computational Practice.

Nikola Simidjievski Cristian Bodnar Ifrah Tariq Paul Scherer Helena Andres Terre

Front Genet

December 2019

International initiatives such as the Molecular Taxonomy of Breast Cancer International Consortium are collecting multiple data sets at different genome-scales with the aim to identify novel cancer bio-markers and predict patient survival. To analyze such data, several machine learning, bioinformatics, and statistical methods have been applied, among them neural networks such as autoencoders. Although these models provide a good statistical learning framework to analyze multi-omic and/or clinical data, there is a distinct lack of work on how to integrate diverse patient data and identify the optimal design best suited to the available data.

View Article and Find Full Text PDF