Publications by Roeland C H J van Ham | LitMetric

Publications by authors named "Roeland C H J van Ham"

Page 1 of 2

Automatic Gene Function Prediction in the 2020's.

Stavros Makrodimitris Roeland C H J van Ham Marcel J T Reinders

Genes (Basel)

October 2020

The current rate at which new DNA and protein sequences are being generated is too fast to experimentally discover the functions of those sequences, emphasizing the need for accurate Automatic Function Prediction (AFP) methods. AFP has been an active and growing research field for decades and has made considerable progress in that time. However, it is certainly not solved.

View Article and Find Full Text PDF

Unsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular function.

Amelia Villegas-Morcillo Stavros Makrodimitris Roeland C H J van Ham Angel M Gomez Victoria Sanchez

Bioinformatics

April 2021

Motivation: Protein function prediction is a difficult bioinformatics problem. Many recent methods use deep neural networks to learn complex sequence representations and predict function from these. Deep supervised models require a lot of labeled training data which are not available for this task.

View Article and Find Full Text PDF

Metric learning on expression data for gene function prediction.

Stavros Makrodimitris Marcel J T Reinders Roeland C H J van Ham

Bioinformatics

February 2020

Motivation: Co-expression of two genes across different conditions is indicative of their involvement in the same biological process. However, when using RNA-Seq datasets with many experimental conditions from diverse sources, only a subset of the experimental conditions is expected to be relevant for finding genes related to a particular Gene Ontology (GO) term. Therefore, we hypothesize that when the purpose is to find similarly functioning genes, the co-expression of genes should not be determined on all samples but only on those samples informative for the GO term of interest.

View Article and Find Full Text PDF

Improving protein function prediction using protein sequence and GO-term similarities.

Stavros Makrodimitris Roeland C H J van Ham Marcel J T Reinders

Bioinformatics

April 2019

Motivation: Most automatic functional annotation methods assign Gene Ontology (GO) terms to proteins based on annotations of highly similar proteins. We advocate that proteins that are less similar are still informative. Also, despite their simplicity and structure, GO terms seem to be hard for computers to learn, in particular the Biological Process ontology, which has the most terms (>29 000).

View Article and Find Full Text PDF

Correction: The Genomes of the Fungal Plant Pathogens Cladosporium fulvum and Dothistroma septosporum Reveal Adaptation to Different Hosts and Lifestyles But Also Signatures of Common Ancestry.

Pierre J G M de Wit Ate van der Burgt Bilal Ökmen Ioannis Stergiopoulos Kamel A Abd-Elsalam Braham Dhillon Richard C Hamelin Roeland C H J van Ham

PLoS Genet

December 2015

View Article and Find Full Text PDF

Homologues of potato chromosome 5 show variable collinearity in the euchromatin, but dramatic absence of sequence similarity in the pericentromeric heterochromatin.

Jan M de Boer Erwin Datema Xiaomin Tang Theo J A Borm Erin H Bakker Roeland C H J van Ham

BMC Genomics

May 2015

Background: In flowering plants it has been shown that de novo genome assemblies of different species and genera show a significant drop in the proportion of alignable sequence. Within a plant species, however, it is assumed that different haplotypes of the same chromosome align well. In this paper we have compared three de novo assemblies of potato chromosome 5 and report on the sequence variation and the proportion of sequence that can be aligned.

View Article and Find Full Text PDF

A quantitative and dynamic model of the Arabidopsis flowering time gene regulatory network.

Felipe Leal Valentim Simon van Mourik David Posé Min C Kim Markus Schmid Roeland C H J van Ham

PLoS One

January 2016

Various environmental signals integrate into a network of floral regulatory genes leading to the final decision on when to flower. Although a wealth of qualitative knowledge is available on how flowering time genes regulate each other, only a few studies incorporated this knowledge into predictive models. Such models are invaluable as they enable to investigate how various types of inputs are combined to give a quantitative readout.

View Article and Find Full Text PDF

The genome of the stress-tolerant wild tomato species Solanum pennellii.

Anthony Bolger Federico Scossa Marie E Bolger Christa Lanz Florian Maumus Roeland C H J van Ham

Nat Genet

September 2014

Solanum pennellii is a wild tomato species endemic to Andean regions in South America, where it has evolved to thrive in arid habitats. Because of its extreme stress tolerance and unusual morphology, it is an important donor of germplasm for the cultivated tomato Solanum lycopersicum. Introgression lines (ILs) in which large genomic regions of S.

View Article and Find Full Text PDF

Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.

Plant J

October 2014

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars.

View Article and Find Full Text PDF

The genomes of the fungal plant pathogens Cladosporium fulvum and Dothistroma septosporum reveal adaptation to different hosts and lifestyles but also signatures of common ancestry.

Pierre J G M de Wit Ate van der Burgt Bilal Ökmen Ioannis Stergiopoulos Kamel A Abd-Elsalam Braham Dhillon Richard C Hamelin Roeland C H J van Ham

PLoS Genet

May 2013

We sequenced and compared the genomes of the Dothideomycete fungal plant pathogens Cladosporium fulvum (Cfu) (syn. Passalora fulva) and Dothistroma septosporum (Dse) that are closely related phylogenetically, but have different lifestyles and hosts. Although both fungi grow extracellularly in close contact with host mesophyll cells, Cfu is a biotroph infecting tomato, while Dse is a hemibiotroph infecting pine.

View Article and Find Full Text PDF

Mutational robustness of gene regulatory networks.

Aalt D J van Dijk Simon van Mourik Roeland C H J van Ham

PLoS One

June 2012

Mutational robustness of gene regulatory networks refers to their ability to generate constant biological output upon mutations that change network structure. Such networks contain regulatory interactions (transcription factor-target gene interactions) but often also protein-protein interactions between transcription factors. Using computational modeling, we study factors that influence robustness and we infer several network properties governing it.

View Article and Find Full Text PDF

Predicting the impact of alternative splicing on plant MADS domain protein function.

Edouard I Severing Aalt D J van Dijk Giuseppa Morabito Jacqueline Busscher-Lange Richard G H Immink Roeland C H J van Ham

PLoS One

June 2012

Several genome-wide studies demonstrated that alternative splicing (AS) significantly increases the transcriptome complexity in plants. However, the impact of AS on the functional diversity of proteins is difficult to assess using genome-wide approaches. The availability of detailed sequence annotations for specific genes and gene families allows for a more detailed assessment of the potential effect of AS on their function.

View Article and Find Full Text PDF

Correlated mutations via regularized multinomial regression.

Janardanan Sreekumar Cajo J F ter Braak Roeland C H J van Ham Aalt D J van Dijk

BMC Bioinformatics

November 2011

Background: In addition to sequence conservation, protein multiple sequence alignments contain evolutionary signal in the form of correlated variation among amino acid positions. This signal indicates positions in the sequence that influence each other, and can be applied for the prediction of intra- or intermolecular contacts. Although various approaches exist for the detection of such correlated mutations, in general these methods utilize only pairwise correlations.

View Article and Find Full Text PDF

Genome sequence and analysis of the tuber crop potato.

Nature

July 2011

Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression.

View Article and Find Full Text PDF

Finished genome of the fungal wheat pathogen Mycosphaerella graminicola reveals dispensome structure, chromosome plasticity, and stealth pathogenesis.

Stephen B Goodwin Sarrah Ben M'barek Braham Dhillon Alexander H J Wittenberg Charles F Crane Roeland C H J van Ham Kim E Hammond-Kosack

PLoS Genet

June 2011

The plant-pathogenic fungus Mycosphaerella graminicola (asexual stage: Septoria tritici) causes septoria tritici blotch, a disease that greatly reduces the yield and quality of wheat. This disease is economically important in most wheat-growing areas worldwide and threatens global food production. Control of the disease has been hampered by a limited understanding of the genetic and biochemical bases of pathogenicity, including mechanisms of infection and of resistance in the host.

View Article and Find Full Text PDF

PRI-CAT: a web-tool for the analysis, storage and visualization of plant ChIP-seq experiments.

Jose M Muiño Marlous Hoogstraat Roeland C H J van Ham Aalt D J van Dijk

Nucleic Acids Res

July 2011

Although several tools for the analysis of ChIP-seq data have been published recently, there is a growing demand, in particular in the plant research community, for computational resources with which such data can be processed, analyzed, stored, visualized and integrated within a single, user-friendly environment. To accommodate this demand, we have developed PRI-CAT (Plant Research International ChIP-seq analysis tool), a web-based workflow tool for the management and analysis of ChIP-seq experiments. PRI-CAT is currently focused on Arabidopsis, but will be extended with other plant species in the near future.

View Article and Find Full Text PDF

Assessing the contribution of alternative splicing to proteome diversity in Arabidopsis thaliana using proteomics data.

Edouard I Severing Aalt D J van Dijk Roeland C H J van Ham

BMC Plant Biol

May 2011

Background: Large-scale analyses of genomics and transcriptomics data have revealed that alternative splicing (AS) substantially increases the complexity of the transcriptome in higher eukaryotes. However, the extent to which this complexity is reflected at the level of the proteome remains unclear. On the basis of a lack of conservation of AS between species, we previously concluded that AS does not frequently serve as a mechanism that enables the production of multiple functional proteins from a single gene.

View Article and Find Full Text PDF

SLIDER: a generic metaheuristic for the discovery of correlated motifs in protein-protein interaction networks.

Peter Boyen Dries Van Dyck Frank Neven Roeland C H J van Ham Aalt D J van Dijk

IEEE/ACM Trans Comput Biol Bioinform

January 2012

Correlated motif mining (cmm) is the problem of finding overrepresented pairs of patterns, called motifs, in sequences of interacting proteins. Algorithmic solutions for cmm thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-driven approach where the support of candidate motif pairs is evaluated in the network.

View Article and Find Full Text PDF

Sequence motifs in MADS transcription factors responsible for specificity and diversification of protein-protein interaction.

Aalt D J van Dijk Giuseppa Morabito Martijn Fiers Roeland C H J van Ham Gerco C Angenent

PLoS Comput Biol

November 2010

Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana.

View Article and Find Full Text PDF

Genome-wide computational function prediction of Arabidopsis proteins by integration of multiple data sources.

Yiannis A I Kourmpetis Aalt D J van Dijk Roeland C H J van Ham Cajo J F ter Braak

Plant Physiol

January 2011

Although Arabidopsis (Arabidopsis thaliana) is the best studied plant species, the biological role of one-third of its proteins is still unknown. We developed a probabilistic protein function prediction method that integrates information from sequences, protein-protein interactions, and gene expression. The method was applied to proteins from Arabidopsis.

View Article and Find Full Text PDF

Conserved and variable correlated mutations in the plant MADS protein network.

Aalt D J van Dijk Roeland C H J van Ham

BMC Genomics

October 2010

Background: Plant MADS domain proteins are involved in a variety of developmental processes for which their ability to form various interactions is a key requisite. However, not much is known about the structure of these proteins or their complexes, whereas such knowledge would be valuable for a better understanding of their function. Here, we analyze those proteins and the complexes they form using a correlated mutation approach in combination with available structural, bioinformatics and experimental data.

View Article and Find Full Text PDF

Continuous-time modeling of cell fate determination in Arabidopsis flowers.

Simon van Mourik Aalt D J van Dijk Maarten de Gee Richard G H Immink Kerstin Kaufmann Roeland C H J van Ham

BMC Syst Biol

July 2010

Background: The genetic control of floral organ specification is currently being investigated by various approaches, both experimentally and through modeling. Models and simulations have mostly involved boolean or related methods, and so far a quantitative, continuous-time approach has not been explored.

Results: We propose an ordinary differential equation (ODE) model that describes the gene expression dynamics of a gene regulatory network that controls floral organ formation in the model plant Arabidopsis thaliana.

View Article and Find Full Text PDF

Bayesian Markov Random Field analysis for protein function prediction based on network data.

Yiannis A I Kourmpetis Aalt D J van Dijk Marco C A M Bink Roeland C H J van Ham Cajo J F ter Braak

PLoS One

February 2010

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act.

View Article and Find Full Text PDF

In silico miRNA prediction in metazoan genomes: balancing between sensitivity and specificity.

Ate van der Burgt Mark W J E Fiers Jan-Peter Nap Roeland C H J van Ham

BMC Genomics

April 2009

Background: MicroRNAs (miRNAs), short approximately 21-nucleotide RNA molecules, play an important role in post-transcriptional regulation of gene expression. The number of known miRNA hairpins registered in the miRBase database is rapidly increasing, but recent reports suggest that many miRNAs with restricted temporal or tissue-specific expression remain undiscovered. Various strategies for in silico miRNA identification have been proposed to facilitate miRNA discovery.

View Article and Find Full Text PDF

Comparative analysis indicates that alternative splicing in plants has a limited role in functional expansion of the proteome.

Edouard I Severing Aalt D J van Dijk Willem J Stiekema Roeland C H J van Ham

BMC Genomics

April 2009

Background: Alternative splicing (AS) is a widespread phenomenon in higher eukaryotes but the extent to which it leads to functional protein isoforms and to proteome expansion at large is still a matter of debate. In contrast to animal species, for which AS has been studied extensively at the protein and functional level, protein-centered studies of AS in plant species are scarce. Here we investigate the functional impact of AS in dicot and monocot plant species using a comparative approach.

View Article and Find Full Text PDF