Issues in bioinformatics benchmarking: the case study of multiple sequence alignment.

Nucleic Acids Res

Department of Structural Biology and Genomics, Institut de Génétique et de Biologie Moléculaire et Cellulaire, The Centre National de la Recherche Scientifique, UMR7104, F-67400 Illkirch and Université de Strasbourg, F-67000 Strasbourg, France.

Published: November 2010

The post-genomic era presents many new challenges for the field of bioinformatics. Novel computational approaches are now being developed to handle the large, complex and noisy datasets produced by high throughput technologies. Objective evaluation of these methods is essential (i) to assure high quality, (ii) to identify strong and weak points of the algorithms, (iii) to measure the improvements introduced by new methods and (iv) to enable non-specialists to choose an appropriate tool. Here, we discuss the development of formal benchmarks, designed to represent the current problems encountered in the bioinformatics field. We consider several criteria for building good benchmarks and the advantages to be gained when they are used intelligently. To illustrate these principles, we present a more detailed discussion of benchmarks for multiple alignments of protein sequences. As in many other domains, significant progress has been achieved in the multiple alignment field and the datasets have become progressively more challenging as the existing algorithms have evolved. Finally, we propose directions for future developments that will ensure that the bioinformatics benchmarks correspond to the challenges posed by the high throughput data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2995051PMC
http://dx.doi.org/10.1093/nar/gkq625DOI Listing

Publication Analysis

Top Keywords

high throughput
8
issues bioinformatics
4
bioinformatics benchmarking
4
benchmarking case
4
case study
4
study multiple
4
multiple sequence
4
sequence alignment
4
alignment post-genomic
4
post-genomic era
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!