An intuitive Python interface for Bioconductor libraries demonstrates the utility of language translators.

BMC Bioinformatics

DMAC, Centre for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Matematiktorvet, 2100 Lyngby, Denmark.

Published: December 2010

AI Article Synopsis

  • Computer languages can be specialized for different fields, and projects often require knowledge of multiple languages for effective implementation.
  • The Bioconductor project, primarily using the R language, is essential for analyzing data from biological studies, while Python is recognized for its ease in developing applications and handling large datasets.
  • A new development now allows the use of Bioconductor's functionalities directly in Python, facilitating access to complex biological data without needing to know R, and making it easier for Python developers to work with such data.

Article Abstract

Background: Computer languages can be domain-related, and in the case of multidisciplinary projects, knowledge of several languages will be needed in order to quickly implements ideas. Moreover, each computer language has relative strong points, making some languages better suited than others for a given task to be implemented. The Bioconductor project, based on the R language, has become a reference for the numerical processing and statistical analysis of data coming from high-throughput biological assays, providing a rich selection of methods and algorithms to the research community. At the same time, Python has matured as a rich and reliable language for the agile development of prototypes or final implementations, as well as for handling large data sets.

Results: The data structures and functions from Bioconductor can be exposed to Python as a regular library. This allows a fully transparent and native use of Bioconductor from Python, without one having to know the R language and with only a small community of translators required to know both. To demonstrate this, we have implemented such Python representations for key infrastructure packages in Bioconductor, letting a Python programmer handle annotation data, microarray data, and next-generation sequencing data.

Conclusions: Bioconductor is now not solely reserved to R users. Building a Python application using Bioconductor functionality can be done just like if Bioconductor was a Python package. Moreover, similar principles can be applied to other languages and libraries. Our Python package is available at: http://pypi.python.org/pypi/rpy2-bioconductor-extensions/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3040525PMC
http://dx.doi.org/10.1186/1471-2105-11-S12-S11DOI Listing

Publication Analysis

Top Keywords

bioconductor
8
python
8
bioconductor python
8
python package
8
language
5
data
5
intuitive python
4
python interface
4
interface bioconductor
4
bioconductor libraries
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!