Computational solutions to large-scale data management and analysis.

Nat Rev Genet

Pacific Biosciences, Menlo Park, California 94025, USA.

Published: September 2010

Today we can generate hundreds of gigabases of DNA and RNA sequencing data in a week for less than US$5,000. The astonishing rate of data generation by these low-cost, high-throughput technologies in genomics is being matched by that of other technologies, such as real-time imaging and mass spectrometry-based flow cytometry. Success in the life sciences will depend on our ability to properly interpret the large-scale, high-dimensional data sets that are generated by these technologies, which in turn requires us to adopt advances in informatics. Here we discuss how we can master the different types of computational environments that exist, such as cloud and heterogeneous computing, to successfully tackle our big data problems.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3124937
DOI: http://dx.doi.org/10.1038/nrg2857
