While advances in genome sequencing technology make population-scale genomics a possibility, current approaches for analysis of these data rely upon parallelization strategies that have limited scalability, complex implementation and lack reproducibility. Churchill, a balanced regional parallelization strategy, overcomes these challenges, fully automating the multiple steps required to go from raw sequencing reads to variant discovery. Through implementation of novel deterministic parallelization techniques, Churchill allows computationally efficient analysis of a high-depth whole genome sample in less than two hours. The method is highly scalable, enabling full analysis of the 1000 Genomes raw sequence dataset in a week using cloud resources. http://churchill.nchri.org/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333267PMC
http://dx.doi.org/10.1186/s13059-014-0577-xDOI Listing

Publication Analysis

Top Keywords

highly scalable
8
parallelization strategy
8
population-scale genomics
8
churchill ultra-fast
4
ultra-fast deterministic
4
deterministic highly
4
scalable balanced
4
parallelization
4
balanced parallelization
4
strategy discovery
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!