ARTS: automated randomization of multiple traits for study design.

Bioinformatics

Center for Research Informatics, Institute for Interventional Health Informatics, Department of Medicine, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, Computation Institute of The University of Chicago, Chicago and Argonne National Laboratory, The University of Chicago, Lemont, IL, USACenter for Research Informatics, Institute for Interventional Health Informatics, Department of Medicine, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, Computation Institute of The University of Chicago, Chicago and Argonne National Laboratory, The University of Chicago, Lemont, IL, USACenter for Research Informatics, Institute for Interventional Health Informatics, Department of Medicine, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, Computation Institute of The University of Chicago, Chicago and Argonne National Laboratory, The University of Chicago, Lemont, IL, USACenter for Research Informatics, Institute for Interventional Health Informatics, Department of Medicine, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, Computation Institute of The University of Chicago, Chicago and Argonne National Laboratory, The University of Chicago, Lemont, IL, USACenter for Research Informatics, Institute for Interventional Health Informatics, Department of Medicine, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, Computation Institute of The University of Chicago, Chicago and Argonne National Laboratory, The University of Chicago, Lemont, IL, USA.

Published: June 2014

Unlabelled: Collecting data from large studies on high-throughput platforms, such as microarray or next-generation sequencing, typically requires processing samples in batches. There are often systematic but unpredictable biases from batch-to-batch, so proper randomization of biologically relevant traits across batches is crucial for distinguishing true biological differences from experimental artifacts. When a large number of traits are biologically relevant, as is common for clinical studies of patients with varying sex, age, genotype and medical background, proper randomization can be extremely difficult to prepare by hand, especially because traits may affect biological inferences, such as differential expression, in a combinatorial manner. Here we present ARTS (automated randomization of multiple traits for study design), which aids researchers in study design by automatically optimizing batch assignment for any number of samples, any number of traits and any batch size.

Availability And Implementation: ARTS is implemented in Perl and is available at github.com/mmaiensc/ARTS. ARTS is also available in the Galaxy Tool Shed, and can be used at the Galaxy installation hosted by the UIC Center for Research Informatics (CRI) at galaxy.cri.uic.edu.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029038PMC
http://dx.doi.org/10.1093/bioinformatics/btu075DOI Listing

Publication Analysis

Top Keywords

study design
12
arts automated
8
automated randomization
8
randomization multiple
8
multiple traits
8
traits study
8
proper randomization
8
biologically relevant
8
number traits
8
traits
6

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!