Motivation: With high-throughput genotyping systems now available, it has become feasible to fully integrate genotyping information into breeding programs. To make use of this information effectively requires DNA extraction facilities and marker production facilities that can efficiently deploy the desired set of markers across samples with a rapid turnaround time that allows for selection before crosses needed to be made. In reality, breeders often have a short window of time to make decisions by the time they are able to collect all their phenotyping data and receive corresponding genotyping data. This presents a challenge to organize information and utilize it in downstream analyses to support decisions made by breeders. In order to implement genomic selection routinely as part of breeding programs, one would need an efficient genotyping data storage system. We selected and benchmarked six popular open-source data storage systems, including relational database management and columnar storage systems.

Results: We found that data extract times are greatly influenced by the orientation in which genotype data is stored in a system. HDF5 consistently performed best, in part because it can more efficiently work with both orientations of the allele matrix.

Availability: http://gobiin1.bti.cornell.edu:6083/projects/GBM/repos/benchmarking/browse.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6737464PMC
http://dx.doi.org/10.1093/database/baz096DOI Listing

Publication Analysis

Top Keywords

genomic selection
8
breeding programs
8
genotyping data
8
data storage
8
data
6
benchmarking database
4
database systems
4
systems genomic
4
selection implementation
4
implementation motivation
4

Similar Publications

Unraveling the potential mechanism and prognostic value of pentose phosphate pathway in hepatocellular carcinoma: a comprehensive analysis integrating bulk transcriptomics and single-cell sequencing data.

Funct Integr Genomics

January 2025

Institute of Infectious Diseases, Guangdong Province, Guangzhou Eighth People's Hospital, Guangzhou Medical University, 8 Huaying Road, Baiyun District, Guangzhou, 510440, China.

Hepatocellular carcinoma (HCC) remains a malignant and life-threatening tumor with an extremely poor prognosis, posing a significant global health challenge. Despite the continuous emergence of novel therapeutic agents, patients exhibit substantial heterogeneity in their responses to anti-tumor drugs and overall prognosis. The pentose phosphate pathway (PPP) is highly activated in various tumor cells and plays a pivotal role in tumor metabolic reprogramming.

View Article and Find Full Text PDF

Phenomic selection based on parental spectra can be used to predict GCA and SCA in a sparse factorial design. Prediction approaches such as genomic selection can be game changers in hybrid breeding. They allow predicting the genetic values of hybrids without the need for their physical production.

View Article and Find Full Text PDF

Synthetic rational design of live-attenuated Zika viruses based on a computational model.

Nucleic Acids Res

January 2025

SynVaccine Ltd, Ramat Hachayal, 3 Golda Meir Street, Science Park, Nes Ziona 7403648, Israel.

Many viruses of the Flaviviridae family, including the Zika virus (ZIKV), are human pathogens of significant public health concerns. Despite extensive research, there are currently no approved vaccines available for ZIKV and specifically no live-attenuated Zika vaccine. In this current study, we suggest a novel computational algorithm for generating live-attenuated vaccines via the introduction of silent mutation into regions that undergo selection for strong or weak local RNA folding or into regions that exhibit medium levels of sequence conservation.

View Article and Find Full Text PDF

Roadmap to discovery and early development of an mRNA loaded LNP formulation for liver therapeutic genome editing.

Expert Opin Drug Deliv

January 2025

Advanced Drug Delivery, Pharmaceutical Sciences, R&D, AstraZeneca, Macclesfield, UK.

Introduction: mRNA therapeutics were a niche area in drug development before COVIDvaccines. Now they are used in vaccine development, for non-viral therapeuticgenome editing, chimericantigen receptor T  (CAR T) celltherapies and protein replacement.  mRNAis large, charged, and easily degraded by nucleases.

View Article and Find Full Text PDF

Transgene expression in stem cells is a powerful means of regulating cellular properties and differentiation into various cell types. However, existing vectors for transgene expression in stem cells suffer from limitations such as the need for genomic integration, the transient nature of gene expression, and the inability to temporally regulate transgene expression, which hinder biomedical and clinical applications. Here we report a new class of RNA virus-based vectors for scalable and integration-free transgene expression in mouse embryonic stem cells (mESCs).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!