Unlabelled: Public genomic databases, which are often used to guide genetic studies of human disease, are now being applied to genomic medicine through in silico integrative genomics. These databases, however, often lack tools for systematically determining the experimental origins of the data.
Results: We introduce a new data provenance model that we have implemented in a public web application, BioQ, for assessing the reliability of the data by systematically tracing its experimental origins to the original subjects and biologics.
Genome-wide association studies often incorporate information from public biological databases in order to provide a biological reference for interpreting the results. The dbSNP database is an extensive source of information on single nucleotide polymorphisms (SNPs) for many different organisms, including humans. We have developed free software that will download and install a local MySQL implementation of the dbSNP relational database for a specified organism.
View Article and Find Full Text PDFEnvironmental samples have been analysed for viruses in metagenomic studies, but these studies have not linked individual viruses to their hosts. We designed a strategy to isolate double-stranded RNA, a hallmark of RNA virus infection, from individual plants and convert this to cDNA with a unique four nucleotide Tag at each end. Using 96 different Tags allowed us to pool samples and still retain the link to the original sample.
View Article and Find Full Text PDF