Publications by authors named "Tanya Barrett"

NCBI GEO: archive for gene expression and epigenomics data sets: 23-year update.

Emily Clough, Tanya Barrett, Stephen E Wilhite, Pierre Ledoux, Carlos Evangelista

Nucleic Acids Res

January 2024

The Gene Expression Omnibus (GEO) is an international public repository that archives gene expression and epigenomics data sets generated by next-generation sequencing and microarray technologies. Data are typically submitted to GEO by researchers in compliance with widespread journal and funder mandates to make generated data publicly accessible. The resource handles raw data files, processed data files and descriptive metadata for over 200 000 studies and 6.

Future-proofing and maximizing the utility of metadata: The PHA4GE SARS-CoV-2 contextual data specification package.

Emma J Griffiths, Ruth E Timme, Catarina Inês Mendes, Andrew J Page, Nabil-Fareed Alikhan, Tanya Barrett

Gigascience

February 2022

Background: The Public Health Alliance for Genomic Epidemiology (PHA4GE) (https://pha4ge.org) is a global coalition that is actively working to establish consensus standards, document and share best practices, improve the availability of critical bioinformatics tools and resources, and advocate for greater openness, interoperability, accessibility, and reproducibility in public health microbial bioinformatics. In the face of the current pandemic, PHA4GE has identified a need for a fit-for-purpose, open-source SARS-CoV-2 contextual data standard.

Cell Lines as Biological Models: Practical Steps for More Reliable Research.

Amanda Capes-Davis, Amos Bairoch, Tanya Barrett, Edward C Burnett, Wilhelm G Dirks

Chem Res Toxicol

September 2019

Research in toxicology relies on models such as cell lines. These living models are prone to change and may be described in publications with insufficient information or quality control testing. This article sets out recommendations to improve the reliability of cell-based research.

The Gene Expression Omnibus Database.

Emily Clough, Tanya Barrett

Methods Mol Biol

December 2016

The Gene Expression Omnibus (GEO) database is an international public repository that archives and freely distributes high-throughput gene expression and other functional genomics data sets. Created in 2000 as a worldwide resource for gene expression studies, GEO has evolved with rapidly changing technologies and now accepts high-throughput data for many other data applications, including those that examine genome methylation, chromatin structure, and genome-protein interactions. GEO supports community-derived reporting standards that specify provision of several critical study elements including raw data, processed data, and descriptive metadata.

Toward richer metadata for microbial sequences: replacing strain-level NCBI taxonomy taxids with BioProject, BioSample and Assembly records.

Scott Federhen, Karen Clark, Tanya Barrett, Helen Parkinson, James Ostell

Stand Genomic Sci

June 2014

Microbial genome sequence submissions to the International Nucleotide Sequence Database Collaboration (INSDC) have been annotated with organism names that include the strain identifier. Each of these strain-level names has been assigned a unique 'taxid' in the NCBI Taxonomy Database. With the significant growth in genome sequencing, it is not possible to continue with the curation of strain-level taxids.

Standardized metadata for human pathogen/vector genomic sequences.

Vivien G Dugan, Scott J Emrich, Gloria I Giraldo-Calderón, Omar S Harb, Ruchi M Newman, Tanya Barrett

PLoS One

February 2015

High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats.

Beware imposters: MA-1, a novel MALT lymphoma cell line, is misidentified and corresponds to Pfeiffer, a diffuse large B-cell lymphoma cell line.

Amanda Capes-Davis, Christine Alston-Roberts, Liz Kerrigan, Yvonne A Reid, Tanya Barrett

Genes Chromosomes Cancer

October 2013

NCBI GEO: archive for functional genomics data sets--update.

Tanya Barrett, Stephen E Wilhite, Pierre Ledoux, Carlos Evangelista, Irene F Kim

Nucleic Acids Res

January 2013

The Gene Expression Omnibus (GEO, http://www.ncbi.nlm.

Database resources of the National Center for Biotechnology Information.

Eric W Sayers, Tanya Barrett, Dennis A Benson, Evan Bolton, Stephen H Bryant

Nucleic Acids Res

January 2012

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets.

BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata.

Tanya Barrett, Karen Clark, Robert Gevorgyan, Vyacheslav Gorelenkov, Eugene Gribov

Nucleic Acids Res

January 2012

As the volume and complexity of data sets archived at NCBI grow rapidly, so does the need to gather and organize the associated metadata. Although metadata has been collected for some archival databases, previously, there was no centralized approach at NCBI for collecting this information and using it across databases. The BioProject database was recently established to facilitate organization and classification of project data submitted to NCBI, EBI and DDBJ databases.

Strategies to explore functional genomics data sets in NCBI's GEO database.

Stephen E Wilhite, Tanya Barrett

Methods Mol Biol

April 2012

The Gene Expression Omnibus (GEO) database is a major repository that stores high-throughput functional genomics data sets that are generated using both microarray-based and sequence-based technologies. Data sets are submitted to GEO primarily by researchers who are publishing their results in journals that require original data to be made freely available for review and analysis. In addition to serving as a public archive for these data, GEO has a suite of tools that allow users to identify, analyze, and visualize data relevant to their specific interests.

NCBI GEO: archive for functional genomics data sets--10 years on.

Tanya Barrett, Dennis B Troup, Stephen E Wilhite, Pierre Ledoux, Carlos Evangelista

Nucleic Acids Res

January 2011

A decade ago, the Gene Expression Omnibus (GEO) database was established at the National Center for Biotechnology Information (NCBI). The original objective of GEO was to serve as a public repository for high-throughput gene expression data generated mostly by microarray technology. However, the research community quickly applied microarrays to non-gene-expression studies, including examination of genome copy number variation and genome-wide profiling of DNA-binding proteins.

Database resources of the National Center for Biotechnology Information.

Eric W Sayers, Tanya Barrett, Dennis A Benson, Evan Bolton, Stephen H Bryant

Nucleic Acids Res

January 2011

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets.

NCBI Peptidome: a new repository for mass spectrometry proteomics data.

Li Ji, Tanya Barrett, Oluwabukunmi Ayanbule, Dennis B Troup, Dmitry Rudnev

Nucleic Acids Res

January 2010

Peptidome is a public repository that archives and freely distributes tandem mass spectrometry peptide and protein identification data generated by the scientific community. Data from all stages of a mass spectrometry experiment are captured, including original mass spectra files, experimental metadata and conclusion-level results. The submission process is facilitated through acceptance of data in commonly used open formats, and all submissions undergo syntactic validation and curation in an effort to uphold data integrity and quality.

Database resources of the National Center for Biotechnology Information.

Eric W Sayers, Tanya Barrett, Dennis A Benson, Evan Bolton, Stephen H Bryant

Nucleic Acids Res

January 2010

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, Reference Sequence, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Peptidome, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets.

NCBI Peptidome: a new public repository for mass spectrometry peptide identifications.

Douglas J Slotta, Tanya Barrett, Ron Edgar

Nat Biotechnol

July 2009

Database resources of the National Center for Biotechnology Information.

Eric W Sayers, Tanya Barrett, Dennis A Benson, Stephen H Bryant, Kathi Canese

Nucleic Acids Res

January 2009

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the web applications is custom implementation of the BLAST program optimized to search specialized data sets.

NCBI GEO: archive for high-throughput functional genomic data.

Tanya Barrett, Dennis B Troup, Stephen E Wilhite, Pierre Ledoux, Dmitry Rudnev

Nucleic Acids Res

January 2009

The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing.

Reannotation of array probes at NCBI's GEO database.

Tanya Barrett, Ron Edgar

Nat Methods

February 2008

Database resources of the National Center for Biotechnology Information.

David L Wheeler, Tanya Barrett, Dennis A Benson, Stephen H Bryant, Kathi Canese

Nucleic Acids Res

January 2008

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data available through NCBI's web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace, Assembly, and Short Read Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Database of Genotype and Phenotype, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting the web applications are custom implementations of the BLAST program optimized to search specialized data sets.

Database resources of the National Center for Biotechnology Information.

David L Wheeler, Tanya Barrett, Dennis A Benson, Stephen H Bryant, Kathi Canese

Nucleic Acids Res

January 2007

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace and Assembly Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Viral Genotyping Tools, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets.

NCBI GEO standards and services for microarray data.

Ron Edgar, Tanya Barrett

Nat Biotechnol

December 2006

NCBI GEO: mining tens of millions of expression profiles--database and tools update.

Tanya Barrett, Dennis B Troup, Stephen E Wilhite, Pierre Ledoux, Dmitry Rudnev

Nucleic Acids Res

January 2007

The Gene Expression Omnibus (GEO) repository at the National Center for Biotechnology Information (NCBI) archives and freely disseminates microarray and other forms of high-throughput data generated by the scientific community. The database has a minimum information about a microarray experiment (MIAME)-compliant infrastructure that captures fully annotated raw and processed data. Several data deposit options and formats are supported, including web forms, spreadsheets, XML and Simple Omnibus Format in Text (SOFT).

Gene expression omnibus: microarray data storage, submission, retrieval, and analysis.

Tanya Barrett, Ron Edgar

Methods Enzymol

December 2006

The Gene Expression Omnibus (GEO) repository at the National Center for Biotechnology Information archives and freely distributes high-throughput molecular abundance data, predominantly gene expression data generated by DNA microarray technology. The database has a flexible design that can handle diverse styles of both unprocessed and processed data in a Minimum Information About a Microarray Experiment-supportive infrastructure that promotes fully annotated submissions. GEO currently stores about a billion individual gene expression measurements, derived from over 100 organisms, submitted by over 1500 laboratories, addressing a wide range of biological phenomena.

Mining microarray data at NCBI's Gene Expression Omnibus (GEO).

Tanya Barrett, Ron Edgar

Methods Mol Biol

September 2006

The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets.
