Publications by T Goldfarb

Publications by authors named "T Goldfarb"

Page 1 of 6

NCBI RefSeq: reference sequence standards through 25 years of curation and annotation.

Tamara Goldfarb Vamsi K Kodali Shashikant Pujar Vyacheslav Brover Barbara Robbertse

Nucleic Acids Res

January 2025

Reference sequences and annotations serve as the foundation for many lines of research today, from organism and sequence identification to providing a core description of the genes, transcripts and proteins found in an organism's genome. Interpretation of data including transcriptomics, proteomics, sequence variation and comparative analyses based on reference gene annotations informs our understanding of gene function and possible disease mechanisms, leading to new biomedical discoveries. The Reference Sequence (RefSeq) resource created at the National Center for Biotechnology Information (NCBI) leverages both automatic processes and expert curation to create a robust set of reference sequences of genomic, transcript and protein data spanning the tree of life.

View Article and Find Full Text PDF

A joint NCBI and EMBL-EBI transcript set for clinical genomics and research.

Joannella Morales Shashikant Pujar Jane E Loveland Alex Astashyn Ruth Bennett Tamara Goldfarb

Nature

April 2022

Comprehensive genome annotation is essential to understand the impact of clinically relevant variants. However, the absence of a standard for clinical reporting and browser display complicates the process of consistent interpretation and reporting. To address these challenges, Ensembl/GENCODE and RefSeq launched a joint initiative, the Matched Annotation from NCBI and EMBL-EBI (MANE) collaboration, to converge on human gene and transcript annotation and to jointly define a high-value set of transcripts and corresponding proteins.

View Article and Find Full Text PDF

RefSeq Functional Elements as experimentally assayed nongenic reference standards and functional interactions in human and mouse.

Catherine M Farrell Tamara Goldfarb Sanjida H Rangwala Alexander Astashyn Olga D Ermolaeva

Genome Res

January 2022

Eukaryotic genomes contain many nongenic elements that function in gene regulation, chromosome organization, recombination, repair, or replication, and mutation of those elements can affect genome function and cause disease. Although numerous epigenomic studies provide high coverage of gene regulatory regions, those data are not usually exposed in traditional genome annotation and can be difficult to access and interpret without field-specific expertise. The National Center for Biotechnology Information (NCBI) therefore provides RefSeq Functional Elements (RefSeqFEs), which represent experimentally validated human and mouse nongenic elements derived from the literature.

View Article and Find Full Text PDF

Estimating the Attributable Cost of Physician Burnout in the United States.

Donald E Girard David A Nardone David H Hickam Timothy Goldfarb

Ann Intern Med

October 2019

View Article and Find Full Text PDF

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Shashikant Pujar Nuala A O'Leary Catherine M Farrell Jane E Loveland Jonathan M Mudge Tamara Goldfarb

Nucleic Acids Res

January 2018

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID).

View Article and Find Full Text PDF