Publications by Bhanu Rajput

A joint NCBI and EMBL-EBI transcript set for clinical genomics and research.

Joannella Morales, Shashikant Pujar, Jane E Loveland, Alex Astashyn, Ruth Bennett, Bhanu Rajput

Nature

April 2022

Comprehensive genome annotation is essential to understand the impact of clinically relevant variants. However, the absence of a standard for clinical reporting and browser display complicates the process of consistent interpretation and reporting. To address these challenges, Ensembl/GENCODE and RefSeq launched a joint initiative, the Matched Annotation from NCBI and EMBL-EBI (MANE) collaboration, to converge on human gene and transcript annotation and to jointly define a high-value set of transcripts and corresponding proteins.

RefSeq curation and annotation of stop codon recoding in vertebrates.

Bhanu Rajput, Kim D Pruitt, Terence D Murphy

Nucleic Acids Res

January 2019

Recoding of stop codons as amino acid-specifying codons is a co-translational event that enables C-terminal extension of a protein. Synthesis of selenoproteins requires recoding of internal UGA stop codons to the 21st non-standard amino acid selenocysteine (Sec) and plays a vital role in human health and disease. Separately, canonical stop codons can be recoded to specify standard amino acids in a process known as stop codon readthrough (SCR), producing extended protein isoforms with potential novel functions.

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Shashikant Pujar, Nuala A O'Leary, Catherine M Farrell, Jane E Loveland, Jonathan M Mudge, Bhanu Rajput

Nucleic Acids Res

January 2018

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID).

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.

Nuala A O'Leary, Mathew W Wright, J Rodney Brister, Stacy Ciufo, Diana Haddad, Bhanu Rajput

Nucleic Acids Res

January 2016

The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.

Mouse genome annotation by the RefSeq project.

Kelly M McGarvey, Tamara Goldfarb, Eric Cox, Catherine M Farrell, Tripti Gupta, Bhanu Rajput

Mamm Genome

October 2015

Complete and accurate annotation of the mouse genome is critical to the advancement of research conducted on this important model organism. The National Center for Biotechnology Information (NCBI) develops and maintains many useful resources to assist the mouse research community. In particular, the reference sequence (RefSeq) database provides high-quality annotation of multiple mouse genome assemblies using a combinatorial approach that leverages computation, manual curation, and collaboration.

RefSeq curation and annotation of antizyme and antizyme inhibitor genes in vertebrates.

Bhanu Rajput, Terence D Murphy, Kim D Pruitt

Nucleic Acids Res

September 2015

Polyamines are ubiquitous cations that are involved in regulating fundamental cellular processes such as cell growth and proliferation; hence, their intracellular concentration is tightly regulated. Antizyme and antizyme inhibitor have a central role in maintaining cellular polyamine levels. Antizyme is unique in that it is expressed via a novel programmed ribosomal frameshifting mechanism.

RefSeq: an update on mammalian reference sequences.

Kim D Pruitt, Garth R Brown, Susan M Hiatt, Françoise Thibaud-Nissen, Alexander Astashyn, Bhanu Rajput

Nucleic Acids Res

January 2014

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration (http://www.ncbi.nlm.

Current status and new features of the Consensus Coding Sequence database.

Catherine M Farrell, Nuala A O'Leary, Rachel A Harte, Jane E Loveland, Laurens G Wilming, Bhanu Rajput

Nucleic Acids Res

January 2014

Article Synopsis

- The Consensus Coding Sequence (CCDS) project is a collaboration between NCBI, Ensembl, and other institutions to maintain high-quality, consistently annotated datasets of protein-coding regions in human and mouse genomes, identifiable by stable CCDS IDs.
- The project undergoes continuous review to ensure accuracy and has recently updated its web and FTP sites with clearer reporting on annotation releases, improved search and display functionalities, and additional biological information.
- The article reports the current status of the CCDS dataset, recent expansions, and future curation priorities intended to improve the dataset's reliability and usefulness.

The completion of the Mammalian Gene Collection (MGC).

Genome Res

December 2009

Since its start, the Mammalian Gene Collection (MGC) has sought to provide at least one full-protein-coding sequence cDNA clone for every human and mouse gene with a RefSeq transcript, and at least 6200 rat genes. The MGC cloning effort initially relied on random expressed sequence tag screening of cDNA libraries. Here, we summarize our recent progress using directed RT-PCR cloning and DNA synthesis.

The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes.

Kim D Pruitt, Jennifer Harrow, Rachel A Harte, Craig Wallin, Mark Diekhans, Bhanu Rajput

Genome Res

July 2009

Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers.
