Publications by authors named "Mike P Kay"

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID).

View Article and Find Full Text PDF
Article Synopsis
  • - The Consensus Coding Sequence (CCDS) project is a collaboration between NCBI, Ensembl, and other institutions to maintain high-quality, consistently annotated datasets of protein-coding regions in human and mouse genomes, identifiable by stable CCDS IDs.
  • - The project undergoes continuous review to ensure accuracy and has recently updated its web and FTP sites with clearer reporting on annotation releases, improved search and display functionalities, and additional biological information.
  • - The document highlights the current status of the CCDS dataset, recent expansions, and plans for future curation priorities to enhance the dataset's reliability and usefulness.
View Article and Find Full Text PDF

The African trypanosome, Trypanosoma brucei, causes sleeping sickness in humans in sub-Saharan Africa. Here we report the sequence and analysis of the 1.1 Mb chromosome I, which encodes approximately 400 predicted genes organised into directional clusters, of which more than 100 are located in the largest cluster of 250 kb.

View Article and Find Full Text PDF