CHESS 3 represents an improved human gene catalog based on nearly 10,000 RNA-seq experiments across 54 body sites. It significantly improves current genome annotation by integrating the latest reference data and algorithms, machine learning techniques for noise filtering, and new protein structure prediction methods. CHESS 3 contains 41,356 genes, including 19,839 protein-coding genes and 158,377 transcripts, with 14,863 protein-coding transcripts not in other catalogs. It includes all MANE transcripts and at least one transcript for most RefSeq and GENCODE genes. On the CHM13 human genome, the CHESS 3 catalog contains an additional 129 protein-coding genes. CHESS 3 is available at http://ccb.jhu.edu/chess .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10614308PMC
http://dx.doi.org/10.1186/s13059-023-03088-4DOI Listing

Publication Analysis

Top Keywords

protein structure
8
protein-coding genes
8
chess
5
genes
5
chess improved
4
improved comprehensive
4
comprehensive catalog
4
catalog human
4
human genes
4
transcripts
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!