TranscriptDB: a transcript-centric database to study eukaryotic transcript conservation and evolution.

Nucleic Acids Res

Department of Computer Science, Faculté des sciences, Université de Sherbrooke, 2500 Boulevard de l'Université, Sherbrooke, QC J1K 2R1, Canada.

Published: January 2025

Eukaryotic genes can encode multiple distinct transcripts through the alternative splicing (AS) of genes. Interest in the AS mechanism and its evolution across different species has stimulated numerous studies, leading to several databases that provide information on AS and transcriptome data across multiple eukaryotic species. However, existing resources do not offer information on transcript conservation and evolution between genes of multiple species. Similarly to genes, identifying conserved transcripts-those from homologous genes that have retained a similar exon composition-is useful for determining transcript homology relationships, studying transcript functions and reconstructing transcript phylogenies. To address this gap, we have developed TranscriptDB, a database dedicated to studying the conservation and evolution of transcripts within gene families. TranscriptDB offers an extensive catalog of conserved transcripts and phylogenies for 317 annotated eukaryotic species, sourced from Ensembl database version 111. It serves multiple purposes, including the exploration of gene and transcript evolution. Users can access TranscriptDB through various browsing and querying tools, including a user-friendly web interface. The incorporated web servers enable users to retrieve information on transcript evolution using their own data as input. Additionally, a REST application programming interface is available for programmatic data retrieval. A data directory is also available for bulk downloads. TranscriptDB and its resources are freely accessible at https://transcriptdb.cobius.usherbrooke.ca.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11701637PMC
http://dx.doi.org/10.1093/nar/gkae995DOI Listing

Publication Analysis

Top Keywords

conservation evolution
12
transcript conservation
8
eukaryotic species
8
transcript evolution
8
transcript
7
evolution
6
transcriptdb
5
genes
5
transcriptdb transcript-centric
4
transcript-centric database
4

Similar Publications

Background: Drought stress is a significant global challenge that negatively impacts cotton fiber yield and quality. Although many drought-stress responsive genes have been identified in cotton species (Gossypium spp.), the diversity of drought response mechanisms across cotton species remains largely unexplored.

View Article and Find Full Text PDF

Complete datasets of genetic variants are key to biodiversity genomic studies. Long-read sequencing technologies allow the routine assembly of highly contiguous, haplotype-resolved reference genomes. However, even when complete, reference genomes from a single individual may bias downstream analyses and fail to adequately represent genetic diversity within a population or species.

View Article and Find Full Text PDF

Freshwater ecosystems are highly biodiverse and important for livelihoods and economic development, but are under substantial stress. To date, comprehensive global assessments of extinction risk have not included any speciose groups primarily living in freshwaters. Consequently, data from predominantly terrestrial tetrapods are used to guide environmental policy and conservation prioritization, whereas recent proposals for target setting in freshwaters use abiotic factors.

View Article and Find Full Text PDF

Satellite DNA shapes dictate pericentromere packaging in female meiosis.

Nature

January 2025

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

The abundance and sequence of satellite DNA at and around centromeres is evolving rapidly despite the highly conserved and essential process through which the centromere directs chromosome inheritance. The impact of such rapid evolution is unclear. Here we find that sequence-dependent DNA shape dictates packaging of pericentromeric satellites in female meiosis through a conserved DNA-shape-recognizing chromatin architectural protein, high mobility group AT-hook 1 (HMGA1).

View Article and Find Full Text PDF

Breast cancer is a highly heterogeneous disease whose prognosis and treatment as defined by the expression of three receptors-oestrogen receptor (ER), progesterone receptor and human epidermal growth factor receptor 2 (HER2; encoded by ERBB2)-is insufficient to capture the full spectrum of clinical outcomes and therapeutic vulnerabilities. Previously, we demonstrated that transcriptional and genomic profiles define eleven integrative subtypes with distinct clinical outcomes, including four ER subtypes with increased risk of relapse decades after diagnosis. Here, to determine whether these subtypes reflect distinct evolutionary histories, interactions with the immune system and pathway dependencies, we established a meta-cohort of 1,828 breast tumours spanning pre-invasive, primary invasive and metastatic disease with whole-genome and transcriptome sequencing.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!