Large-scale clustering of CAGE tag expression data.

BMC Bioinformatics

Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, Japan.

Published: May 2007

Background: Recent analyses suggest that many genes possess multiple transcription start sites (TSSs) that are used differentially across tissues and cell lines. Using the cap analysis of gene expression (CAGE) method, we have identified a very large number of TSSs mapped onto the mouse genome. The standard hierarchical clustering algorithm, which produces easily understandable graphical tree images, has difficulty processing such large amounts of TSS data, so a better method for calculating and displaying the results is needed.

Results: To profit from the strengths of both methods, we use a combination of hierarchical and non-hierarchical clustering to cluster TSS expression profiles based on a large amount of CAGE data. We processed genome-wide expression data, including 159,075 TSSs derived from 127 RNA samples from various mouse organs, and succeeded in categorizing them into 70-100 clusters. The clusters exhibited intriguing biological features: a cluster supergroup with a ubiquitous expression profile, tissue-specific patterns, a distinct distribution of non-coding RNA, and functional TSS groups.

Conclusion: Our approach succeeded in greatly reducing the calculation cost, and is an appropriate solution for analyzing large-scale TSS usage data.
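
The abstract does not say which non-hierarchical algorithm was used or in what order the two methods were combined. As a rough illustration only, the sketch below assumes one common arrangement: k-means first reduces many expression profiles to a small set of centroids, and average-linkage hierarchical clustering is then run on those centroids alone, so the expensive pairwise step sees k points rather than every TSS. The function names, the toy data, and the choices of k-means, Euclidean distance, and average linkage are all assumptions made for this example, not the authors' method.

```python
import math
import random


def kmeans(profiles, k, iters=20, seed=0):
    """Plain Lloyd's k-means (assumed non-hierarchical step). Returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(profiles, k)]

    def nearest(p):
        return min(range(k), key=lambda c: math.dist(p, centroids[c]))

    for _ in range(iters):
        labels = [nearest(p) for p in profiles]
        for c in range(k):
            members = [p for p, lab in zip(profiles, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    labels = [nearest(p) for p in profiles]
    return centroids, labels


def agglomerate(points):
    """Average-linkage agglomerative clustering (assumed hierarchical step).

    Returns the merge history as (members_a, members_b, distance) tuples,
    which is the information needed to draw a tree.
    """
    clusters = [[i] for i in range(len(points))]
    merges = []

    def avg_dist(a, b):
        total = sum(math.dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        pairs = ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters)))
        i, j = min(pairs, key=lambda ij: avg_dist(clusters[ij[0]], clusters[ij[1]]))
        merges.append((clusters[i], clusters[j], avg_dist(clusters[i], clusters[j])))
        merged = clusters[i] + clusters[j]
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)] + [merged]
    return merges


# Hypothetical toy data: 60 profiles over 4 samples, drawn around 3 patterns.
rng = random.Random(1)
patterns = [[10, 10, 10, 10], [10, 0, 0, 0], [0, 0, 0, 10]]
profiles = [[v + rng.gauss(0, 0.5) for v in patterns[i % 3]] for i in range(60)]

centroids, labels = kmeans(profiles, k=3)
merges = agglomerate(centroids)  # tree over 3 centroids, not 60 profiles
```

In this arrangement the hierarchical step costs grow with the number of centroids rather than the number of TSSs, which is one way a combined approach could reduce calculation cost while still yielding a readable tree.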


Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1890301
http://dx.doi.org/10.1186/1471-2105-8-161


