Strategies for identifying statistically significant dense regions in microarray data.

IEEE/ACM Trans Comput Biol Bioinform

Department of Mathematics, National University of Singapore, 2, Science Drive 2, Singapore 117543, Singapore.

Published: October 2007

We propose and study the notion of dense regions for the analysis of categorized gene expression data, and we present search algorithms for discovering them. The algorithms can be applied to any categorical data matrix derived from a gene expression level matrix. We demonstrate that dense regions are simple yet useful and statistically significant patterns that can be used to 1) identify genes and/or samples of interest and 2) eliminate genes and/or samples corresponding to outliers, noise, or abnormalities. Theoretical results on the properties of dense regions allow us to classify them into several classes and to derive tailor-made algorithms for each class. Moreover, we carry out an empirical simulation study of the distribution of dense-region sizes, and we use its results to assess the significance of dense regions and to derive effective pruning methods that speed up the search algorithms. We test our methods on real microarray data sets. Comparisons with six other well-known clustering algorithms on synthetic and real data confirm the superiority of our methods in discovering dense regions. The DRIFT code and a tutorial are available as supplemental material on the Computer Society Digital Library at http://computer.org/tcbb/archives.htm.
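To make the idea concrete, the following is a minimal illustrative sketch and not the DRIFT implementation. It rests on assumptions that the abstract does not state: expression values are categorized into -1, 0, or 1 by two hypothetical cutoffs, the density of a region is taken to be the fraction of its entries equal to a chosen category, and significance is estimated with a plain permutation test on one fixed candidate region rather than with the simulated size distribution the abstract describes. All function names and thresholds are hypothetical.

```python
import random

def categorize(matrix, low, high):
    """Map each value to -1 (below low), 1 (above high), or 0 (otherwise).
    The cutoffs are hypothetical; the abstract does not specify a scheme."""
    return [[-1 if v < low else 1 if v > high else 0 for v in row]
            for row in matrix]

def density(cat, rows, cols, value):
    """Fraction of entries in the submatrix rows x cols that equal `value`
    (an assumed working definition of how dense a region is)."""
    hits = sum(1 for r in rows for c in cols if cat[r][c] == value)
    return hits / (len(rows) * len(cols))

def permutation_p_value(cat, rows, cols, value, n_shuffles=1000, seed=0):
    """Estimate how often a random rearrangement of all matrix entries makes
    the same region at least as dense as observed (a swapped-in permutation
    test, not the paper's procedure)."""
    rng = random.Random(seed)
    n_cols = len(cat[0])
    flat = [x for row in cat for x in row]
    observed = density(cat, rows, cols, value)
    at_least = 0
    for _ in range(n_shuffles):
        rng.shuffle(flat)
        shuffled = [flat[i:i + n_cols] for i in range(0, len(flat), n_cols)]
        if density(shuffled, rows, cols, value) >= observed:
            at_least += 1
    # Add-one correction keeps the estimate strictly positive.
    return (at_least + 1) / (n_shuffles + 1)

# Made-up toy data: rows are genes, columns are samples.
expr = [
    [2.3, 1.9, 0.1, -0.2],
    [2.1, 2.8, 0.0, 0.3],
    [0.2, -0.1, -1.7, 0.4],
    [0.0, 0.1, 0.2, -2.2],
]
cat = categorize(expr, low=-1.0, high=1.0)
region_density = density(cat, rows=[0, 1], cols=[0, 1], value=1)
p_value = permutation_p_value(cat, rows=[0, 1], cols=[0, 1], value=1)
print(cat, region_density, p_value)
```

In this toy matrix the top-left 2 x 2 block consists entirely of the category 1, so its density is 1.0 and random rearrangements rarely reproduce it. A real search would also have to enumerate candidate regions, which is the part the paper's tailor-made algorithms and pruning methods address.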

Source
http://dx.doi.org/10.1109/TCBB.2007.1022
