Current studies of gene × air pollution interaction typically seek to identify unknown heritability of common complex illnesses arising from variability in the host's susceptibility to environmental pollutants of interest. Accordingly, a single component generalized linear models are often used to model the risk posed by an environmental exposure variable of interest in relation to a priori determined DNA variants. However, reducing the phenotypic heterogeneity may further optimize such approach, primarily represented by the modeled DNA variants. Here, we reduce phenotypic heterogeneity of asthma severity, and also identify single nucleotide polymorphisms (SNP) associated with phenotype subgroups. Specifically, we first apply an unsupervised learning algorithm method and a non-parametric regression to find a biclustering structure of children according to their allergy and asthma severity. We then identify a set of SNPs most closely correlated with each sub-group. We subsequently fit a logistic regression model for each group against the healthy controls using benzo[]pyrene (B[]P) as a representative airborne carcinogen. Application of such approach in a case-control data set shows that SNP clustering may help to partly explain heterogeneity in children's asthma susceptibility in relation to ambient B[]P concentration with greater efficiency.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5800205PMC
http://dx.doi.org/10.3390/ijerph15010106DOI Listing

Publication Analysis

Top Keywords

unsupervised learning
8
learning algorithm
8
dna variants
8
phenotypic heterogeneity
8
asthma severity
8
severity identify
8
modeling unobserved
4
heterogeneity
4
unobserved heterogeneity
4
heterogeneity susceptibility
4

Similar Publications

Prediction and unsupervised clustering of fertility intention among migrant workers based on machine learning: a cross-sectional survey from Henan, China.

BMC Public Health

January 2025

Department of Health Management of Public Health, College of Public Health, Zhengzhou University, 100 Kexue Road, Gaoxin district, Zhengzhou, Henan, 450001, China.

Background: Although China has implemented multiple policies to encourage childbirth, the results have been underwhelming. Migrant workers account for a considerable proportion of China's population, most of whom are of childbearing age. However, few articles focus on their fertility intentions.

View Article and Find Full Text PDF

Geometric morphometrics is used in the biological sciences to quantify morphological traits. However, the need for manual landmark placement hampers scalability, which is both time-consuming, labor-intensive, and open to human error. The selected landmarks embody a specific hypothesis regarding the critical geometry relevant to the biological question.

View Article and Find Full Text PDF

Multimodal hierarchical classification of CITE-seq data delineates immune cell states across lineages and tissues.

Cell Rep Methods

January 2025

Department of Systems Biology, Columbia University Irving Medical Center, New York, NY 10032, USA; Department of Biochemistry and Molecular Biophysics, Columbia University Irving Medical Center, New York, NY 10032, USA. Electronic address:

Single-cell RNA sequencing (scRNA-seq) is invaluable for profiling cellular heterogeneity and transcriptional states, but transcriptomic profiles do not always delineate subsets defined by surface proteins. Cellular indexing of transcriptomes and epitopes (CITE-seq) enables simultaneous profiling of single-cell transcriptomes and surface proteomes; however, accurate cell-type annotation requires a classifier that integrates multimodal data. Here, we describe multimodal classifier hierarchy (MMoCHi), a marker-based approach for accurate cell-type classification across multiple single-cell modalities that does not rely on reference atlases.

View Article and Find Full Text PDF

Dual-view global and local category-attentive domain alignment for unsupervised conditional adversarial domain adaptation.

Neural Netw

January 2025

School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China. Electronic address:

Conditional adversarial domain adaptation (CADA) is one of the most commonly used unsupervised domain adaptation (UDA) methods. CADA introduces multimodal information to the adversarial learning process to align the distributions of the labeled source domain and unlabeled target domain with mode match. However, CADA provides wrong multimodal information for challenging target features due to utilizing classifier predictions as the multimodal information, leading to distribution mismatch and less robust domain-invariant features.

View Article and Find Full Text PDF

GMmorph: dynamic spatial matching registration model for 3D medical image based on gated Mamba.

Phys Med Biol

January 2025

School of Software Engineering, Xi'an Jiaotong University, Xi 'an Jiaotong University Innovation Port, Xi 'an, Shaanxi Province, Xi'an, Shaanxi, 710049, CHINA.

Deformable registration aims to achieve nonlinear alignment of image space by estimating a dense displacement field. It is commonly used as a preprocessing step in clinical and image analysis applications, such as surgical planning, diagnostic assistance, and surgical navigation. We aim to overcome these challenges: Deep learning-based registration methods often struggle with complex displacements and lack effective interaction between global and local feature information.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!