Implications of human genome structural heterogeneity: functionally related genes tend to reside in organizationally similar genomic regions.

BMC Genomics

Department of Evolutionary and Environmental Biology and Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel.

Published: March 2014

Background: In an earlier study, we hypothesized that genomic segments with different sequence organization patterns (OPs) might display functional specificity despite their similar GC content. Here we tested this hypothesis by dividing the human genome into 100 kb segments, classifying these segments into five compositional groups according to GC content, and then characterizing each segment within the five groups by oligonucleotide counting (k-mer analysis; also referred to as compositional spectrum analysis, or CSA), to examine the distribution of sequence OPs in the segments. We performed the CSA on the entire DNA, i.e., its coding and non-coding parts the latter being much more abundant in the genome than the former.

Results: We identified 38 OP-type clusters of segments that differ in their compositional spectrum (CS) organization. Many of the segments that shared the same OP type were enriched with genes related to the same biological processes (developmental, signaling, etc.), components of biochemical complexes, or organelles. Thirteen OP-type clusters showed significant enrichment in genes connected to specific gene-ontology terms. Some of these clusters seemed to reflect certain events during periods of horizontal gene transfer and genome expansion, and subsequent evolution of genomic regions requiring coordinated regulation.

Conclusions: There may be a tendency for genes that are involved in the same biological process, complex or organelle to use the same OP, even at a distance of ~ 100 kb from the genes. Although the intergenic DNA is non-coding, the general pattern of sequence organization (e.g., reflected in over-represented oligonucleotide "words") may be important and were protected, to some extent, in the course of evolution.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234528PMC
http://dx.doi.org/10.1186/1471-2164-15-252DOI Listing

Publication Analysis

Top Keywords

human genome
8
genomic regions
8
sequence organization
8
compositional spectrum
8
op-type clusters
8
segments
6
genes
5
implications human
4
genome
4
genome structural
4

Similar Publications

Molecular Epidemiology of Type F Among Diarrheal Patients and Virulence-Resistance Dynamics - 11 Provinces, China, 2024.

China CDC Wkly

January 2025

Department of Clinical Laboratory, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou City, Zhejiang Province, China.

Introduction: Type F () represents a significant pathogen in human gastrointestinal diseases, primarily through its gene encoding enterotoxin (CPE). This investigation examined the prevalence, antimicrobial resistance patterns, and genetic characteristics of Type F within the Chinese population.

Methods: The study analyzed 2,068 stool samples collected from 11 provincial hospitals in 2024.

View Article and Find Full Text PDF

Disentangling protein metabolic costs in human cells and tissues.

PNAS Nexus

January 2025

Logic of Genomic Systems Laboratory (CNB-CSIC), Madrid E-28049, Spain.

While more data are becoming available on gene activity at different levels of biological organization, our understanding of the underlying biology remains incomplete. Here, we introduce a metabolic efficiency framework that considers highly expressed proteins (HEPs), their length, and biosynthetic costs in terms of the amino acids (AAs) they contain to address the observed balance of expression costs in cells, tissues, and cancer transformation. Notably, the combined set of HEPs in either cells or tissues shows an abundance of large and costly proteins, yet tissues compensate this with short HEPs comprised of economical AAs, indicating a stronger tendency toward mitigating costs.

View Article and Find Full Text PDF

Targeted barcoding of variable antibody domains and individual transcriptomes of the human B-cell repertoire using Link-Seq.

PNAS Nexus

January 2025

Institute of Bioengineering, School of Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland.

Here, we present Link-Seq, a highly efficient droplet microfluidic method for combined sequencing of antibody-encoding genes and the transcriptome of individual B cells at large scale. The method is based on 3' barcoding of the transcriptome and subsequent single-molecule PCR in droplets, which freely shift the barcode along specific gene regions, such as the antibody heavy- and light-chain genes. Using the immune repertoire of COVID-19 patients and healthy donors as a model system, we obtain up to 91.

View Article and Find Full Text PDF

In cutaneous melanoma, epigenetic dysregulation is implicated in drug resistance and tumor immune escape. However, the epigenetic mechanisms that influence immune escape remain poorly understood. To elucidate how epigenetic dysregulation alters the expression of surface proteins that may be involved in drug targeting and immune escape, we performed a 3-dimensional surfaceome screen in primary melanoma cultures and identified the DNA-methyltransferase inhibitor decitabine as significantly upregulating the costimulatory molecule ICAM-1.

View Article and Find Full Text PDF

Descriptions of one hundred new species of Hesperiidae.

Insecta mundi

December 2023

Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX, 75390-9050 USA.

A century and a half since the time of Hewitson, we are experiencing a renaissance in species discovery fueled by whole genome sequencing. A large-scale genomic analysis of Hesperiidae Latreille, 1809 (Lepidoptera), including primary type specimens, reveals a deluge of species new to science. One hundred of them (one in a new genus) are described here from the New World (type localities are given in parenthesis): () Grishin, (Ecuador: Napo), Grishin, (Ecuador: Esmeraldas), Grishin, (Panama: Panama), () Grishin, (Ecuador: Zamora-Chinchipe), () Grishin, (Panama: Darien), () Grishin, (Ecuador: Napo), () Grishin, (Panama: Panama), () Grishin, (Ecuador: Pastaza), Grishin, (Panama: Colón), Grishin, (Ecuador: Napo), Grishin, (Ecuador: Esmeraldas), Grishin, (Guyana), Grishin, (Peru: Cuzco), Grishin, (Peru: Cuzco), Grishin, (Panama: Panama), Grishin, (Brazil: Mato Grosso), () Grishin, (Brazil: Rondônia), () Grishin, (Panama: Panama), () Grishin, (Ecuador: Napo), Grishin, (Brazil: Rondônia), Grishin, (Peru: Loreto), Grishin, (Suriname: Para), Grishin, (Brazil: Santa Catarina), Grishin, (Panama: Panama), Grishin, (Brazil: Santa Catarina), Grishin, (Ecuador: Orellana), Grishin, (Venezuela: Aragua), Grishin, (Guatemala), Grishin, (Guyana), () Grishin, (Panama: Darien), () Grishin, (Mexico: Guerrero.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!