Segmenting the Human Genome into Isochores.

Evol Bioinform Online

National Research Council, Institute for Biomedical Technologies, Segrate, Milan, Italy. ; Science Department, Rome 3 University, Rome, Italy.

Published: December 2015

The human genome is a mosaic of isochores, which are long (>200 kb) DNA sequences that are fairly homogeneous in base composition and can be assigned to five families comprising 33%-59% of GC composition. Although the compartmentalized organization of the mammalian genome has been investigated for more than 40 years, no satisfactory automatic procedure for segmenting the genome into isochores is available so far. We present a critical discussion of the currently available methods and a new approach called isoSegmenter which allows segmenting the genome into isochores in a fast and completely automatic manner. This approach relies on two types of experimentally defined parameters, the compositional boundaries of isochore families and an optimal window size of 100 kb. The approach represents an improvement over the existing methods, is ideally suited for investigating long-range features of sequenced and assembled genomes, and is publicly available at https://github.com/bunop/isoSegmenter.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4662427PMC
http://dx.doi.org/10.4137/EBO.S27693DOI Listing

Publication Analysis

Top Keywords

genome isochores
12
human genome
8
segmenting genome
8
genome
5
segmenting human
4
isochores
4
isochores human
4
genome mosaic
4
mosaic isochores
4
isochores long
4

Similar Publications

Background: Genomes are inherently inhomogeneous, with features such as base composition, recombination, gene density, and gene expression varying along chromosomes. Evolutionary, biological, and biomedical analyses aim to quantify this variation, account for it during inference procedures, and ultimately determine the causal processes behind it. Since sequential observations along chromosomes are not independent, it is unsurprising that autocorrelation patterns have been observed e.

View Article and Find Full Text PDF

Abandoning the Isochore Theory Can Help Explain Genome Compositional Organization in Fish.

Int J Mol Sci

August 2023

Department of Computer Science, Faculty of Science, University of South Bohemia, Branišovská 1760, 370-05 České Budějovice, Czech Republic.

The organization of the genome nucleotide (AT/GC) composition in vertebrates remains poorly understood despite the numerous genome assemblies available. Particularly, the origin of the AT/GC heterogeneity in amniotes, in comparison to the homogeneity in anamniotes, is controversial. Recently, several exceptions to this dichotomy were confirmed in an ancient fish lineage with mammalian AT/GC heterogeneity.

View Article and Find Full Text PDF

Background: The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms-as diverse as bats, dolphins, and elephants-in only 10-20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these "outward-looking" families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression.

View Article and Find Full Text PDF

Compositional Structure of the Genome: A Review.

Biology (Basel)

June 2023

Department of Genetics, Faculty of Sciences, 18071 and Laboratory of Bioinformatics, Institute of Biotechnology, Center of Biomedical Research, University of Granada, 18100 Granada, Spain.

As the genome carries the historical information of a species' biotic and environmental interactions, analyzing changes in genome structure over time by using powerful statistical physics methods (such as entropic segmentation algorithms, fluctuation analysis in DNA walks, or measures of compositional complexity) provides valuable insights into genome evolution. Nucleotide frequencies tend to vary along the DNA chain, resulting in a hierarchically patchy chromosome structure with heterogeneities at different length scales that range from a few nucleotides to tens of millions of them. Fluctuation analysis reveals that these compositional structures can be classified into three main categories: (1) short-range heterogeneities (below a few kilobase pairs (Kbp)) primarily attributed to the alternation of coding and noncoding regions, interspersed or tandem repeats densities, etc.

View Article and Find Full Text PDF

Thermodynamics of liquid water in terms of a non-standard approach-the ion-molecular model-is considered. Water is represented as a dense gas of neutral HO molecules and single charged HO and OH ions. The molecules and ions perform thermal collisional motion and interconvert due to ion exchange.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!