Birth and death of protein domains: a simple model of evolution explains power law behavior.

BMC Evol Biol

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.

Published: October 2002

Background: Power distributions appear in numerous biological, physical and other contexts, which appear to be fundamentally different. In biology, power laws have been claimed to describe the distributions of the connections of enzymes and metabolites in metabolic networks, the number of interactions partners of a given protein, the number of members in paralogous families, and other quantities. In network analysis, power laws imply evolution of the network with preferential attachment, i.e. a greater likelihood of nodes being added to pre-existing hubs. Exploration of different types of evolutionary models in an attempt to determine which of them lead to power law distributions has the potential of revealing non-trivial aspects of genome evolution.

Results: A simple model of evolution of the domain composition of proteomes was developed, with the following elementary processes: i) domain birth (duplication with divergence), ii) death (inactivation and/or deletion), and iii) innovation (emergence from non-coding or non-globular sequences or acquisition via horizontal gene transfer). This formalism can be described as a birth, death and innovation model (BDIM). The formulas for equilibrium frequencies of domain families of different size and the total number of families at equilibrium are derived for a general BDIM. All asymptotics of equilibrium frequencies of domain families possible for the given type of models are found and their appearance depending on model parameters is investigated. It is proved that the power law asymptotics appears if, and only if, the model is balanced, i.e. domain duplication and deletion rates are asymptotically equal up to the second order. It is further proved that any power asymptotic with the degree not equal to -1 can appear only if the hypothesis of independence of the duplication/deletion rates on the size of a domain family is rejected. Specific cases of BDIMs, namely simple, linear, polynomial and rational models, are considered in details and the distributions of the equilibrium frequencies of domain families of different size are determined for each case. We apply the BDIM formalism to the analysis of the domain family size distributions in prokaryotic and eukaryotic proteomes and show an excellent fit between these empirical data and a particular form of the model, the second-order balanced linear BDIM. Calculation of the parameters of these models suggests surprisingly high innovation rates, comparable to the total domain birth (duplication) and elimination rates, particularly for prokaryotic genomes.

Conclusions: We show that a straightforward model of genome evolution, which does not explicitly include selection, is sufficient to explain the observed distributions of domain family sizes, in which power laws appear as asymptotic. However, for the model to be compatible with the data, there has to be a precise balance between domain birth, death and innovation rates, and this is likely to be maintained by selection. The developed approach is oriented at a mathematical description of evolution of domain composition of proteomes, but a simple reformulation could be applied to models of other evolving networks with preferential attachment.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC137606PMC
http://dx.doi.org/10.1186/1471-2148-2-18DOI Listing

Publication Analysis

Top Keywords

birth death
12
power law
12
power laws
12
domain
12
domain birth
12
equilibrium frequencies
12
frequencies domain
12
domain families
12
domain family
12
model
8

Similar Publications

Background: Over the past few decades, China has experienced significant demographic and epidemiological changes. The sharp decline in fertility and mortality rates has accelerated population aging, contributing to an increase in the prevalence of chronic diseases. The nutritional condition during early life is associated with the onset of chronic illnesses later in adulthood.

View Article and Find Full Text PDF

Background: Tuberculosis (TB) is one of the oldest infectious diseases and continues to be a major killer of human beings. This paper was designed to provide insights into the disease burden of TB.

Methods: The data was retrieved and downloaded from the latest GBD database.

View Article and Find Full Text PDF

Introduction: Malnutrition contributes to approximately 45% of deaths among under 5 years children in low and middle-income countries. Poor maternal knowledge and failure to comply with recommended Infant and Young Child Feeding (IYCF) practices are known risk factors for malnutrition but there are inconsistencies in the literature. Therefore, this cross-sectional study of 100 mother-child pairs in district Gujranwala aimed to assess maternal nutritional literacy (MNL) and maternal feeding practices (MFP) and their ultimate impacts on child growth.

View Article and Find Full Text PDF

Background: The centralization of childbirth and newborn care in large maternity units has become increasingly prevalent in Europe. While this trend offers potential benefits such as specialized care and improved outcomes, it can also lead to longer travel and waiting times, especially for women in rural areas.

Objective: This study aimed to evaluate the association between hospital maternity unit (HMU) volumes, road travel distance (RTD) to the hospital, and other neonatal outcomes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!