Gene-Family Extension Measures and Correlations.

Life (Basel)

Department of Evolutionary and Environmental Biology, University of Haifa, Haifa 3498838, Israel.

Published: August 2016

The existence of multiple copies of genes is a well-known phenomenon. A gene family is a set of sufficiently similar genes, formed by gene duplication. In earlier works conducted on a limited number of completely sequenced and annotated genomes it was found that size of gene family and size of genome are positively correlated. Additionally, it was found that several atypical microbes deviated from the observed general trend. In this study, we reexamined these associations on a larger dataset consisting of 1484 prokaryotic genomes and using several ranking approaches. We applied ranking methods in such a way that genomes with lower numbers of gene copies would have lower rank. Until now only simple ranking methods were used; we applied the Kemeny optimal aggregation approach as well. Regression and correlation analysis were utilized in order to accurately quantify and characterize the relationships between measures of paralog indices and genome size. In addition, boxplot analysis was employed as a method for outlier detection. We found that, in general, all paralog indexes positively correlate with an increase of genome size. As expected, different groups of atypical prokaryotic genomes were found for different types of paralog quantities. Mycoplasmataceae and Halobacteria appeared to be among the most interesting candidates for further research of evolution through gene duplication.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5041006PMC
http://dx.doi.org/10.3390/life6030030DOI Listing

Publication Analysis

Top Keywords

gene family
8
gene duplication
8
prokaryotic genomes
8
ranking methods
8
genome size
8
gene
5
gene-family extension
4
extension measures
4
measures correlations
4
correlations existence
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!