Background: Data on the number of open reading frames (ORFs) encoded by genomes from the three domains of life reveal several notable general features. Chief among these is a fundamental difference between prokaryotes and eukaryotes: the number of ORFs grows linearly with total genome size in the former, but only logarithmically in the latter.
Results: We assume only that the (protein) coding and non-coding fractions of the genome have different dynamics, and that the non-coding fraction is particularly versatile and is therefore controlled by a variety of (unspecified) probability distribution functions (pdfs). From these assumptions alone, we predict that the number of ORFs in eukaryotes follows a Benford distribution and must therefore have a specific logarithmic form.
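As an illustration of why a quantity shaped by many different pdfs tends toward a Benford distribution, the following minimal sketch compares the standard Benford first-digit probabilities, log10(1 + 1/d), with the first digits of products of random factors drawn from a mix of distributions. The function names, the three sampling distributions, and the number of factors are all assumptions chosen for illustration; they are not the model used in this work and say nothing about real genome data.

```python
import math
import random


def benford_first_digit_probs():
    """Benford's law: P(d) = log10(1 + 1/d) for leading digit d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Leading significant digit of a positive finite float."""
    # Scientific notation always starts with the leading digit (1-9).
    return int(f"{x:.15e}"[0])


def simulated_first_digit_freqs(n_samples=20000, n_factors=8, seed=0):
    """First-digit frequencies of products of factors from mixed pdfs.

    The choice of distributions below is a hypothetical stand-in for
    'a variety of unspecified pdfs'; any reasonably broad mix would do.
    """
    rng = random.Random(seed)
    samplers = [
        lambda: rng.uniform(0.1, 10.0),
        lambda: rng.expovariate(1.0),
        lambda: rng.lognormvariate(0.0, 1.0),
    ]
    counts = {d: 0 for d in range(1, 10)}
    total = 0
    for _ in range(n_samples):
        value = 1.0
        for _ in range(n_factors):
            value *= rng.choice(samplers)()
        if value <= 0.0:
            continue  # skip the (vanishingly rare) zero draw
        counts[first_digit(value)] += 1
        total += 1
    return {d: c / total for d, c in counts.items()}


if __name__ == "__main__":
    expected = benford_first_digit_probs()
    observed = simulated_first_digit_freqs()
    for d in range(1, 10):
        print(f"d={d}  Benford={expected[d]:.4f}  simulated={observed[d]:.4f}")
```

Multiplying several independent factors spreads the logarithm of the product over many decades, which is what pushes the leading digits toward the Benford frequencies in this toy setting.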