Markov models of codon substitution are powerful inferential tools for studying biological processes such as natural selection and preferences in amino acid substitution. The equilibrium character distributions of these models are almost always estimated using nucleotide frequencies observed in a sequence alignment, primarily as a matter of historical convention. In this note, we demonstrate that a popular class of such estimators is biased, and that this bias has an adverse effect on goodness of fit and estimates of substitution rates. We propose a "corrected" empirical estimator that begins with observed nucleotide counts, but accounts for the nucleotide composition of stop codons. We show via simulation that the corrected estimates outperform the de facto standard estimates not just by providing better estimates of the frequencies themselves, but also by leading to improved estimation of other parameters in the evolutionary models. On a curated collection of sequence alignments, our estimators show a significant improvement in goodness of fit compared to the de facto standard approach. Maximum likelihood estimation of the frequency parameters appears to be warranted in many cases, albeit at a greater computational cost. Our results demonstrate that there is little justification, either statistical or computational, for continued use of the de facto standard estimators.
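To illustrate why stop codons matter, the following minimal sketch builds codon frequencies as products of nucleotide frequencies, renormalized over sense codons, and then recovers the nucleotide composition those codon frequencies imply. It rests on several assumptions that are not in the abstract: that the "popular class" of estimators takes this product form, that a single nucleotide frequency vector is shared by all three codon positions, that the universal genetic code applies, and that the `observed` values are purely hypothetical. It is not the authors' corrected estimator.

```python
from itertools import product

NUCS = "TCAG"
STOPS = {"TAA", "TAG", "TGA"}  # assumption: universal genetic code


def product_codon_freqs(nuc_freqs):
    """Codon frequencies proportional to the product of nucleotide
    frequencies, renormalized over the sense (non-stop) codons."""
    raw = {}
    for codon in map("".join, product(NUCS, repeat=3)):
        if codon in STOPS:
            continue  # stop codons are excluded from the state space
        p = 1.0
        for n in codon:
            p *= nuc_freqs[n]
        raw[codon] = p
    total = sum(raw.values())
    return {codon: p / total for codon, p in raw.items()}


def implied_nuc_freqs(codon_freqs):
    """Nucleotide composition implied by a codon distribution,
    averaged over the three codon positions."""
    implied = dict.fromkeys(NUCS, 0.0)
    for codon, p in codon_freqs.items():
        for n in codon:
            implied[n] += p / 3.0
    return implied


# Hypothetical AT-rich composition, chosen only for illustration.
observed = {"T": 0.4, "C": 0.1, "A": 0.4, "G": 0.1}
codon_freqs = product_codon_freqs(observed)
implied = implied_nuc_freqs(codon_freqs)

for n in NUCS:
    print(n, observed[n], round(implied[n], 4))
```

Because the three stop codons are rich in T, A, and G, removing them and renormalizing shifts the implied composition away from the values that were plugged in. Under these assumptions, that mismatch is one way to see how such an estimator can fail to reproduce the composition of the data.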

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2912764 (PMC)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0011230 (PLOS)
