GTest: a software tool for graphical assessment of empirical distributions' Gaussianity.

Environ Monit Assess

Water Research Institute, National Research Council, Viale De Blasio, 5-70125, Bari, Italy.

Published: March 2016

This paper introduces GTest, a new software tool for testing the normality of a user-specified empirical distribution. It has two unusual characteristics. First, the user can choose among four versions of the normality test, each suited to a specific kind of dataset or goal. Second, the output follows an inferential paradigm that is essentially graphical and intrinsically self-explanatory. Inference-by-eye is an emerging inferential approach that is likely to find wide application in the near future, given the growing need to extend statistical methods to users with only informal statistical skills. For instance, the latest European regulations on environmental issues introduced strict protocols for data handling (data quality assurance, outlier detection, etc.) and information exchange (areal statistics, trend detection, etc.) between regional and central environmental agencies. Laboratory and field technicians will therefore be asked more and more often to use complex software applications to run specific statistical analyses on data from monitoring, surveying, or laboratory activities. Unfortunately, inferential statistics, which influence decision-making in the management of environmental resources, are often implemented so that their outcomes are expressed numerically, with brief comments in strict statistical jargon (degrees of freedom, level of significance, accepted/rejected H0, etc.). Such outcomes are often difficult to interpret for people with limited statistical knowledge. In this setting, visual inference can help fill the gap by providing outcomes in self-explanatory graphical form with a brief comment in plain language.
The difficulties experienced by colleagues, and their request for an effective tool to address them, motivated us to adopt the inference-by-eye paradigm and to implement an easy-to-use, quick, and reliable statistical tool. GTest presents its outcomes as a modified version of the Q-Q plot. The application was developed in Visual Basic for Applications (VBA) within MS Excel 2010, which proved to have the robustness and reliability needed. GTest provides true graphical normality tests that are as reliable as any quantitative statistical approach but much easier to understand. The Q-Q plots are supplemented with an acceptance region drawn around the theoretical distribution and defined according to the significance level alpha and the sample size. The decision rule is as follows: if the empirical scatterplot falls completely within the acceptance region, it can be concluded that the empirical distribution fits the theoretical one at the given alpha level. A comprehensive case study with simulated and real-world data was carried out to check the robustness and reliability of the software.
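The decision rule described above can be illustrated with a short sketch. The abstract does not say how GTest builds its acceptance region, so the code below swaps in one plausible technique: a Monte Carlo simultaneous band for the standardized order statistics of a normal sample, which depends on alpha and the sample size as the abstract requires. It is written in Python rather than VBA, and the function names (`acceptance_band`, `falls_within_band`, `theoretical_quantiles`), the Blom plotting positions, and the number of simulations are all assumptions made for illustration, not GTest's actual implementation.

```python
import math
import random
from statistics import NormalDist, mean, stdev


def standardize_and_sort(sample):
    """Return the sample as sorted z-scores (mean 0, sample sd 1)."""
    m, s = mean(sample), stdev(sample)
    return sorted((x - m) / s for x in sample)


def theoretical_quantiles(n):
    """Standard normal quantiles at Blom plotting positions (assumed choice);
    these would serve as the x-axis of the Q-Q plot."""
    nd = NormalDist()
    return [nd.inv_cdf((i - 0.375) / (n + 0.25)) for i in range(1, n + 1)]


def acceptance_band(n, alpha=0.05, n_sim=2000, seed=0):
    """Simulate n_sim normal samples of size n and return one (low, high)
    pair per order statistic, sized so that about a fraction 1 - alpha of
    the simulated samples lie entirely inside the band."""
    rng = random.Random(seed)
    sims = [standardize_and_sort([rng.gauss(0.0, 1.0) for _ in range(n)])
            for _ in range(n_sim)]
    columns = list(zip(*sims))            # one column per order statistic
    centers = [mean(col) for col in columns]
    scales = [stdev(col) for col in columns]
    # Largest scaled deviation of each simulated sample from the center line.
    max_dev = sorted(
        max(abs(z - c) / s for z, c, s in zip(sim, centers, scales))
        for sim in sims
    )
    crit = max_dev[math.ceil((1 - alpha) * n_sim) - 1]
    return [(c - crit * s, c + crit * s) for c, s in zip(centers, scales)]


def falls_within_band(sample, band):
    """Decision rule: True only if every point lies inside the band."""
    z = standardize_and_sort(sample)
    return all(lo <= v <= hi for v, (lo, hi) in zip(z, band))
```

In use, one would compute `band = acceptance_band(len(data), alpha=0.05)` and treat `falls_within_band(data, band)` returning `True` as "no evidence against normality at level alpha". Plotting the band and the sorted z-scores against `theoretical_quantiles(len(data))` gives a Q-Q plot with an acceptance region in the spirit of the one described.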

Source
http://dx.doi.org/10.1007/s10661-016-5138-1
