Univariate statistical analysis of environmental (compositional) data: problems and possibilities.

Sci Total Environ

Institute of Statistics and Probability Theory, Vienna University of Technology, Wiedner Hauptstrasse 8-10, Vienna, Austria.

Published: November 2009

For almost 30 years it has been known that compositional (closed) data have special geometrical properties. In environmental sciences, where the concentration of chemical elements in different sample materials is investigated, almost all datasets are compositional. In general, compositional data are parts of a whole that carry only relative information. Data that sum to a constant, e.g. 100 wt.% or 1,000,000 mg/kg, are the best-known example. It is widely neglected that the "closure" characteristic remains even if only one of all possible elements is measured; it is an inherent property of compositional data. No variable is free to vary independently of all the others. Existing transformations to "open" closed data are seldom applied: they are more complicated than a log transformation, and the relationship to the original data unit is lost. Results obtained when using classical statistical techniques for data analysis appeared reasonable, and the possible consequences of working with closed data were rarely questioned. Here the simple univariate case of data analysis is investigated. It can be demonstrated that data closure must be overcome before calculating even simple statistical measures like mean or standard deviation, or plotting graphs of the data distribution, e.g. a histogram. Some measures, like the standard deviation (or the variance), make no statistical sense with closed data, and all statistical tests building on the standard deviation (or variance) will thus provide erroneous results if used with the original data.
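The abstract does not say which transformation is used to "open" the data, so the following is only a minimal sketch under stated assumptions. It assumes a logit-style log-ratio (each part compared against the remainder of the whole) as one possible opening transformation. The helper names `logit` and `inv_logit` and the sample values are hypothetical and chosen purely for illustration. The sketch shows why a mean and standard deviation computed on the original closed scale can mislead: the interval "mean minus two standard deviations" may fall below zero, which is impossible for a concentration, whereas the same interval computed on the opened scale and transformed back stays strictly between 0 and 1.

```python
import math
import statistics


def logit(p):
    """Log-ratio of a part against the remainder of the whole; needs 0 < p < 1."""
    return math.log(p / (1.0 - p))


def inv_logit(z):
    """Map a log-ratio value back to a fraction of the whole."""
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical element concentrations expressed as fractions of the whole
# (for example wt.% divided by 100). These values are illustrative only.
fractions = [0.002, 0.005, 0.010, 0.030, 0.200]

# Classical summary statistics on the original (closed) scale.
raw_mean = statistics.mean(fractions)
raw_sd = statistics.stdev(fractions)
raw_low = raw_mean - 2 * raw_sd  # can be negative, i.e. not a valid concentration

# The same statistics on the opened (log-ratio) scale.
z = [logit(p) for p in fractions]
z_mean = statistics.mean(z)
z_sd = statistics.stdev(z)

# Back-transforming keeps every summary value inside the interval (0, 1).
open_center = inv_logit(z_mean)
open_low = inv_logit(z_mean - 2 * z_sd)
open_high = inv_logit(z_mean + 2 * z_sd)

print("closed scale: mean =", raw_mean, "mean - 2 sd =", raw_low)
print("opened scale: centre =", open_center, "interval =", (open_low, open_high))
```

Note that the standard deviation on the opened scale is expressed in log-ratio units, not in the original concentration unit, which matches the abstract's remark that the relationship to the original data unit is lost after transformation.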

Source
http://dx.doi.org/10.1016/j.scitotenv.2009.08.008
