A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3122
Function: getPubMedXML

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

How effective is score-based data quality assessment? An illustration with fish BCF data. | LitMetric

How effective is score-based data quality assessment? An illustration with fish BCF data.

Environ Res

Department of Agricultural Chemistry, National Taiwan University, Taipei City, Taiwan.

Published: December 2024

Increasingly rigorous data quality (DQ) evaluations and/or screening practices are being applied to environmental and ecotoxicological datasets. DQ is predominantly evaluated by scoring given data against preselected criteria. This study provides the first examination on the effectiveness of score-based DQ evaluation in providing statistically meaningful differentiation of measurements using fish bioconcentration factor (BCF) dataset as an illustration. This is achieved by inspecting how log BCF differs with the built-in overall-DQ and specific-DQ evaluations, and how it is influenced by interactive effects and hierarchy of DQ criteria. Approximately 80-90% of analyzable chemicals show no statistical difference in log BCF between low-quality (LQ) and high-quality (HQ) measurements in overall evaluation (n = 183) or in individual evaluation of 6 DQ criteria (n = 53 to 101). Further examination shows that log BCF may/may not change with different combinations or total number of criteria violations. Tree analysis and nodal structures of deviation in log BCF also reveal the absence of common structural dependence on the criteria violated. Finally, simple averaging of all measurements without DQ differentiation yields comparable log BCFs as those derived using strictly HQ data with ≤0.5 log unit difference in over 93% of the chemicals (n = 158) and no dependence on number of measurements, fraction of LQ measurements, or bioaccumulation potential of the chemicals. For accurate log BCF, DQ appears no more important than having more independent measurements irrespective of their individual DQ statuses. This work concludes by calling for: (i) re-documentation of experimental details in legacy environmental and ecotoxicological datasets, (ii) examination of other DQ-categorized datasets using the tests and tools applied here, and (ii) a thorough and systematic reflection on how DQ should be assessed for modeling, benchmarking, and other data-based analyses or applications.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.envres.2024.119880DOI Listing

Publication Analysis

Top Keywords

log bcf
20
data quality
8
environmental ecotoxicological
8
ecotoxicological datasets
8
bcf
7
log
7
measurements
6
data
5
criteria
5
effective score-based
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!