AI Article Synopsis

  • Accurately assessing risks from inorganic pollutants in groundwater is crucial, but existing water quality databases often lack sufficient data due to budget constraints and sample collection challenges.
  • Two advanced data imputation techniques, AMELIA and MICE, were compared, with AMELIA proving more effective in managing missing values without creating excessive outliers.
  • The use of imputed data revealed significantly more potential health risks in groundwater samples, helping state agencies better allocate resources for monitoring and analysis, thus enhancing decision-making and prioritization for future sampling efforts.

Article Abstract

Accurately assessing and managing risks associated with inorganic pollutants in groundwater is imperative. Historic water quality databases are often sparse due to rationale or financial budgets for sample collection and analysis, posing challenges in evaluating exposure or water treatment effectiveness. We utilized and compared two advanced multiple data imputation techniques, AMELIA and MICE algorithms, to fill gaps in sparse groundwater quality data sets. AMELIA outperformed MICE in handling missing values, as MICE tended to overestimate certain values, resulting in more outliers. Field data sets revealed that 75% to 80% of samples exhibited no co-occurring regulated pollutants surpassing MCL values, whereas imputed values showed only 15% to 55% of the samples posed no health risks. Imputed data unveiled a significant increase, ranging from 2 to 5 times, in the number of sampling locations predicted to potentially exceed health-based limits and identified samples where 2 to 6 co-occurring chemicals may occur and surpass health-based levels. Linking imputed data to sampling locations can pinpoint potential hotspots of elevated chemical levels and guide optimal resource allocation for additional field sampling and chemical analysis. With this approach, further analysis of complete data sets allows state agencies authorized to conduct groundwater monitoring, often with limited financial resources, to prioritize sampling locations and chemicals to be tested. Given existing data and time constraints, it is crucial to identify the most strategic use of the available resources to address data gaps effectively. This work establishes a framework to enhance the beneficial impact of funding groundwater data collection by reducing uncertainty in prioritizing future sampling locations and chemical analyses.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11580165PMC
http://dx.doi.org/10.1021/acs.est.4c05203DOI Listing

Publication Analysis

Top Keywords

sampling locations
16
data sets
12
data
9
multiple data
8
data imputation
8
imputed data
8
groundwater
5
sampling
5
imputation methods
4
methods advance
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!