Global increases in the occurrence of harmful algal blooms (HABs) are of major concern in water quality and resource management. A predictive model capable of quantifying the spatiotemporal associations between HABs and their influencing factors is required for effective preventive management. In this study, a feature stream network (FSN) model is proposed to provide daily forecasts of cyanobacteria abundance at multiple monitoring sites simultaneously in a river network.
View Article and Find Full Text PDFClimate warming is linked to earlier onset and extended duration of cyanobacterial blooms in temperate rivers. This causes an unpredictable extent of harm to the functioning of the ecosystem and public health. We used Microcystis spp.
View Article and Find Full Text PDFEnvironmental factors, such as climate change and land use changes, affect water quality drastically. To consider these, various predictive models, both process-based and data-driven, have been used. However, each model has distinct limitations.
View Article and Find Full Text PDFRecently, weather data have been applied to one of deep learning techniques known as "long short-term memory (LSTM)" to predict streamflow in rainfall-runoff relationships. However, this approach may not be suitable for regions with artificial water management structures such as dams and weirs. Therefore, this study aims to evaluate the prediction accuracy of LSTM for streamflow depending on the availability of dam/weir operational data across South Korea.
View Article and Find Full Text PDFThe process of designing streamlined workflows for developing microbial strains using classical methods from vast amounts of biological big data has reached its limits. With the continuous increase in the amount of biological big data, data-driven machine learning approaches are being used to overcome the limits of classical approaches for strain development. Here, machine learning-guided engineering of Deinococcus radiodurans R1 for high-yield production of lycopene was demonstrated.
View Article and Find Full Text PDFRoutine monitoring for harmful algal blooms (HABs) is generally undertaken at low temporal frequency (e.g., weekly to monthly) that is unsuitable for capturing highly dynamic variations in cyanobacteria abundance.
View Article and Find Full Text PDFThe dynamics of fecal indicator bacteria, such as fecal coliforms (FC) in streams, are influenced by the interactions of a myriad of factors. To predict complex spatiotemporal patterns of FC in streams and assess the relative importance of numerous controlling factors, the adoption of a hierarchical Bayesian network (HBN) was proposed in this study. By introducing latent variables correlated to the observed variables into a Bayesian network, the HBN can represent causal relationships among a large set of variables with a multilevel hierarchy.
View Article and Find Full Text PDFSpecies distribution models (SDMs), in which species occurrences are related to a suite of environmental variables, have been used as a decision-making tool in ecosystem management. Complex machine learning (ML) algorithms that lack interpretability may hinder the use of SDMs for ecological explanations, possibly limiting the role of SDMs as a decision-support tool. To meet the growing demand of explainable MLs, several interpretable ML methods have recently been proposed.
View Article and Find Full Text PDFUsing cross-sectional data for making ecological inference started as a practical means of pooling data to enable meaningful empirical model development. For example, limnologists routinely use sample averages from numerous individual lakes to examine patterns across lakes. The basic assumption behind the use of cross-lake data is often that responses within and across lakes are identical.
View Article and Find Full Text PDFUnderstanding harmful algal blooms is imperative to protect aquatic ecosystems and human health. This study describes the spatial and temporal distributions of cyanobacterial blooms to identify the relations between blooms and environmental factors in the Baekje Reservoir. Two-year cyanobacterial cell data at one fixed station and four remotely sensed distributions of phycocyanin (PC) concentrations based on hyperspectral images (HSIs) were used to describe the relation between the spatial and temporal variations in the blooms and the affecting factors.
View Article and Find Full Text PDFDespite a growing awareness of the problems associated with cyanobacterial blooms in rivers, and particularly in regulated rivers, the drivers of bloom formation and abundance in rivers are not well understood. We developed a Bayesian hierarchical model to assess the relative importance of predictors of summer cyanobacteria abundance, and to test whether the relative importance of each predictor varies by site, using monitoring data from 16 sites in the four major rivers of South Korea. The results suggested that temperature and residence time, but not nutrient levels, are important predictors of summer cyanobacteria abundance in rivers.
View Article and Find Full Text PDFBacteria are a primary contaminant in natural surface water. The instream concentration of fecal coliform, a potential indicator of pathogens, is influenced by meteorological conditions and land-use characteristics. However, the relationships between these conditions and fecal coliforms are not fully understood.
View Article and Find Full Text PDFEnviron Sci Technol
May 2015
The implications of Stein's paradox stirred considerable debate in statistical circles when the concept was first introduced in the 1950s. The paradox arises when we are interested in estimating the means of several variables simultaneously. In this situation, the best estimator for an individual mean, the sample average, is no longer the best.
View Article and Find Full Text PDFIn the Mekong River basin, groundwater from tube-wells is a major drinking water source. However, arsenic (As) contamination in groundwater resources has become a critical issue in the watershed. In this study, As species such as total As (AsTOT), As(III), and As(V), were monitored across the watershed to investigate their characteristics and inter-relationships with water quality parameters, including pH and redox potential (Eh).
View Article and Find Full Text PDFCyanobacterial blooms in western Lake Erie have recently garnered widespread attention. Current evidence indicates that a major source of the nutrients that fuel these blooms is the Maumee River. We applied a seasonal trend decomposition technique to examine long-term and seasonal changes in Maumee River discharge and nutrient concentrations and loads.
View Article and Find Full Text PDFWe explore how the analysis of web-based data, such as Twitter and Google Trends, can be used to assess the social relevance of an environmental accident. The concept and methods are applied in the shutdown of drinking water supply at the city of Toledo, Ohio, USA. Toledo's notice, which persisted from August 1 to 4, 2014, is a high-profile event that directly influenced approximately half a million people and received wide recognition.
View Article and Find Full Text PDFEnviron Sci Technol
April 2013
Correlations between chlorophyll a and total phosphorus in freshwater ecosystems were first documented in the 1960s and have been used since then to infer phosphorus limitation, build simple models, and develop management targets. Often these correlations are considered indicative of a cause-effect relationship. However, many scientists regard the use of these associations for modeling and inference to be misleading due to their potentially spurious nature.
View Article and Find Full Text PDFEnviron Sci Technol
September 2011
Dreissenid mussels were first documented in the Laurentian Great Lakes in the late 1980s. Zebra mussels (Dreissena polymorpha) spread quickly into shallow, hard-substrate areas; quagga mussels (Dreissena rostriformis bugensis) spread more slowly and are currently colonizing deep, offshore areas. These mussels occur at high densities, filter large water volumes while feeding on suspended materials, and deposit particulate waste on the lake bottom.
View Article and Find Full Text PDFWe propose the use of Bayesian hierarchical/multilevel ratio approach to estimate the annual riverine phosphorus loads in the Saginaw River, Michigan, from 1968 to 2008. The ratio estimator is known to be an unbiased, precise approach for differing flow-concentration relationships and sampling schemes. A Bayesian model can explicitly address the uncertainty in prediction by using a posterior predictive distribution, while in comparison, a Bayesian hierarchical technique can overcome the limitation of interpreting the estimated annual loads inferred from small sample sizes by borrowing strength from the underlying population shared by the years of interest.
View Article and Find Full Text PDF