Publications by authors named "YoonKyung Cha"

Global increases in the occurrence of harmful algal blooms (HABs) are of major concern in water quality and resource management. A predictive model capable of quantifying the spatiotemporal associations between HABs and their influencing factors is required for effective preventive management. In this study, a feature stream network (FSN) model is proposed to provide daily forecasts of cyanobacteria abundance at multiple monitoring sites simultaneously in a river network.

View Article and Find Full Text PDF
Article Synopsis
  • Recent advancements in remote sensing and deep learning have led to the development of the Hierarchical Concatenated Variational Autoencoder (HCVAE), which effectively monitors harmful algal blooms (HABs) by analyzing hyperspectral data in bodies of water such as Daecheong Lake in South Korea.
  • * HCVAE utilizes a multi-level hierarchical approach that allows for efficient extraction of key spectral features, enabling accurate estimation of chlorophyll-a (Chl-a) and phycocyanin (PC) concentrations with high agreement to measured values.
  • * The study showcases HCVAE's ability to create spatial distribution maps of algal pigments using drone technology, thereby improving near-real-time monitoring and assessment of HABs.
View Article and Find Full Text PDF

Climate warming is linked to earlier onset and extended duration of cyanobacterial blooms in temperate rivers. This causes an unpredictable extent of harm to the functioning of the ecosystem and public health. We used Microcystis spp.

View Article and Find Full Text PDF

Environmental factors, such as climate change and land use changes, affect water quality drastically. To consider these, various predictive models, both process-based and data-driven, have been used. However, each model has distinct limitations.

View Article and Find Full Text PDF

Recently, weather data have been applied to one of deep learning techniques known as "long short-term memory (LSTM)" to predict streamflow in rainfall-runoff relationships. However, this approach may not be suitable for regions with artificial water management structures such as dams and weirs. Therefore, this study aims to evaluate the prediction accuracy of LSTM for streamflow depending on the availability of dam/weir operational data across South Korea.

View Article and Find Full Text PDF

The process of designing streamlined workflows for developing microbial strains using classical methods from vast amounts of biological big data has reached its limits. With the continuous increase in the amount of biological big data, data-driven machine learning approaches are being used to overcome the limits of classical approaches for strain development. Here, machine learning-guided engineering of Deinococcus radiodurans R1 for high-yield production of lycopene was demonstrated.

View Article and Find Full Text PDF

Routine monitoring for harmful algal blooms (HABs) is generally undertaken at low temporal frequency (e.g., weekly to monthly) that is unsuitable for capturing highly dynamic variations in cyanobacteria abundance.

View Article and Find Full Text PDF

The dynamics of fecal indicator bacteria, such as fecal coliforms (FC) in streams, are influenced by the interactions of a myriad of factors. To predict complex spatiotemporal patterns of FC in streams and assess the relative importance of numerous controlling factors, the adoption of a hierarchical Bayesian network (HBN) was proposed in this study. By introducing latent variables correlated to the observed variables into a Bayesian network, the HBN can represent causal relationships among a large set of variables with a multilevel hierarchy.

View Article and Find Full Text PDF

Species distribution models (SDMs), in which species occurrences are related to a suite of environmental variables, have been used as a decision-making tool in ecosystem management. Complex machine learning (ML) algorithms that lack interpretability may hinder the use of SDMs for ecological explanations, possibly limiting the role of SDMs as a decision-support tool. To meet the growing demand of explainable MLs, several interpretable ML methods have recently been proposed.

View Article and Find Full Text PDF

Using cross-sectional data for making ecological inference started as a practical means of pooling data to enable meaningful empirical model development. For example, limnologists routinely use sample averages from numerous individual lakes to examine patterns across lakes. The basic assumption behind the use of cross-lake data is often that responses within and across lakes are identical.

View Article and Find Full Text PDF

Understanding harmful algal blooms is imperative to protect aquatic ecosystems and human health. This study describes the spatial and temporal distributions of cyanobacterial blooms to identify the relations between blooms and environmental factors in the Baekje Reservoir. Two-year cyanobacterial cell data at one fixed station and four remotely sensed distributions of phycocyanin (PC) concentrations based on hyperspectral images (HSIs) were used to describe the relation between the spatial and temporal variations in the blooms and the affecting factors.

View Article and Find Full Text PDF

Despite a growing awareness of the problems associated with cyanobacterial blooms in rivers, and particularly in regulated rivers, the drivers of bloom formation and abundance in rivers are not well understood. We developed a Bayesian hierarchical model to assess the relative importance of predictors of summer cyanobacteria abundance, and to test whether the relative importance of each predictor varies by site, using monitoring data from 16 sites in the four major rivers of South Korea. The results suggested that temperature and residence time, but not nutrient levels, are important predictors of summer cyanobacteria abundance in rivers.

View Article and Find Full Text PDF

Bacteria are a primary contaminant in natural surface water. The instream concentration of fecal coliform, a potential indicator of pathogens, is influenced by meteorological conditions and land-use characteristics. However, the relationships between these conditions and fecal coliforms are not fully understood.

View Article and Find Full Text PDF

The implications of Stein's paradox stirred considerable debate in statistical circles when the concept was first introduced in the 1950s. The paradox arises when we are interested in estimating the means of several variables simultaneously. In this situation, the best estimator for an individual mean, the sample average, is no longer the best.

View Article and Find Full Text PDF

In the Mekong River basin, groundwater from tube-wells is a major drinking water source. However, arsenic (As) contamination in groundwater resources has become a critical issue in the watershed. In this study, As species such as total As (AsTOT), As(III), and As(V), were monitored across the watershed to investigate their characteristics and inter-relationships with water quality parameters, including pH and redox potential (Eh).

View Article and Find Full Text PDF

Cyanobacterial blooms in western Lake Erie have recently garnered widespread attention. Current evidence indicates that a major source of the nutrients that fuel these blooms is the Maumee River. We applied a seasonal trend decomposition technique to examine long-term and seasonal changes in Maumee River discharge and nutrient concentrations and loads.

View Article and Find Full Text PDF

We explore how the analysis of web-based data, such as Twitter and Google Trends, can be used to assess the social relevance of an environmental accident. The concept and methods are applied in the shutdown of drinking water supply at the city of Toledo, Ohio, USA. Toledo's notice, which persisted from August 1 to 4, 2014, is a high-profile event that directly influenced approximately half a million people and received wide recognition.

View Article and Find Full Text PDF

Correlations between chlorophyll a and total phosphorus in freshwater ecosystems were first documented in the 1960s and have been used since then to infer phosphorus limitation, build simple models, and develop management targets. Often these correlations are considered indicative of a cause-effect relationship. However, many scientists regard the use of these associations for modeling and inference to be misleading due to their potentially spurious nature.

View Article and Find Full Text PDF

Dreissenid mussels were first documented in the Laurentian Great Lakes in the late 1980s. Zebra mussels (Dreissena polymorpha) spread quickly into shallow, hard-substrate areas; quagga mussels (Dreissena rostriformis bugensis) spread more slowly and are currently colonizing deep, offshore areas. These mussels occur at high densities, filter large water volumes while feeding on suspended materials, and deposit particulate waste on the lake bottom.

View Article and Find Full Text PDF

We propose the use of Bayesian hierarchical/multilevel ratio approach to estimate the annual riverine phosphorus loads in the Saginaw River, Michigan, from 1968 to 2008. The ratio estimator is known to be an unbiased, precise approach for differing flow-concentration relationships and sampling schemes. A Bayesian model can explicitly address the uncertainty in prediction by using a posterior predictive distribution, while in comparison, a Bayesian hierarchical technique can overcome the limitation of interpreting the estimated annual loads inferred from small sample sizes by borrowing strength from the underlying population shared by the years of interest.

View Article and Find Full Text PDF