Unsupervised learning for lake underwater vegetation classification: Constructing high-precision, large-scale aquatic ecological datasets.

Sci Total Environ

School of Engineering, Dali University, Yunnan 671003, China; National Observation and Research Station of Erhai Lake Ecosystem in Yunnan, Dali 671006, China.; Air-Space-Ground Integrated Intelligence and Big Data Application Engineering Research Center of Yunnan Provincial Department of Education, Yunnan 671003, China. Electronic address:

Published: December 2024

AI Article Synopsis

  • * Supervised AI techniques can identify vegetation but rely on labeled datasets that are costly and time-consuming to create, leading to issues with accuracy when applied to new environments.
  • * This study introduces an unsupervised classification method that significantly reduces the need for manual annotation, achieving high accuracy in identifying underwater vegetation while utilizing innovative techniques like dimensionality reduction and a voting mechanism, making it more efficient and scalable.

Article Abstract

Monitoring underwater vegetation is vital for evaluating lake ecosystem health. Automated data collection and analysis play key roles in achieving large-scale, high-precision, and high-frequency monitoring. While technologies such as unmanned vessels have made data collection more efficient, challenges persist in the analysis process, particularly in addressing the varied needs of different lake environments. Supervised AI methods can automatically identify underwater vegetation but are heavily reliant on labeled datasets. In practice, models trained on public datasets often struggle with generalization due to differences in vegetation types, collection environments, and equipment, resulting in discrepancies between training and testing datasets. Moreover, traditional dataset construction methods that rely on manual annotation are time-consuming and costly, limiting their scalability and application. This study aims to overcome these challenges by proposing an unsupervised method for automatically classifying underwater vegetation data, aiming to reduce manual annotation efforts and construct unbiased datasets at lower costs with greater efficiency. Compared with existing unsupervised, self-supervised, and unsupervised domain adaptation methods, this method introduces two key innovations: 1) a two-step dimensionality reduction method that combines pre-trained model and manifold learning to extract key features and 2) a multialgorithm voting mechanism to increase classification confidence. These features enable high-accuracy classification without prior data annotation. Experiments show 97.32 % accuracy on public dataset and 92.43 % and 96.15 % accuracy on private datasets from Erhai Lake and Wuhan East Lake, respectively, surpassing supervised methods and matching manual classification. Additionally, it drastically reduces the annotation effort, requiring only approximately 20 labeled images to classify thousands of points. By integrating unmanned vessel technology, this approach provides an efficient, cost-effective solution for large-scale, high-frequency underwater vegetation monitoring across diverse lakes.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.scitotenv.2024.177895DOI Listing

Publication Analysis

Top Keywords

underwater vegetation
20
data collection
8
supervised methods
8
manual annotation
8
vegetation
6
datasets
6
lake
5
underwater
5
unsupervised
4
unsupervised learning
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!