The underwater environment is complicated and changeable and contains many noises, making it difficult to detect a particular object in the underwater environment. At present, the main seabed detection technology explores the seabed environment with sonar equipment. However, the characteristics of underwater sonar imaging (e.g., low contrast, blurred edges, poor texture, and unsatisfactory quality) have serious negative influences on such image classification. Therefore, in this study, we propose a dual-path deep residual "shrinkage" network (DP-DRSN) module, which is a simple and effective neural network attention module that can classify side-scan sonar images. Specifically, the module can extract background and feature texture information of the input feature mapping through different scales (e.g., global average pooling and global max pooling), whereas scale information passes through a two-layer 1 × 1 convolution to increase nonlinearity. This helps realize cross-channel information interaction and information integration simultaneously before outputting threshold parameters in a sigmoid layer. The parameters are then multiplied by the average value of the input feature mapping to obtain a threshold, which is used to denoise the image features using the soft threshold function. The proposed DP-DRSN study provided higher classification accuracy and efficiency than other models. In this way, the feasibility and effectiveness of DP-DRSN in image classification of side-scan sonar are proven.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8970946 | PMC |
http://dx.doi.org/10.1155/2022/6962838 | DOI Listing |
J Imaging
September 2024
Department of Oceanography and Hydrography, Dalian Naval Academy, Dalian 116018, China.
In this paper, a method for augmenting samples of side-scan sonar seafloor sediment images based on CBAM-BCEL1-INGAN is proposed, aiming to address the difficulties in acquiring and labeling datasets, as well as the insufficient diversity and quantity of data samples. Firstly, a Convolutional Block Attention Module (CBAM) is integrated into the residual blocks of the INGAN generator to enhance the learning of specific attributes and improve the quality of the generated images. Secondly, a BCEL1 loss function (combining binary cross-entropy and L1 loss functions) is introduced into the discriminator, enabling it to focus on both global image consistency and finer distinctions for better generation results.
View Article and Find Full Text PDFSensors (Basel)
August 2024
Department of Oceanography and Hydrography, Dalian Naval Academy, Dalian 116018, China.
Side-scan sonar is a principal technique for subsea target detection, where the quantity of sonar images of seabed targets significantly influences the accuracy of intelligent target recognition. To expand the number of representative side-scan sonar target image samples, a novel augmentation method employing self-training with a Disrupted Student model is designed (DS-SIAUG). The process begins by inputting a dataset of side-scan sonar target images, followed by augmenting the samples through an adversarial network consisting of the DDPM (Denoising Diffusion Probabilistic Model) and the YOLO (You Only Look Once) detection model.
View Article and Find Full Text PDFSensors (Basel)
July 2024
Edgelab S.r.l, Via Privata OTO, 10, 19136 La Spezia, Italy.
CORAL (Catamaran fOr UndeRwAter expLoration) is a compact, unmanned catamaran-type vehicle designed and developed to assist the scientific community in exploring marine areas such as inshore regions that are not easily accessible by traditional vessels. This vehicle can operate in different modalities: completely autonomous, semi-autonomous, or remotely assisted by the operator, thus accommodating various investigative scenarios. CORAL is characterized by compact dimensions, a very low draft and a total electric propulsion system.
View Article and Find Full Text PDFSensors (Basel)
July 2024
Department of Oceanography and Hydrography, Dalian Naval Academy, Dalian 116018, China.
Aiming at the problem of low accuracy of multi-scale seafloor target detection in side-scan sonar images with high noise and complex background texture, a model for multi-scale target detection using the BES-YOLO network is proposed. First, an efficient multi-scale attention (EMA) mechanism is used in the backbone of the YOLOv8 network, and a bi-directional feature pyramid network (Bifpn) is introduced to merge the information of different scales, finally, a Shape_IoU loss function is introduced to continuously optimize the model and improve its accuracy. Before training, the dataset is preprocessed using 2D discrete wavelet decomposition and reconstruction to enhance the robustness of the network.
View Article and Find Full Text PDFSci Rep
June 2024
Research Institute of USV Engineering, School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, 200444, China.
Underwater object detection based on side-scan sonar (SSS) suffers from a lack of finely annotated data. This study aims to avoid the laborious task of annotation by achieving unsupervised underwater object detection through domain-adaptive object detection (DAOD). In DAOD, there exists a conflict between feature transferability and discriminability, suppressing the detection performance.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!