Human activity recognition by radar sensors plays an important role in healthcare and smart homes. However, labeling a large number of radar datasets is difficult and time-consuming, and it is difficult for models trained on insufficient labeled data to obtain exact classification results. In this paper, we propose a multiscale residual weighted classification network with large-scale, medium-scale, and small-scale residual networks. Firstly, an MRW image encoder is used to extract salient feature representations from all time-Doppler images through contrastive learning. This can extract the representative vector of each image and also obtain the pre-training parameters of the MRW image encoder. During the pre-training process, large-scale residual networks, medium-scale residual networks, and small-scale residual networks are used to extract global information, texture information, and semantic information, respectively. Moreover, the time-channel weighting mechanism can allocate weights to important time and channel dimensions to achieve more effective extraction of feature information. The model parameters obtained from pre-training are frozen, and the classifier is added to the backend. Finally, the classifier is fine-tuned using a small amount of labeled data. In addition, we constructed a new dataset with eight dangerous activities. The proposed MRW-CN model was trained on this dataset and achieved a classification accuracy of 96.9%. We demonstrated that our method achieves state-of-the-art performance. The ablation analysis also demonstrated the role of multi-scale convolutional kernels and time-channel weighting mechanisms in classification.

Download full-text PDF

Source
http://dx.doi.org/10.3390/s25010197DOI Listing

Publication Analysis

Top Keywords

residual networks
16
multiscale residual
8
residual weighted
8
weighted classification
8
classification network
8
human activity
8
activity recognition
8
labeled data
8
small-scale residual
8
mrw image
8

Similar Publications

This paper introduces a novel energy-efficient lightweight, void hole avoidance, localization, and trust-based scheme, termed as Energy-Efficient and Trust-based Autonomous Underwater Vehicle (EETAUV) protocol designed for 6G-enabled underwater acoustic sensor networks (UASNs). The proposed scheme addresses key challenges in UASNs, such as energy consumption, network stability, and data security. It integrates a trust management framework that enhances communication security through node identification and verification mechanisms utilizing normal and phantom nodes.

View Article and Find Full Text PDF

Human activity recognition by radar sensors plays an important role in healthcare and smart homes. However, labeling a large number of radar datasets is difficult and time-consuming, and it is difficult for models trained on insufficient labeled data to obtain exact classification results. In this paper, we propose a multiscale residual weighted classification network with large-scale, medium-scale, and small-scale residual networks.

View Article and Find Full Text PDF

Bridge expansion joints are critical components that accommodate the movement of a bridge caused by temperature fluctuations, concrete shrinkage, and vehicular loads. Analyzing the spatiotemporal deformation of these expansion joints is essential for monitoring bridge safety. This study investigates the deformation characteristics of Hongtang Bridge in Fuzhou, China, using synthetic aperture radar interferometry (InSAR).

View Article and Find Full Text PDF

Residual Vision Transformer and Adaptive Fusion Autoencoders for Monocular Depth Estimation.

Sensors (Basel)

December 2024

Institute of Computer and Communication Engineering, Department of Electrical Engineering, National Cheng Kung University, Tainan 701, Taiwan.

Precision depth estimation plays a key role in many applications, including 3D scene reconstruction, virtual reality, autonomous driving and human-computer interaction. Through recent advancements in deep learning technologies, monocular depth estimation, with its simplicity, has surpassed the traditional stereo camera systems, bringing new possibilities in 3D sensing. In this paper, by using a single camera, we propose an end-to-end supervised monocular depth estimation autoencoder, which contains an encoder with a structure with a mixed convolution neural network and vision transformers and an effective adaptive fusion decoder to obtain high-precision depth maps.

View Article and Find Full Text PDF

As highway tunnel operations continue over time, structural defects, particularly cracks, have been observed to increase annually. Coupled with the rapid expansion of tunnel networks, traditional manual inspection methods have proven inadequate to meet current demands. In recent years, machine vision and deep learning technologies have gained significant attention in civil engineering for the detection and analysis of structural defects.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!