Publications by authors named "Weidong Min"

Article Synopsis
  • Vehicle re-identification (Vehicle Re-ID) is a technology used for tracking and retrieving specific vehicles across multiple cameras, which can aid in monitoring violations and capturing fugitives, but faces challenges like low-resolution images that hinder accurate feature extraction.
  • This study introduces a new Vehicle Re-ID method that utilizes super-resolution image generation with Generative Adversarial Networks (GANs) to enhance low-resolution images, along with a pyramidal convolution residual network to capture various feature scales and improve recognition accuracy.
  • The proposed method combines features from both original and super-resolution images to produce clearer, more detailed representations, ultimately increasing the model's performance in accurately identifying vehicles.
View Article and Find Full Text PDF

Medical image segmentation is a compelling fundamental problem and an important auxiliary tool for clinical applications. Recently, the Transformer model has emerged as a valuable tool for addressing the limitations of convolutional neural networks by effectively capturing global relationships and numerous hybrid architectures combining convolutional neural networks (CNNs) and Transformer have been devised to enhance segmentation performance. However, they suffer from multilevel semantic feature gaps and fail to account for multilevel dependencies between space and channel.

View Article and Find Full Text PDF

Introduction: Accurately counting the number of dense objects in an image, such as pedestrians or vehicles, is a challenging and practical task. The existing density map regression methods based on CNN are mainly used to count a class of dense objects in a single scene. However, in complex traffic scenes, objects such as vehicles and pedestrians usually exist at the same time, and multiple classes of dense objects need to be counted simultaneously.

View Article and Find Full Text PDF

Semi-supervised medical image segmentation strives to polish deep models with a small amount of labeled data and a large amount of unlabeled data. The efficiency of most semi-supervised medical image segmentation methods based on voxel-level consistency learning is affected by low-confidence voxels. In addition, voxel-level consistency learning fails to consider the spatial correlation between neighboring voxels.

View Article and Find Full Text PDF

The vehicle logo contains the vehicle's identity information, so vehicle logo detection (VLD) technology has extremely important significance. Although the VLD field has been studied for many years, the detection task is still difficult due to the small size of the vehicle logo and the background interference problem. To solve these problems, this paper proposes a method of VLD based on the YOLO-T model and the correlation of the vehicle space structure.

View Article and Find Full Text PDF

Medical image segmentation has long been a compelling and fundamental problem in the realm of neuroscience. This is an extremely challenging task due to the intensely interfering irrelevant background information to segment the target. State-of-the-art methods fail to consider simultaneously addressing both long-range and short-range dependencies, and commonly emphasize the semantic information characterization capability while ignoring the geometric detail information implied in the shallow feature maps resulting in the dropping of crucial features.

View Article and Find Full Text PDF

Objective: Autism spectrum disorder (ASD) affects nearly 1 in 44 children younger than 8 years old in the United States, and the situation may be even worse in remote areas of the world. However, it is difficult to utilize existing approaches to screen patients with ASD in remote areas due to the lack of professionals and high-tech instruments. Therefore, we develop a fast and accurate scalable method for screening children with ASD.

View Article and Find Full Text PDF

Due to the complexity of medical imaging techniques and the high heterogeneity of glioma surfaces, image segmentation of human gliomas is one of the most challenging tasks in medical image analysis. Current methods based on convolutional neural networks concentrate on feature extraction while ignoring the correlation between local and global. In this paper, we propose a residual mix transformer fusion net, namely RMTF-Net, for brain tumor segmentation.

View Article and Find Full Text PDF

Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems.

View Article and Find Full Text PDF

An important area in a gathering place is a region attracting the constant attention of people and has evident visual features, such as a flexible stage or an open-air show. Finding such areas can help security supervisors locate the abnormal regions automatically. The existing related methods lack an efficient means to find important area candidates from a scene and have failed to judge whether or not a candidate attracts people's attention.

View Article and Find Full Text PDF

Vehicle Logo Recognition (VLR) is an important part of vehicle behavior analysis and can provide supplementary information for vehicle identification, which is an essential research topic in robotic systems. However, the inaccurate extraction of vehicle logo candidate regions will affect the accuracy of logo recognition. Additionally, the existing methods have low recognition rate for most small vehicle logos and poor performance under complicated environments.

View Article and Find Full Text PDF

Behavior analysis through posture recognition is an essential research in robotic systems. Sitting with unhealthy sitting posture for a long time seriously harms human health and may even lead to lumbar disease, cervical disease and myopia. Automatic vision-based detection of unhealthy sitting posture, as an example of posture detection in robotic systems, has become a hot research topic.

View Article and Find Full Text PDF

We propose two tampered image detection methods based on consistency of shadow. The first method is based on texture consistency of shadow for the first kind of splicing image, in which the shadow as well as main body is copied and pasted from another image. The suspicious region including shadow and nonshadow is first selected.

View Article and Find Full Text PDF