Publications by authors named "Sam Kwong"

The research of class-incremental semantic segmentation (CISS) seeks to enhance semantic segmentation methods by enabling the progressive learning of new classes while preserving knowledge of previously learned ones. A significant yet often neglected challenge in this domain is class imbalance. In CISS, each task focuses on different foreground classes, with the training set for each task exclusively comprising images that contain these currently focused classes.

View Article and Find Full Text PDF

Feature selection (FS) is a significant research topic in machine learning and artificial intelligence, but it becomes complicated in the high dimensional search space due to the vast number of features. Evolutionary computation (EC) has been widely used in solving FS by modeling it as an expensive wrapper-form optimization task, where a classifier is used to obtain classification accuracy for fitness evaluation (FE). In this article, we propose that the FS problem can be also modeled as a cheap filter-form optimization task, where the FE is based on the relevance and redundancy of the selected features.

View Article and Find Full Text PDF

Electroencephalography (EEG) is widely utilized for train driver state detection due to its high accuracy and low latency. However, existing methods for driver status detection rarely use the rich physiological information in EEG to improve detection performance. Moreover, there is currently a lack of EEG datasets for abnormal states of train drivers.

View Article and Find Full Text PDF

In this paper, we address the challenge of significant memory consumption and redundant components in large-scale voxel-based model, which are commonly encountered in real-world 3D reconstruction scenarios. We propose a novel method called Shell-guided compression of Voxel Radiance Fields (SVRF), aimed at optimizing voxel-based model into a shell-like structure to reduce storage costs while maintaining rendering accuracy. Specifically, we first introduce a Shell-like Constraint, operating in two main aspects: 1) enhancing the influence of voxels neighboring the surface in determining the rendering outcomes, and 2) expediting the elimination of redundant voxels both inside and outside the surface.

View Article and Find Full Text PDF

Typically, deep network-based full-reference image quality assessment (FR-IQA) models compare deep features from reference and distorted images pairwise, overlooking correlations among features from the same source. We propose a dual-branch framework to capture the joint degradation effect among deep network features. The first branch uses kernel representation similarity analysis (KRSA), which compares feature self-similarity matrices via the mean absolute error (MAE).

View Article and Find Full Text PDF

High Dynamic Range (HDR) images present unique challenges for Learned Image Compression (LIC) due to their complex domain distribution compared to Low Dynamic Range (LDR) images. In coding practice, HDR-oriented LIC typically adopts preprocessing steps (e.g.

View Article and Find Full Text PDF

Accurate monitoring of drowsy driving through electroencephalography (EEG) can effectively reduce traffic accidents. Developing a calibration-free drowsiness detection system with single-channel EEG alone is very challenging due to the non-stationarity of EEG signals, the heterogeneity among different individuals, and the relatively parsimonious compared to multi-channel EEG. Although deep learning-based approaches can effectively decode EEG signals, most deep learning models lack interpretability due to their black-box nature.

View Article and Find Full Text PDF
Article Synopsis
  • In refining Knowledge Graphs, new entities appear and old ones change, leading to a problem of distribution shift for entity features during representation learning.
  • Most current methods for embedding these graphs mainly focus on new entities and overlook the issues caused by this distribution shift.
  • The proposed model, EDSU, uses mean and variance reconstruction to address this shift by integrating both the characteristics of entity embeddings and their neighborhood structures, resulting in improved performance in inductive link prediction tasks compared to existing models.
View Article and Find Full Text PDF

In recent years, the High-Dynamic-Range (HDR) image has gained widespread popularity across various domains, such as the security, multimedia, and biomedical fields, owing to its ability to deliver an authentic visual experience. However, the extensive dynamic range and rich detail in HDR images present challenges in assessing their quality. Therefore, current efforts involve constructing subjective databases and proposing objective quality assessment metrics to achieve an efficient HDR Image Quality Assessment (IQA).

View Article and Find Full Text PDF

The self-expressive coefficient plays a crucial role in the self-expressiveness-based subspace clustering method. To enhance the precision of the self-expressive coefficient, we propose a novel deep subspace clustering method, named grouping belief-based deep contrastive subspace clustering (GRESS), which integrates the clustering information and higher-order relationship into the coefficient matrix. Specifically, we develop a deep contrastive subspace clustering module to enhance the learning of both self-expressive coefficients and cluster representations simultaneously.

View Article and Find Full Text PDF

Knowledge transfer (KT) is crucial for optimizing tasks in evolutionary multitask optimization (EMTO). However, most existing KT methods can only achieve superficial KT but lack the ability to deeply mine the similarities or relationships among different tasks. This limitation may result in negative transfer, thereby degrading the KT performance.

View Article and Find Full Text PDF
Article Synopsis
  • The paper introduces a Learning-based gEnome Codec (LEC) aimed at achieving high efficiency and flexibility in lossless data compression.
  • LEC employs advanced techniques like Group of Bases compression, multi-stride coding, and bidirectional prediction to optimize coding performance while keeping complexity manageable.
  • Experimental results demonstrate that LEC effectively balances compression ratios and inference speed, making it suitable for various real-world applications.
View Article and Find Full Text PDF

Deep CNNs have achieved impressive improvements for night-time self-supervised depth estimation form a monocular image. However, the performance degrades considerably compared to day-time depth estimation due to significant domain gaps, low visibility, and varying illuminations between day and night images. To address these challenges, we propose a novel night-time self-supervised monocular depth estimation framework with structure regularization, i.

View Article and Find Full Text PDF

The extraction of spatiotemporal neuron activity from calcium imaging videos plays a crucial role in unraveling the coding properties of neurons. While existing neuron extraction approaches have shown promising results, disturbing and scattering background and unused depth still impede their performance. To address these limitations, we develop an automatic and accurate neuron extraction paradigm, dubbed as decomposition-estimation-reconstruction (DER), consisting of D-procedure, E-procedure, and R-procedure.

View Article and Find Full Text PDF

This study aims to develop advanced and training-free full-reference image quality assessment (FR-IQA) models based on deep neural networks. Specifically, we investigate measures that allow us to perceptually compare deep network features and reveal their underlying factors. We find that distribution measures enjoy advanced perceptual awareness and test the Wasserstein distance (WSD), Jensen-Shannon divergence (JSD), and symmetric Kullback-Leibler divergence (SKLD) measures when comparing deep features acquired from various pretrained deep networks, including the Visual Geometry Group (VGG) network, SqueezeNet, MobileNet, and EfficientNet.

View Article and Find Full Text PDF

The statistical regularities of natural images, referred to as natural scene statistics, play an important role in no-reference image quality assessment. However, it has been widely acknowledged that screen content images (SCIs), which are typically computer generated, do not hold such statistics. Here we make the first attempt to learn the statistics of SCIs, based upon which the quality of SCIs can be effectively determined.

View Article and Find Full Text PDF
Article Synopsis
  • The paper presents a new method called the Graph-Represented Image Distribution Similarity (GRIDS) index for evaluating image quality by comparing distorted images with reference images using graph-based representations.
  • It involves transforming images into graphs to capture visual perception features and then analyzing their distribution patterns by calculating joint probability distributions through cliques.
  • The proposed method shows strong performance in image quality prediction, matching or exceeding state-of-the-art techniques, and the source code is available for public use.
View Article and Find Full Text PDF
Article Synopsis
  • Residual coding helps make images smaller without losing important details by first using a method that loses some quality and then fixing any mistakes with lossless techniques.*
  • The new method is designed for 3D medical images and uses a video technology to reduce size and a special network called BCM-Net to fix errors more efficiently.*
  • This method looks at patterns within single images and between different images to improve how well it compresses the data, and tests showed it worked better than other methods.*
View Article and Find Full Text PDF

The development of data sensing technology has generated a vast amount of high-dimensional data, posing great challenges for machine learning models. Over the past decades, despite demonstrating its effectiveness in data classification, genetic programming (GP) has still encountered three major challenges when dealing with high-dimensional data: 1) solution diversity; 2) multiclass imbalance; and 3) large feature space. In this article, we have developed a problem-specific multiobjective GP framework (PS-MOGP) for handling classification tasks with high-dimensional data.

View Article and Find Full Text PDF
Article Synopsis
  • Video service providers prioritize video quality, and recent advancements in video quality assessment (VQA) using deep neural networks aim to address this concern.
  • The proposed HVS-5M method introduces five modules that mimic characteristics of the human visual system (HVS) and connect them in a cooperative way to enhance video quality evaluation.
  • Experimental results demonstrate that HVS-5M outperforms existing VQA methods, with additional studies confirming the effectiveness of each module in the system.
View Article and Find Full Text PDF

Salient instance segmentation (SIS) is an emerging field that evolves from salient object detection (SOD), aiming at identifying individual salient instances using segmentation maps. Inspired by the success of dynamic convolutions in segmentation tasks, this article introduces a keypoints-based SIS network (KepSalinst). It employs multiple keypoints, that is, the center and several peripheral points of an instance, as effective geometrical guidance for dynamic convolutions.

View Article and Find Full Text PDF

Due to the light absorption and scattering induced by the water medium, underwater images usually suffer from some degradation problems, such as low contrast, color distortion, and blurring details, which aggravate the difficulty of downstream underwater understanding tasks. Therefore, how to obtain clear and visually pleasant images has become a common concern of people, and the task of underwater image enhancement (UIE) has also emerged as the times require. Among existing UIE methods, Generative Adversarial Networks (GANs) based methods perform well in visual aesthetics, while the physical model-based methods have better scene adaptability.

View Article and Find Full Text PDF

Convolutional Neural Networks (CNNs) dominate image processing but suffer from local inductive bias, which is addressed by the transformer framework with its inherent ability to capture global context through self-attention mechanisms. However, how to inherit and integrate their advantages to improve compressed sensing is still an open issue. This paper proposes CSformer, a hybrid framework to explore the representation capacity of local and global features.

View Article and Find Full Text PDF
Article Synopsis
  • * The article introduces a new network called MPFR-Net that uses both equirectangular projection and cube-unfolding images for better detection of salient objects in 360° images.
  • * To enhance the integration of these projections, the authors developed a dynamic weighting fusion module and a filtration and refinement module to improve feature processing, leading to superior performance compared to existing methods on relevant datasets.
View Article and Find Full Text PDF

In this article, the problem of impulse noise image restoration is investigated. A typical way to eliminate impulse noise is to use an L norm data fitting term and a total variation (TV) regularization. However, a convex optimization method designed in this way always yields staircase artifacts.

View Article and Find Full Text PDF

A PHP Error was encountered

Severity: Warning

Message: fopen(/var/lib/php/sessions/ci_sessionqjc8j3o5il4a6fnje6e1f4ssm1p037sa): Failed to open stream: No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 177

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once

A PHP Error was encountered

Severity: Warning

Message: session_start(): Failed to read session data: user (path: /var/lib/php/sessions)

Filename: Session/Session.php

Line Number: 137

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once