Publications by authors named "Zenglin Shi"

Article Synopsis
  • In clinical settings, certain medical imaging techniques may be unavailable due to factors like cost and radiation, making unpaired cross-modality translation techniques essential for synthesizing target images without direct pairing.
  • The proposed target-guided diffusion model (TGDM) uses a perception prioritized weight scheme to enhance learning and incorporates a pre-trained classifier during sampling to minimize unwanted remnants from source data.
  • Experiments on MRI-CT and MRI-US datasets show that TGDM produces realistic images that accurately represent anatomical features, supported by subjective assessments confirming its clinical utility.
View Article and Find Full Text PDF

In many practical applications, massive data are observed from multiple sources, each of which contains multiple cohesive views, called hierarchical multiview (HMV) data, such as image-text objects with different types of visual and textual features. Naturally, the inclusion of source and view relationships offers a comprehensive view of the input HMV data and achieves an informative and correct clustering result. However, most existing multiview clustering (MVC) methods can only process single-source data with multiple views or multisource data with single type of feature, failing to consider all the views across multiple sources.

View Article and Find Full Text PDF

The goal of this paper is guided image filtering, which emphasizes the importance of structure transfer during filtering by means of an additional guidance image. Where classical guided filters transfer structures using hand-designed functions, recent guided filters have been considerably advanced through parametric learning of deep networks. The state-of-the-art leverages deep networks to estimate the two core coefficients of the guided filter.

View Article and Find Full Text PDF

Multiview clustering (MVC) has recently been the focus of much attention due to its ability to partition data from multiple views via view correlations. However, most MVC methods only learn either interfeature correlations or intercluster correlations, which may lead to unsatisfactory clustering performance. To address this issue, we propose a novel dual-correlated multivariate information bottleneck (DMIB) method for MVC.

View Article and Find Full Text PDF

Is recurrent network really necessary for learning a good visual representation for video based person re-identification (VPRe-id)? In this paper, we first show that the common practice of employing recurrent neural networks (RNNs) to aggregate temporal-spatial features may not be optimal. Specifically, with a diagnostic analysis, we show that the recurrent structure may not be effective learn temporal dependencies than what we expected and implicitly yields an orderless representation. Based on this observation, we then present a simple yet surprisingly powerful approach for VPRe-id, where we treat VPRe-id as an efficient orderless ensemble of image based person re-identification problem.

View Article and Find Full Text PDF

Nonlinear regression has been extensively employed in many computer vision problems (e.g., crowd counting, age estimation, affective computing).

View Article and Find Full Text PDF

Knowledge of whole heart anatomy is a prerequisite for many clinical applications. Whole heart segmentation (WHS), which delineates substructures of the heart, can be very valuable for modeling and analysis of the anatomy and functions of the heart. However, automating this segmentation can be challenging due to the large variation of the heart shape, and different image qualities of the clinical data.

View Article and Find Full Text PDF

The balance of neighborhood space around a central point is an important concept in cluster analysis. It can be used to effectively detect cluster boundary objects. The existing neighborhood analysis methods focus on the distribution of data, i.

View Article and Find Full Text PDF

Pooling is a key mechanism in deep convolutional neural networks (CNNs) which helps to achieve translation invariance. Numerous studies, both empirically and theoretically, show that pooling consistently boosts the performance of the CNNs. The conventional pooling methods are operated on activation values.

View Article and Find Full Text PDF