Arbitrary image stylization by neural networks has become a popular topic, and video stylization is attracting more attention as an extension of image stylization. However, when image stylization methods are applied to videos, unsatisfactory results that suffer from severe flickering effects appear. In this article, we conducted a detailed and comprehensive analysis of the cause of such flickering effects. Systematic comparisons among typical neural style transfer approaches show that the feature migration modules for state-of-the-art (SOTA) learning systems are ill-conditioned and could lead to a channelwise misalignment between the input content representations and the generated frames. Unlike traditional methods that relieve the misalignment via additional optical flow constraints or regularization modules, we focus on keeping the temporal consistency by aligning each output frame with the input frame. To this end, we propose a simple yet efficient multichannel correlation network (MCCNet), to ensure that output frames are directly aligned with inputs in the hidden feature space while maintaining the desired style patterns. An inner channel similarity loss is adopted to eliminate side effects caused by the absence of nonlinear operations such as softmax for strict alignment. Furthermore, to improve the performance of MCCNet under complex light conditions, we introduce an illumination loss during training. Qualitative and quantitative evaluations demonstrate that MCCNet performs well in arbitrary video and image style transfer tasks. Code is available at https://github.com/kongxiuxiu/MCCNetV2.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2022.3230084DOI Listing

Publication Analysis

Top Keywords

style transfer
12
image stylization
12
temporal consistency
8
flickering effects
8
exploring temporal
4
consistency arbitrary
4
style
4
arbitrary style
4
transfer channelwise
4
channelwise perspective
4

Similar Publications

Predicting cell morphological responses to perturbations using generative modeling.

Nat Commun

January 2025

Department of Computational Health, Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany.

Advancements in high-throughput screenings enable the exploration of rich phenotypic readouts through high-content microscopy, expediting the development of phenotype-based drug discovery. However, analyzing large and complex high-content imaging screenings remains challenging due to incomplete sampling of perturbations and the presence of technical variations between experiments. To tackle these shortcomings, we present IMage Perturbation Autoencoder (IMPA), a generative style-transfer model predicting morphological changes of perturbations across genetic and chemical interventions.

View Article and Find Full Text PDF

Style Transfer of Chinese Wuhu Iron Paintings Using Hierarchical Visual Transformer.

Sensors (Basel)

December 2024

College of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China.

Within the domain of traditional art, Chinese Wuhu Iron Painting distinguishes itself through its distinctive craftsmanship, aesthetic expressiveness, and choice of materials, presenting a formidable challenge in the arena of stylistic transformation. This paper introduces an innovative Hierarchical Visual Transformer (HVT) framework aimed at achieving effectiveness and precision in the style transfer of Wuhu Iron Paintings. The study begins with an in-depth analysis of the artistic style of Wuhu Iron Paintings, extracting key stylistic elements that meet technical requirements for style conversion.

View Article and Find Full Text PDF

Background: Cardiometabolic multimorbidity (CMM), characterized by the coexistence of diabetes, hypertension, and cardiovascular disease, poses a major health challenge in India, particularly in rural areas with limited healthcare resources. Lifestyle interventions can manage cardiometabolic risk factors, yet adherence remains suboptimal. Mobile health (mHealth) interventions offer a scalable approach for managing CMM by promoting behaviour change and medication adherence.

View Article and Find Full Text PDF

Association Kinetics for Perfluorinated -Alkyl Radicals.

J Phys Chem A

December 2024

Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois 60439, United States.

Radical-radical reaction channels are important in the pyrolysis and oxidation chemistry of perfluoroalkyl substances (PFAS). In particular, unimolecular dissociation reactions within unbranched -perfluoroalkyl chains, and their corresponding reverse barrierless association reactions, are expected to be significant contributors to the gas-phase thermal decomposition of families of species such as perfluorinated carboxylic acids and perfluorinated sulfonic acids. Unfortunately, experimental data for these reactions are scarce and uncertain.

View Article and Find Full Text PDF

Dynamic domain generalization for medical image segmentation.

Neural Netw

December 2024

School of Cyberspace, Hangzhou Dianzi University, Hangzhou, 310018, China; Suzhou Research Institute of Shandong University, Suzhou, 215123, China. Electronic address:

Domain Generalization-based Medical Image Segmentation (DGMIS) aims to enhance the robustness of segmentation models on unseen target domains by learning from fully annotated data across multiple source domains. Despite the progress made by traditional DGMIS methods, they still face several challenges. First, most DGMIS approaches rely on static models to perform inference on unseen target domains, lacking the ability to dynamically adapt to samples from different target domains.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!