Publications by authors named "Shiji Song"

MicroRNAs (miRNAs) can function as either tumor suppressors or oncogenes. This study explores the role of miR-675 in ovarian cancer (OC) using OC cell lines and an orthotopic mouse model. We demonstrate that miR-675 expression inhibits primary tumor growth and metastasis by targeting TGFβ1, suppressing epithelial to mesenchymal transition (EMT), and attenuating the TGFβ signaling pathway.

View Article and Find Full Text PDF

As one of fundamental ways to interpret spine images, detection of vertebral landmarks is an informative prerequisite for further diagnosis and management of spine disorders such as scoliosis and fractures. Most existing machine learning-based methods for automatic vertebral landmark detection suffer from overlapping landmarks or abnormally long distances between nearby landmarks against anatomical priors, and thus lack sufficient reliability and interpretability. To tackle the problem, this paper systematically utilizes anatomical prior knowledge in vertebral landmark detection.

View Article and Find Full Text PDF
Article Synopsis
  • Infrared and visible image fusion combines thermal and detailed texture images to enhance target visibility and texture quality.
  • Traditional methods using auto encoder-decoder frameworks are inflexible and rely on manual strategies, which can limit their effectiveness.
  • The new EMAFusion approach introduces a multiscale encoder and a learnable fusion network that uses advanced attention mechanisms, showing improved performance in the TNO image fusion dataset compared to current methods.
View Article and Find Full Text PDF

Differentiable architecture search (DARTS) has emerged as a promising technique for effective neural architecture search, and it mainly contains two steps to find the high-performance architecture. First, the DARTS supernet that consists of mixed operations will be optimized via gradient descent. Second, the final architecture will be built by the selected operations that contribute the most to the supernet.

View Article and Find Full Text PDF

This article investigates the local synchronization for delayed complex dynamical networks (CDNs) under self-triggered impulsive control (STIC) approaches involving delays. With the help of Lyapunov-Razumikhin methods and comparison principle, some design criteria of STIC strategies ensuring local synchronization for delayed CDNs with delayed impulses are provided, and Zeno behavior can be avoided. Compared with the existing results on synchronization of CDNs under STIC, in this article, time delays in both continuous and discrete system dynamics are well considered.

View Article and Find Full Text PDF

The superior performance of modern computer vision backbones (e.g., vision Transformers learned on ImageNet-1 K/22 K) usually comes with a costly training procedure.

View Article and Find Full Text PDF

Dynamic networks have become a pivotal area of study in deep learning due to their ability to selectively activate computing units (such as layers or channels) or dynamically allocate computation to information-rich regions. This capability significantly curtails unnecessary computations, adapting to varying inputs. Despite these advantages, the practical efficiency of dynamic models often falls short of theoretical computation.

View Article and Find Full Text PDF

Diabetic retinopathy (DR) is a serious ocular complication that can pose a serious risk to a patient's vision and overall health. Currently, the automatic grading of DR is mainly using deep learning techniques. However, the lesion information in DR images is complex, variable in shape and size, and randomly distributed in the images, which leads to some shortcomings of the current research methods, i.

View Article and Find Full Text PDF

Long-tailed distributions frequently emerge in real-world data, where a large number of minority categories contain a limited number of samples. Such imbalance issue considerably impairs the performance of standard supervised learning algorithms, which are mainly designed for balanced training sets. Recent investigations have revealed that supervised contrastive learning exhibits promising potential in alleviating the data imbalance.

View Article and Find Full Text PDF

Unsupervised domain adaptation (UDA) aims to adapt models learned from a well-annotated source domain to a target domain, where only unlabeled samples are given. Current UDA approaches learn domain-invariant features by aligning source and target feature spaces through statistical discrepancy minimization or adversarial training. However, these constraints could lead to the distortion of semantic feature structures and loss of class discriminability.

View Article and Find Full Text PDF

Offline reinforcement learning (RL) optimizes the policy on a previously collected dataset without any interactions with the environment, yet usually suffers from the distributional shift problem. To mitigate this issue, a typical solution is to impose a policy constraint on a policy improvement objective. However, existing methods generally adopt a "one-size-fits-all" practice, i.

View Article and Find Full Text PDF

Recent years have witnessed a growing interest in neural network-based medical image classification methods, which have demonstrated remarkable performance in this field. Typically, convolutional neural network (CNN) architectures have been commonly employed to extract local features. However, the transformer, a newly emerged architecture, has gained popularity due to its ability to explore the relevance of remote elements in an image through a self-attention mechanism.

View Article and Find Full Text PDF

Spatial redundancy widely exists in visual recognition tasks, i.e., discriminative features in an image or video frame usually correspond to only a subset of pixels, while the remaining regions are irrelevant to the task at hand.

View Article and Find Full Text PDF

Finite-time stability and stabilization problems of state-dependent delayed systems are studied in this paper. Different from discrete delays and time-dependent delays which can be well estimated over time, the information of state-dependent delays is usually hard to be estimated, especially when states are unknown or unmeasurable. To guarantee the stability of state-dependent delayed systems in the framework of finite time, a Razumikhin-type inequality is used, following which estimations on the settling time and the region of attraction are proposed.

View Article and Find Full Text PDF

Presently, research on deep learning-based change detection (CD) methods has become a hot topic. In particular, feature pyramid networks (FPNs) are widely used in CD tasks to gradually fuse semantic features. However, existing FPN-based CD methods do not correctly detect the complete change region and cannot accurately locate the boundaries of the change region.

View Article and Find Full Text PDF
Article Synopsis
  • The paper discusses the importance of punctual steel-making scheduling to reduce production costs but highlights the uncertainty in the pretreatment process timing.
  • A distributionally robust iron-steel allocation (DRISA) model is introduced to develop a solid scheduling plan that accounts for this uncertainty by using statistical moments.
  • An efficient variable neighborhood algorithm is proposed to solve large-scale problems quickly, and experiments show that the DRISA model results in more reliable schedules with fewer adjustments and delays compared to traditional methods.
View Article and Find Full Text PDF

In this article, we focus on a biobjective hot strip mill (HSM) scheduling problem arising in the steel industry. Besides the conventional objective regarding penalty costs, we have also considered minimizing the total starting times of rolling operations in order to reduce the energy consumption for slab reheating. The problem is complicated by the inevitable uncertainty in rolling processing times, which means deterministic scheduling models will be ineffective.

View Article and Find Full Text PDF

Deep reinforcement learning (RL) agents are becoming increasingly proficient in a range of complex control tasks. However, the agent's behavior is usually difficult to interpret due to the introduction of black-box function, making it difficult to acquire the trust of users. Although there have been some interesting interpretation methods for vision-based RL, most of them cannot uncover temporal causal information, raising questions about their reliability.

View Article and Find Full Text PDF

Spatial redundancy commonly exists in the learned representations of convolutional neural networks (CNNs), leading to unnecessary computation on high-resolution features. In this paper, we propose a novel Spatially Adaptive feature Refinement (SAR) approach to reduce such superfluous computation. It performs efficient inference by adaptively fusing information from two branches: one conducts standard convolution on input features at a lower spatial resolution, and the other one selectively refines a set of regions at the original resolution.

View Article and Find Full Text PDF

Meta reinforcement learning (meta-RL) is a promising technique for fast task adaptation by leveraging prior knowledge from previous tasks. Recently, context-based meta-RL has been proposed to improve data efficiency by applying a principled framework, dividing the learning procedure into task inference and task execution. However, the task information is not adequately leveraged in this approach, thus leading to inefficient exploration.

View Article and Find Full Text PDF
Dynamic Neural Networks: A Survey.

IEEE Trans Pattern Anal Mach Intell

November 2022

Dynamic neural network is an emerging research topic in deep learning. Compared to static models which have fixed computational graphs and parameters at the inference stage, dynamic networks can adapt their structures or parameters to different inputs, leading to notable advantages in terms of accuracy, computational efficiency, adaptiveness, etc. In this survey, we comprehensively review this rapidly developing area by dividing dynamic networks into three main categories: 1) sample-wise dynamic models that process each sample with data-dependent architectures or parameters; 2) spatial-wise dynamic networks that conduct adaptive computation with respect to different spatial locations of image data; and 3) temporal-wise dynamic models that perform adaptive inference along the temporal dimension for sequential data such as videos and texts.

View Article and Find Full Text PDF

Deep reinforcement learning is confronted with problems of sampling inefficiency and poor task migration capability. Meta-reinforcement learning (meta-RL) enables meta-learners to utilize the task-solving skills trained on similar tasks and quickly adapt to new tasks. However, meta-RL methods lack enough queries toward the relationship between task-agnostic exploitation of data and task-related knowledge introduced by latent context, limiting their effectiveness and generalization ability.

View Article and Find Full Text PDF

This article mainly explores the local input-to-state stability (LISS) property of a class of nonlinear systems via a saturated control strategy, where both the external disturbance and impulsive disturbance being fully considered. In terms of the Lyapunov method and inequality techniques, some sufficient conditions under which the system can be made LISS are proposed, and the elastic constraint relationship among saturated control gain, rate coefficients, external disturbance, and domain of initial value is revealed. Moreover, the optimization design procedures are provided with the hope of obtaining the estimates of admissible external disturbance and domain of initial value as large as possible, where the corresponding saturated control law can be designed by solving LMI -based conditions.

View Article and Find Full Text PDF

In this article, we study the multiroute job shop scheduling problem with continuous-limited output buffers (MRJSP-CLOBs). In contrast to the standard job shop scheduling problem (JSP), continuous-limited output buffers render the commonly used graph-based approaches inapplicable, and the multiroute issue further increases computational complexity. To this end, we formulate MRJSP-CLOB as a mixed-integer linear program (MILP), which is typically NP-hard.

View Article and Find Full Text PDF

Reinforcement learning (RL) is a promising technique for designing a model-free controller by interacting with the environment. Several researchers have applied RL to autonomous underwater vehicles (AUVs) for motion control, such as trajectory tracking. However, the existing RL-based controller usually assumes that the unknown AUV dynamics keep invariant during the operation period, limiting its further application in the complex underwater environment.

View Article and Find Full Text PDF