Publications by authors named "Yang Xiaokang"

A numerical tool for simulating the detection signals of electromagnetic nondestructive testing technology (ENDT) is of great significance for studying detection mechanisms and improving detection efficiency. However, the quantitative analysis methods for ENDT have not yet been sufficiently studied due to the absence of an effective constitutive model. This paper proposed a new magneto-mechanical model that can reflect the dependence of relative permeability on elasto-plastic deformation and proposed a finite element-infinite element coupling method that can replace the traditional finite element truncation boundary.

View Article and Find Full Text PDF

Robots need to sense information about the external environment before moving, which helps them to recognize and understand their surroundings so that they can plan safe and effective paths and avoid obstacles. Conventional algorithms using a single sensor cannot obtain enough information and lack real-time capabilities. To solve these problems, we propose an information perception algorithm with vision as the core and the fusion of LiDAR.

View Article and Find Full Text PDF

Dynamic graphs arise in various real-world applications, and it is often welcomed to model the dynamics in continuous time domain for its flexibility. This paper aims to design an easy-to-use pipeline (EasyDGL which is also due to its implementation by DGL toolkit) composed of three modules with both strong fitting ability and interpretability, namely encoding, training and interpreting: i) a temporal point process (TPP) modulated attention architecture to endow the continuous-time resolution with the coupled spatiotemporal dynamics of the graph with edge-addition events; ii) a principled loss composed of task-agnostic TPP posterior maximization based on observed events, and a task-aware loss with a masking strategy over dynamic graph, where the tasks include dynamic link prediction, dynamic node classification and node traffic forecasting; iii) interpretation of the outputs (e.g.

View Article and Find Full Text PDF

The increasing prevalence of myopia worldwide presents a significant public health challenge. A key strategy to combat myopia is with early detection and prediction in children as such examination allows for effective intervention using readily accessible imaging technique. To this end, we introduced DeepMyopia, an artificial intelligence (AI)-enabled decision support system to detect and predict myopia onset and facilitate targeted interventions for children at risk using routine retinal fundus images.

View Article and Find Full Text PDF
Article Synopsis
  • Primary diabetes care and diabetic retinopathy (DR) screening face challenges due to a lack of trained primary care physicians, especially in low-resource areas.
  • The integrated image-language system, DeepDR-LLM, combines a language model and deep learning to help PCPs provide tailored diabetes management recommendations, showing comparable or better accuracy than PCPs in diagnosing DR.
  • In a study, patients assisted by DeepDR-LLM demonstrated improved self-management and adherence to referral recommendations, indicating that the system enhances both care quality and patient outcomes.
View Article and Find Full Text PDF

Context: Large-for-gestational-age (LGA), one of the most common complications of gestational diabetes mellitus (GDM), has become a global concern. The predictive performance of common continuous glucose monitoring (CGM) metrics for LGA is limited.

Objective: We aimed to develop and validate an artificial intelligence (AI) based model to determine the probability of women with GDM giving birth to LGA infants during pregnancy using CGM measurements together with demographic data and metabolic indicators.

View Article and Find Full Text PDF

Motivation: Retrosynthesis planning poses a formidable challenge in the organic chemical industry, particularly in pharmaceuticals. Single-step retrosynthesis prediction, a crucial step in the planning process, has witnessed a surge in interest in recent years due to advancements in AI for science. Various deep learning-based methods have been proposed for this task in recent years, incorporating diverse levels of additional chemical knowledge dependency.

View Article and Find Full Text PDF

Recent years have witnessed the incredible performance boost of data-driven deep visual object trackers. Despite the success, these trackers require millions of sequential manual labels on videos for supervised training, implying the heavy burden of human annotating. This raises a crucial question: how to train a powerful tracker from abundant videos using limited manual annotations? In this paper, we challenge the conventional belief that frame-by-frame labeling is indispensable, and show that providing a small number of annotated bounding boxes in each video is sufficient for training a strong tracker.

View Article and Find Full Text PDF

Intraoperative imaging techniques for reconstructing deformable tissues in vivo are pivotal for advanced surgical systems. Existing methods either compromise on rendering quality or are excessively computationally intensive, often demanding dozens of hours to perform, which significantly hinders their practical application. In this paper, we introduce Fast Orthogonal Plane (Forplane), a novel, efficient framework based on neural radiance fields (NeRF) for the reconstruction of deformable tissues.

View Article and Find Full Text PDF
Article Synopsis
  • The "DRAC - Diabetic Retinopathy Analysis Challenge" was held at the MICCAI 2022 conference, introducing the DRAC ultra-wide optical coherence tomography angiography dataset containing 1,103 images to tackle diabetic retinopathy (DR) analysis tasks.
  • The challenge focused on three main clinical tasks: segmenting DR lesions, assessing image quality, and grading diabetic retinopathy, attracting participation from multiple teams with 11, 12, and 13 solutions submitted for each task.
  • The paper summarizes the best-performing solutions, which can aid in developing better classification and segmentation models for DR diagnosis, and the dataset is now available to enhance computer-aided diagnostic systems in the healthcare field.
View Article and Find Full Text PDF

In this paper, we present a simple yet effective continual learning method for blind image quality assessment (BIQA) with improved quality prediction accuracy, plasticity-stability trade-off, and task-order/-length robustness. The key step in our approach is to freeze all convolution filters of a pre-trained deep neural network (DNN) for an explicit promise of stability, and learn task-specific normalization parameters for plasticity. We assign each new IQA dataset (i.

View Article and Find Full Text PDF

Diabetic retinopathy (DR) is the leading cause of preventable blindness worldwide. The risk of DR progression is highly variable among different individuals, making it difficult to predict risk and personalize screening intervals. We developed and validated a deep learning system (DeepDR Plus) to predict time to DR progression within 5 years solely from fundus images.

View Article and Find Full Text PDF

The urban spatial structure represents the distribution of public and private spaces in cities and how people move within them. Although it usually evolves slowly, it can change quickly during large-scale emergency events, as well as due to urban renewal in rapidly developing countries. Here we present an approach to delineate such urban dynamics in quasi-real time through a human mobility metric, the mobility centrality index ΔKS.

View Article and Find Full Text PDF

World models learn the consequences of actions in vision-based interactive systems. However, in practical scenarios like autonomous driving, noncontrollable dynamics that are independent or sparsely dependent on action signals often exist, making it challenging to learn effective world models. To address this issue, we propose Iso-Dream++, a model-based reinforcement learning approach that has two main contributions.

View Article and Find Full Text PDF

The increasing prevalence of diabetes, high avoidable morbidity and mortality due to diabetes and diabetic complications, and related substantial economic burden make diabetes a significant health challenge worldwide. A shortage of diabetes specialists, uneven distribution of medical resources, low adherence to medications, and improper self-management contribute to poor glycemic control in patients with diabetes. Recent advancements in digital health technologies, especially artificial intelligence (AI), provide a significant opportunity to achieve better efficiency in diabetes care, which may diminish the increase in diabetes-related health-care expenditures.

View Article and Find Full Text PDF

Modeling the architecture search process on a supernet and applying a differentiable method to find the importance of architecture are among the leading tools for differentiable neural architectures search (DARTS). One fundamental problem in DARTS is how to discretize or select a single-path architecture from the pretrained one-shot architecture. Previous approaches mainly exploit heuristic or progressive search methods for discretization and selection, which are not efficient and easily trapped by local optimizations.

View Article and Find Full Text PDF

Recently neural architecture (NAS) search has attracted great interest in academia and industry. It remains a challenging problem due to the huge search space and computational costs. Recent studies in NAS mainly focused on the usage of weight sharing to train a SuperNet once.

View Article and Find Full Text PDF

The high-quality pathological microscopic images are essential for physicians or pathologists to make a correct diagnosis. Image quality assessment (IQA) can quantify the visual distortion degree of images and guide the imaging system to improve image quality, thus raising the quality of pathological microscopic images. Current IQA methods are not ideal for pathological microscopy images due to their specificity.

View Article and Find Full Text PDF

The significance of artistry in creating animated virtual characters is widely acknowledged, and motion style is a crucial element in this process. There has been a long-standing interest in stylizing character animations with style transfer methods. However, this kind of models can only deal with short-term motions and yield deterministic outputs.

View Article and Find Full Text PDF

Graph matching (GM) has been a long-standing combinatorial problem due to its NP-hard nature. Recently (deep) learning-based approaches have shown their superiority over the traditional solvers while the methods are almost based on supervised learning which can be expensive or even impractical. We develop a unified unsupervised framework from matching two graphs to multiple graphs, without correspondence ground truth for training.

View Article and Find Full Text PDF

The rapid development of deep learning has made a great progress in image segmentation, one of the fundamental tasks of computer vision. However, the current segmentation algorithms mostly rely on the availability of pixel-level annotations, which are often expensive, tedious, and laborious. To alleviate this burden, the past years have witnessed an increasing attention in building label-efficient, deep-learning-based image segmentation algorithms.

View Article and Find Full Text PDF

Comprehensive understanding of video content requires both spatial and temporal localization. However, there lacks a unified video action localization framework, which hinders the coordinated development of this field. Existing 3D CNN methods take fixed and limited input length at the cost of ignoring temporally long-range cross-modal interaction.

View Article and Find Full Text PDF

Tumor pathology can assess patient prognosis based on a morphological deviation of tumor tissue from normal. Digitizing whole slide images (WSIs) of tissue enables the use of deep learning (DL) techniques in pathology, which may shed light on prognostic indicators of cancers, and avoid biases introduced by human experience. We aim to explore new prognostic indicators of ovarian cancer (OC) patients using the DL framework on WSIs, and provide a valuable approach for OC risk stratification.

View Article and Find Full Text PDF

Rare diseases, which are severely underrepresented in basic and clinical research, can particularly benefit from machine learning techniques. However, current learning-based approaches usually focus on either mono-modal image data or matched multi-modal data, whereas the diagnosis of rare diseases necessitates the aggregation of unstructured and unmatched multi-modal image data due to their rare and diverse nature. In this study, we therefore propose diagnosis-guided multi-to-mono modal generation networks (TMM-Nets) along with training and testing procedures.

View Article and Find Full Text PDF

With the development of multimedia technology, Augmented Reality (AR) has become a promising next-generation mobile platform. The primary value of AR is to promote the fusion of digital contents and real-world environments, however, studies on how this fusion will influence the Quality of Experience (QoE) of these two components are lacking. To achieve better QoE of AR, whose two layers are influenced by each other, it is important to evaluate its perceptual quality first.

View Article and Find Full Text PDF