Automatically recognising surgical gestures from surgical data is an important building block of automated activity recognition and analytics, technical skill assessment, intra-operative assistance and eventually robotic automation. The complexity of articulated instrument trajectories and the inherent variability due to surgical style and patient anatomy make analysis and fine-grained segmentation of surgical motion patterns from robot kinematics alone very difficult. Surgical video provides crucial information from the surgical site with context for the kinematic data and the interaction between the instruments and tissue. Yet sensor fusion between the robot data and surgical video stream is non-trivial because the data have different frequency, dimensions and discriminative capability. In this paper, we integrate multimodal attention mechanisms in a two-stream temporal convolutional network to compute relevance scores and weight kinematic and visual feature representations dynamically in time, aiming to aid multimodal network training and achieve effective sensor fusion. We report the results of our system on the JIGSAWS benchmark dataset and on a new in vivo dataset of suturing segments from robotic prostatectomy procedures. Our results are promising and obtain multimodal prediction sequences with higher accuracy and better temporal structure than corresponding unimodal solutions. Visualization of attention scores also gives physically interpretable insights on network understanding of strengths and weaknesses of each sensor.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7616924 | PMC |
http://dx.doi.org/10.1109/TMI.2022.3147640 | DOI Listing |
J Neurol
January 2025
Department of Neurology, School of Medical Sciences, University of Campinas-UNICAMP, Universitaria "Zeferino Vaz", Rua Tessália Vieira de Camargo, 126. Cidade, Campinas, SP, 13083-887, Brazil.
Background: Skeletal and cardiac muscle damage have been increasingly recognized in female carriers of DMD pathogenic variants (DMDc). Little is known about cognitive impairment in these women or whether they have structural brain damage.
Objective: To characterize the cognitive profile in a Brazilian cohort of DMDc and determine whether they have structural brain abnormalities using multimodal MRI.
J Pharm Anal
December 2024
School of Chemical Science and Engineering, Tongji University, Shanghai, 200092, China.
Reactive oxygen species (ROS)-mediated anticancer modalities, which disturb the redox balance of cancer cells through multi-pathway simulations, hold great promise for effective cancer management. Among these, cooperative physical and biochemical activation strategies have attracted increasing attention because of their spatiotemporal controllability, low toxicity, and high therapeutic efficacy. Herein, we demonstrate a nanogel complex as a multilevel ROS-producing system by integrating chloroperoxidase (CPO) into gold nanorod (AuNR)-based nanogels (ANGs) for cascade-amplifying photothermal-enzymatic synergistic tumor therapy.
View Article and Find Full Text PDFCirc Cardiovasc Imaging
January 2025
Division of Cardiology, Massachusetts General Hospital, Boston. (S.P.M., A.T.-R., A.C.S.S., S.H., C.P.L., T.W.C., D.F.Y., E.Y.).
Disorders of the pulmonic valve (PV) receive considerably less attention than other forms of valvular heart disease. Due to the dramatically improved survival of children with congenital heart disease over the last 5 decades, there has been a steady increase in the prevalence of adults with congenital heart disease, which necessitates that clinicians become familiar with the anatomy and the evaluation of right ventricular outflow tract and PV anomalies. A multimodality imaging approach using echocardiography, cardiac computed tomography, and magnetic resonance imaging is essential for a comprehensive evaluation of the anatomy and function of the right ventricular outflow tract, PV, and supravalvular region.
View Article and Find Full Text PDFBMC Bioinformatics
January 2025
School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China.
Drug-target interactions (DTIs) are pivotal in drug discovery and development, and their accurate identification can significantly expedite the process. Numerous DTI prediction methods have emerged, yet many fail to fully harness the feature information of drugs and targets or address the issue of feature redundancy. We aim to refine DTI prediction accuracy by eliminating redundant features and capitalizing on the node topological structure to enhance feature extraction.
View Article and Find Full Text PDFSensors (Basel)
January 2025
Faculty of Science and Technology, Keio University, Yokohama 223-8522, Japan.
Person identification is a critical task in applications such as security and surveillance, requiring reliable systems that perform robustly under diverse conditions. This study evaluates the Vision Transformer (ViT) and ResNet34 models across three modalities-RGB, thermal, and depth-using datasets collected with infrared array sensors and LiDAR sensors in controlled scenarios and varying resolutions (16 × 12 to 640 × 480) to explore their effectiveness in person identification. Preprocessing techniques, including YOLO-based cropping, were employed to improve subject isolation.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!