Most current visual attention prediction studies ignore audio information. However, sound can influence visual attention, and this influence has been widely investigated and demonstrated by many psychological studies. In this paper, we propose a novel multi-modal saliency (MMS) model for videos containing scenes with high audio-visual correspondence. In such scenes, humans tend to be attracted by the sound sources, and it is also possible to localize the sound sources via cross-modal analysis. Specifically, we first detect the spatial and temporal saliency maps from the visual modality using a novel free energy principle. Then we propose to detect the audio saliency map from both audio and visual modalities by localizing the moving-sounding objects using cross-modal kernel canonical correlation analysis, which is the first of its kind in the literature. Finally, we propose a new two-stage adaptive audiovisual saliency fusion method to integrate the spatial, temporal, and audio saliency maps into our audio-visual saliency map. The proposed MMS model captures the influence of audio, which is not considered in the latest deep learning based saliency models. To take advantage of both deep saliency modeling and audio-visual saliency modeling, we propose to combine deep saliency models and the MMS model via late fusion, and we find that an average performance gain of 5% is obtained. Experimental results on audio-visual attention databases show that the introduced models incorporating audio cues significantly outperform state-of-the-art image and video saliency models that use only the visual modality.
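
The abstract does not state the exact rule used to combine a deep saliency model with the MMS model, so the following is only a minimal sketch of what a late fusion of two saliency maps could look like. It assumes each map is min-max normalized to [0, 1] and then blended with a fixed convex weight; the function names `normalize` and `late_fusion` and the `weight` parameter are hypothetical and are not taken from the paper.

```python
from typing import List

SaliencyMap = List[List[float]]


def normalize(m: SaliencyMap) -> SaliencyMap:
    """Min-max scale a saliency map to [0, 1]; a constant map becomes all zeros."""
    flat = [v for row in m for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]


def late_fusion(deep_map: SaliencyMap, mms_map: SaliencyMap,
                weight: float = 0.5) -> SaliencyMap:
    """Blend two saliency maps pixel by pixel after normalization.

    ASSUMPTION: a fixed convex weight stands in for whatever fusion rule
    the paper actually uses; `weight` is the share given to `deep_map`.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    if len(deep_map) != len(mms_map) or any(
        len(ra) != len(rb) for ra, rb in zip(deep_map, mms_map)
    ):
        raise ValueError("saliency maps must have the same shape")
    a, b = normalize(deep_map), normalize(mms_map)
    return [
        [weight * x + (1.0 - weight) * y for x, y in zip(ra, rb)]
        for ra, rb in zip(a, b)
    ]


if __name__ == "__main__":
    deep = [[0.0, 2.0], [4.0, 8.0]]  # toy output of a visual-only deep model
    mms = [[1.0, 1.0], [1.0, 3.0]]   # toy audio-visual (MMS-style) map
    for row in late_fusion(deep, mms):
        print(row)
```

Normalizing before blending keeps one model from dominating simply because its raw scores span a larger range; an adaptive scheme would replace the fixed `weight` with a value estimated per frame or per video.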

Source
http://dx.doi.org/10.1109/TIP.2020.2966082
