Video event recognition using kernel methods with multilevel temporal alignment.

IEEE Trans Pattern Anal Mach Intell

School of Computer Engineering, Nanyang Technological University, 50 Nanyang Avenue, Blk N4, Singapore.

Published: November 2008

In this work, we systematically study the problem of event recognition in unconstrained news video sequences. We adopt the discriminative kernel-based method for which video clip similarity plays an important role. First, we represent a video clip as a bag of orderless descriptors extracted from all of the constituent frames and apply the earth mover's distance (EMD) to integrate similarities among frames from two clips. Observing that a video clip is usually comprised of multiple subclips corresponding to event evolution over time, we further build a multilevel temporal pyramid. At each pyramid level, we integrate the information from different subclips with Integer-value-constrained EMD to explicitly align the subclips. By fusing the information from the different pyramid levels, we develop Temporally Aligned Pyramid Matching (TAPM) for measuring video similarity. We conduct comprehensive experiments on the TRECVID 2005 corpus, which contains more than 6,800 clips. Our experiments demonstrate that 1) the TAPM multilevel method clearly outperforms single-level EMD (SLEMD) and 2) SLEMD outperforms keyframe and multiframe-based detection methods by a large margin. In addition, we conduct in-depth investigation of various aspects of the proposed techniques such as weight selection in SLEMD, sensitivity to temporal clustering, the effect of temporal alignment, and possible approaches for speed up. Extensive analysis of the results also reveals intuitive interpretation of video event recognition through video subclip alignment at different levels.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2008.129DOI Listing

Publication Analysis

Top Keywords

event recognition
12
video clip
12
video
8
video event
8
multilevel temporal
8
temporal alignment
8
recognition kernel
4
kernel methods
4
methods multilevel
4
temporal
4

Similar Publications

The Post-Acute COVID-19-Vaccination Syndrome in the Light of Pharmacovigilance.

Vaccines (Basel)

December 2024

Central Institute of Clinical Chemistry and Laboratory Diagnostics, Medical Faculty, Heinrich Heine University, University Hospital, 40255 Düsseldorf, Germany.

Clinical studies show that SARS-CoV-2 vaccination sometimes entails a severe and disabling chronic syndrome termed post-acute-COVID-19-vaccination syndrome (PACVS). PACVS shares similarities with long COVID. Today, PACVS is still not officially recognised as a disease.

View Article and Find Full Text PDF

Decoding the Genes Orchestrating Egg and Sperm Fusion Reactions and Their Roles in Fertility.

Biomedicines

December 2024

Medical Genomics Research Department, King Abdullah International Medical Research Center (KAIMRC), King Saud Bin Abdulaziz University for Health Sciences (KSAU-HS), Ministry of National Guard Health Affairs (MNGHA), Riyadh 11426, Saudi Arabia.

Mammalian fertilization is a complex and highly regulated process that has garnered significant attention, particularly with advancements in assisted reproductive technologies such as in vitro fertilization (IVF). The fusion of egg and sperm involves a sequence of molecular and cellular events, including capacitation, the acrosome reaction, adhesion, and membrane fusion. Critical genetic factors, such as IZUMO1, JUNO (also known as FOLR4), CD9, and several others, have been identified as essential mediators in sperm-egg recognition and membrane fusion.

View Article and Find Full Text PDF

Effects of Language Proficiency on Selective Attention Patterns at Segmenting Boundaries in English Audio Sentences.

Brain Sci

November 2024

School of Foreign Languages, Hunan University, Lushannan Road No. 2, Yuelu District, Changsha 410082, China.

Background/objectives: Normative perceptual segmentation facilitates event perception, comprehension, and memory. Given that native English listeners' normative perceptual segmentation of English speech streams coexists with a highly selective attention pattern at segmentation boundaries, it is significant to test whether Chinese learners of English have a different attention pattern at boundaries, thereby checking whether they perform a normative segmentation.

Methods: Thirty Chinese learners of English with relatively higher language proficiency (CLH) and 26 with relatively lower language proficiency (CLL) listened to a series of English audio sentences.

View Article and Find Full Text PDF

The human heterogeneous nuclear ribonucleoprotein (hnRNP) A1 is a prototypical RNA-binding protein essential in regulating a wide range of post-transcriptional events in cells. As a multifunctional protein with a key role in RNA metabolism, deregulation of its functions has been linked to neurodegenerative diseases, tumour aggressiveness and chemoresistance, which has fuelled efforts to develop novel therapeutics that modulates its RNA binding activities. Here, using a combination of Molecular Dynamics (MD) simulations and graph neural network pockets predictions, we showed that hnRNPA1 N-terminal RNA binding domain (UP1) contains several cryptic pockets capable of binding small molecules.

View Article and Find Full Text PDF

A cerebral spinal fluid (CSF) leak from the anterior skull base is a challenging neurosurgical issue that requires prompt recognition and treatment. Options for treatment include medical and surgical repair. A systematic review was performed screening for both retrospective and prospective clinical studies evaluating the efficacy of acetazolamide in the event of CSF leaks of the anterior skull base.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!