Video captioning aims to generate natural language descriptions for a given video clip. Existing methods mainly focus on end-to-end representation learning via word-by-word comparison between predicted captions and ground-truth texts. Although significant progress has been made, such supervised approaches neglect semantic alignment between visual and linguistic entities, which may negatively affect the generated captions. In this work, we propose a hierarchical modular network to bridge video representations and linguistic semantics at four granularities before generating captions: entity, verb, predicate, and sentence. Each level is implemented by one module that embeds the corresponding semantics into video representations. Additionally, we present a reinforcement learning module based on the scene graph of captions to better measure sentence similarity. Extensive experimental results show that the proposed method performs favorably against state-of-the-art models on three widely used benchmark datasets: Microsoft Research Video Description Corpus (MSVD), MSR-Video to Text (MSR-VTT), and Video-And-TEXt (VATEX).
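To make the contrast with word-by-word comparison concrete, the following minimal Python sketch scores a caption by the overlap of scene-graph tuples rather than by token position. It is an illustration only, not the reward used in the paper: the function name `tuple_f1` is hypothetical, the F1-style score is an assumed stand-in for the sentence similarity measure, and the tuples are written by hand in place of a real scene-graph parser.

```python
from typing import Iterable, Tuple

# A scene-graph tuple is either an object ("man",) or a relation
# ("man", "play", "guitar"). Hand-written here; a real system would
# obtain them from a scene-graph parser (assumption, not shown).
GraphTuple = Tuple[str, ...]


def tuple_f1(predicted: Iterable[GraphTuple], reference: Iterable[GraphTuple]) -> float:
    """F1 overlap between two sets of scene-graph tuples (hypothetical reward)."""
    pred, ref = set(predicted), set(reference)
    if not pred or not ref:
        return 0.0
    matched = len(pred & ref)
    if matched == 0:
        return 0.0
    precision = matched / len(pred)
    recall = matched / len(ref)
    return 2 * precision * recall / (precision + recall)


reference = {("man",), ("guitar",), ("man", "play", "guitar")}

# Adds one extra detail but keeps the objects and the action.
paraphrase = {("man",), ("guitar",), ("man", "play", "guitar"), ("man", "sit")}

# Keeps the objects but gets the action wrong.
wrong_action = {("man",), ("guitar",), ("man", "hold", "guitar")}

reward_paraphrase = tuple_f1(paraphrase, reference)
reward_wrong = tuple_f1(wrong_action, reference)
```

In this toy setup the caption that preserves the action scores higher than the one that changes it, even though both share the same objects with the reference. In a reinforcement learning setup, a score of this kind could serve as the sequence-level reward.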
DOI: http://dx.doi.org/10.1109/TPAMI.2023.3327677