Understanding a person's behavior from their 3D motion sequence is a fundamental problem in computer vision with many applications. An important component of this problem is 3D action localization, which involves recognizing what actions a person is performing, and when the actions occur in the sequence. To promote the progress of the 3D action localization community, we introduce a new, challenging, and more complex benchmark dataset, BABEL-TAL (BT), for 3D action localization. Important baselines and evaluating metrics, as well as human evaluations, are carefully established on this benchmark. We also propose a strong baseline model, i.e., Localizing Actions with Transformers (LocATe), that jointly localizes and recognizes actions in a 3D sequence. The proposed LocATe shows superior performance on BABEL-TAL as well as on the large-scale PKU-MMD dataset, achieving state-of-the-art performance by using only 10% of the labeled training data. Our research could advance the development of more accurate and efficient systems for human behavior analysis, with potential applications in areas such as human-computer interaction and healthcare.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11372174PMC
http://dx.doi.org/10.1038/s44172-024-00272-7DOI Listing

Publication Analysis

Top Keywords

action localization
12
localization
4
localization recognition
4
recognition human
4
action
4
human action
4
action transformers
4
transformers understanding
4
understanding person's
4
person's behavior
4

Similar Publications

New therapeutic agents in oncology are emerging rapidly, both in terms of the number of approved drugs and the technological and biological innovation of new treatments. Antibody-drug conjugates (ADC) offer a promising cancer therapy by specifically targeting tumor cells. ADC are composed of a monoclonal antibody recognizing the tumor cell via specific antigens, coupled with a potent cytotoxic agent that resembles classical chemotherapy.

View Article and Find Full Text PDF

A strong repetitive stimulus can occasionally enhance axonal excitability, leading to the generation of afterdischarge. This afterdischarge outlasts the stimulus period and originates either from the physiological spike initiation site, typically the axon initial segment, or from ectopic sites for spike generation. One of the possible mechanisms underlying the stimulus-induced ectopic afterdischarge is the local depolarization due to accumulated potassium ions surrounding the axonal membranes of the distal portion.

View Article and Find Full Text PDF

Understanding the role and mode of action of nutrient transporters requires information about their dynamic associations with plant membranes. Historically, apoplastic nutrient export has been associated with proteins localized at the plasma membrane (PM), while the role of endomembrane localization has been less explored. However, recent work on the PHOSPHATE 1 (PHO1) inorganic phosphate (Pi) exporter demonstrated that, although primarily localized at the Golgi and trans-Golgi network (TGN) vesicles, PHO1 does associate with the PM when clathrin-mediated endocytosis (CME) was inhibited, supporting a mechanism for Pi homeostasis involving exocytosis.

View Article and Find Full Text PDF

Indigenous communities worldwide continue to disproportionately bear the burden during pandemics due to ongoing health inequities and systemic exclusion from pandemic decision-making processes. As the global community prepares for the next pandemic, it is critical to prioritise Indigenous leadership and governance within public health responses. This commentary highlights successful models of Indigenous-led pandemic responses during COVID-19 in Canada and Australia.

View Article and Find Full Text PDF

Background: In most of the published plication techniques in face lift surgery, the vectors of plication are not entirely superiorly and vertically directed. The same applies with the deep plane, SMAS elevation techniques in the majority of which the vectors of traction are not superiorly vertically directed. The aging symptoms are mostly prominent at the anterior mobile face due to the gravity effect, and this is the area where attention should be focused to correct these symptoms following a face lift surgery.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!