With the success of deep learning in classifying short trimmed videos, more attention has been focused on temporally segmenting and classifying activities in long untrimmed videos. State-of-the-art approaches for action segmentation utilize several layers of temporal convolution and temporal pooling. Despite the capabilities of these approaches in capturing temporal dependencies, their predictions suffer from over-segmentation errors. In this paper, we propose a multi-stage architecture for the temporal action segmentation task that overcomes the limitations of the previous approaches. The first stage generates an initial prediction that is refined by the next ones. In each stage we stack several layers of dilated temporal convolutions covering a large receptive field with few parameters. While this architecture already performs well, lower layers still suffer from a small receptive field. To address this limitation, we propose a dual dilated layer that combines both large and small receptive fields. We further decouple the design of the first stage from the refining stages to address the different requirements of these stages. Extensive evaluation shows the effectiveness of the proposed model in capturing long-range dependencies and recognizing action segments. Our models achieve state-of-the-art results on three datasets: 50Salads, Georgia Tech Egocentric Activities (GTEA), and the Breakfast dataset.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2020.3021756DOI Listing

Publication Analysis

Top Keywords

action segmentation
12
receptive field
8
small receptive
8
temporal
6
ms-tcn++ multi-stage
4
multi-stage temporal
4
temporal convolutional
4
convolutional network
4
action
4
network action
4

Similar Publications

Addressing methane emissions across the liquefied natural gas (LNG) supply chain is key to reducing climate impacts of LNG. Actions to address methane emissions have emphasized the importance of the use of measurement-informed emissions inventories given the systematic underestimation in official greenhouse gas (GHG) emission inventories. Despite significant progress in field measurements of GHG emissions across the natural gas supply chain, no detailed measurements at US liquefaction terminals are publicly available.

View Article and Find Full Text PDF

Background: While antimicrobial use (AMU) in human healthcare has received significant attention as a key driver of antimicrobial resistance (AMR), less emphasis has been placed on AMU practices and attitudes in animal husbandry. To address this gap, this study examines the patterns and underlying drivers of AMU on animal farms.

Methods: A survey instrument was distributed to farm staff in 150 animal farms across 15 Egyptian governorates.

View Article and Find Full Text PDF

Dynamic single cell transcriptomics defines kidney FGF23/KL bioactivity and novel segment-specific inflammatory targets.

Kidney Int

January 2025

Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA, 46202; Department of Medicine/Division of Nephrology, Indiana University School of Medicine, Indianapolis, IN, USA, 46202. Electronic address:

Fibroblast growth factor 23 (FGF23) via its coreceptor αKlotho (KL) provides critical control of phosphate metabolism, which is altered in both rare and very common syndromes. However, the spatial-temporal mechanisms dictating kidney FGF23 functions remain poorly understood. Thus, developing approaches to modify specific FGF23-dictated pathways has proven problematic.

View Article and Find Full Text PDF

The relentless surge in carbon emissions is exacting a devastating toll on human wellbeing, critical infrastructure, and natural ecosystems, leaving a stark and distressing legacy of destruction. Communities worldwide are reeling from the impacts of pervasive smog, record-breaking wildfires, and deadly heatwaves-manifestations of a climate crisis that grows more severe by the day. Once a vanguard of environmental policy, the Organisation for Economic Co-operation and Development (OECD) now struggles with exceeding emissions targets, eroding its credibility and influence.

View Article and Find Full Text PDF

Background: Human immunodeficiency virus (HIV) and acquired immunodeficiency syndrome (AIDS) have evolved into a global development burden, with nearly 40 million infections and 25 million deaths. Compared to other age groups, youth have increased risks of contracting the disease due to social and health structural factors; thus, additional efforts are needed to effectively tackle the challenges associated with this age group. Epidemiological studies employing unsupervised learning techniques are essential for shaping public health policies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!