4 months ago

G-TAD: Sub-Graph Localization for Temporal Action Detection

Mengmeng Xu Chen Zhao David S. Rojas Ali Thabet Bernard Ghanem

Abstract

Temporal action detection is a fundamental yet challenging task in video understanding. Video context is a critical cue to effectively detect actions, but current works mainly focus on temporal context, while neglecting semantic context as well as other important context properties. In this work, we propose a graph convolutional network (GCN) model to adaptively incorporate multi-level semantic context into video features and cast temporal action detection as a sub-graph localization problem. Specifically, we formulate video snippets as graph nodes, snippet-snippet correlations as edges, and actions associated with context as target sub-graphs. With graph convolution as the basic operation, we design a GCN block called GCNeXt, which learns the features of each node by aggregating its context and dynamically updates the edges in the graph. To localize each sub-graph, we also design an SGAlign layer to embed each sub-graph into the Euclidean space. Extensive experiments show that G-TAD is capable of finding effective video context without extra supervision and achieves state-of-the-art performance on two detection benchmarks. On ActivityNet-1.3, it obtains an average mAP of 34.09%; on THUMOS14, it reaches 51.6% at [email protected] when combined with a proposal processing method. G-TAD code is publicly available at https://github.com/frostinassiky/gtad.

Code Repositories

812618101/TAL-Demo

Mentioned in GitHub

coolbay/VSGN

pytorch

Mentioned in GitHub

handhand123/prsa-net

pytorch

Mentioned in GitHub

musicalOffering/sola

pytorch

Mentioned in GitHub

carpedkm/G_TAD_customizing

pytorch

Mentioned in GitHub

Frostinassiky/gtad

Official

pytorch

Mentioned in GitHub

sauradip/fewshotQAT

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
temporal-action-localization-on-activitynet	G-TAD	mAP: 34.09 mAP [email protected]: 50.36 mAP [email protected]: 34.60 mAP [email protected]: 9.02
temporal-action-localization-on-epic-kitchens	G-TAD (verb)	Avg mAP (0.1-0.5): 9.4 mAP [email protected]: 12.1 mAP [email protected]: 11.0 mAP [email protected]: 9.4 mAP [email protected]: 8.1 mAP [email protected]: 6.5
temporal-action-localization-on-fineaction	G-TAD (i3d feature)	mAP: 9.06 mAP [email protected]: 13.74 mAP [email protected]: 8.83 mAP [email protected]: 3.06
temporal-action-localization-on-thumos14	G-TAD	mAP [email protected]: 40.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Console

4 months ago

G-TAD: Sub-Graph Localization for Temporal Action Detection

View Paper Details

Mengmeng Xu Chen Zhao David S. Rojas Ali Thabet Bernard Ghanem

Abstract

Code Repositories

812618101/TAL-Demo

Mentioned in GitHub

coolbay/VSGN

pytorch

Mentioned in GitHub

handhand123/prsa-net

pytorch

Mentioned in GitHub

musicalOffering/sola

pytorch

Mentioned in GitHub

carpedkm/G_TAD_customizing

pytorch

Mentioned in GitHub

Frostinassiky/gtad

Official

pytorch

Mentioned in GitHub

sauradip/fewshotQAT

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
temporal-action-localization-on-activitynet	G-TAD	mAP: 34.09 mAP [email protected]: 50.36 mAP [email protected]: 34.60 mAP [email protected]: 9.02
temporal-action-localization-on-epic-kitchens	G-TAD (verb)	Avg mAP (0.1-0.5): 9.4 mAP [email protected]: 12.1 mAP [email protected]: 11.0 mAP [email protected]: 9.4 mAP [email protected]: 8.1 mAP [email protected]: 6.5
temporal-action-localization-on-fineaction	G-TAD (i3d feature)	mAP: 9.06 mAP [email protected]: 13.74 mAP [email protected]: 8.83 mAP [email protected]: 3.06
temporal-action-localization-on-thumos14	G-TAD	mAP [email protected]: 40.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

G-TAD: Sub-Graph Localization for Temporal Action Detection

Mengmeng Xu Chen Zhao David S. Rojas Ali Thabet Bernard Ghanem

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters

Command Palette

G-TAD: Sub-Graph Localization for Temporal Action Detection

Mengmeng Xu Chen Zhao David S. Rojas Ali Thabet Bernard Ghanem

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters