4 months ago

Background Suppression Network for Weakly-supervised Temporal Action Localization

View Paper Details

Pilhyeon Lee Youngjung Uh Hyeran Byun

Background Suppression Network for Weakly-supervised Temporal Action Localization

Abstract

Weakly-supervised temporal action localization is a very challenging problem because frame-wise labels are not given in the training stage while the only hint is video-level labels: whether each video contains action frames of interest. Previous methods aggregate frame-level class scores to produce video-level prediction and learn from video-level action labels. This formulation does not fully model the problem in that background frames are forced to be misclassified as action classes to predict video-level labels accurately. In this paper, we design Background Suppression Network (BaS-Net) which introduces an auxiliary class for background and has a two-branch weight-sharing architecture with an asymmetrical training strategy. This enables BaS-Net to suppress activations from background frames to improve localization performance. Extensive experiments demonstrate the effectiveness of BaS-Net and its superiority over the state-of-the-art methods on the most popular benchmarks - THUMOS'14 and ActivityNet. Our code and the trained model are available at https://github.com/Pilhyeon/BaSNet-pytorch.

Code Repositories

Pilhyeon/BaSNet-pytorch

Official

pytorch

Mentioned in GitHub

Pilhyeon/Learning-Action-Completeness-from-Points

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
weakly-supervised-action-localization-on	BaS-Net	[email protected]:0.5: 43.6 [email protected]:0.7: 35.3 [email protected]: 27
weakly-supervised-action-localization-on-1	BaS-Net	[email protected]: 34.5 [email protected]:0.95: 22.2
weakly-supervised-action-localization-on-2	BaS-Net	[email protected]: 38.5
weakly-supervised-action-localization-on-4	BasNet	[email protected]: 27.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

4 months ago

Background Suppression Network for Weakly-supervised Temporal Action Localization

View Paper Details

Pilhyeon Lee Youngjung Uh Hyeran Byun

Background Suppression Network for Weakly-supervised Temporal Action Localization

Abstract

Weakly-supervised temporal action localization is a very challenging problem because frame-wise labels are not given in the training stage while the only hint is video-level labels: whether each video contains action frames of interest. Previous methods aggregate frame-level class scores to produce video-level prediction and learn from video-level action labels. This formulation does not fully model the problem in that background frames are forced to be misclassified as action classes to predict video-level labels accurately. In this paper, we design Background Suppression Network (BaS-Net) which introduces an auxiliary class for background and has a two-branch weight-sharing architecture with an asymmetrical training strategy. This enables BaS-Net to suppress activations from background frames to improve localization performance. Extensive experiments demonstrate the effectiveness of BaS-Net and its superiority over the state-of-the-art methods on the most popular benchmarks - THUMOS'14 and ActivityNet. Our code and the trained model are available at https://github.com/Pilhyeon/BaSNet-pytorch.

Code Repositories

Pilhyeon/BaSNet-pytorch

Official

pytorch

Mentioned in GitHub

Pilhyeon/Learning-Action-Completeness-from-Points

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
weakly-supervised-action-localization-on	BaS-Net	[email protected]:0.5: 43.6 [email protected]:0.7: 35.3 [email protected]: 27
weakly-supervised-action-localization-on-1	BaS-Net	[email protected]: 34.5 [email protected]:0.95: 22.2
weakly-supervised-action-localization-on-2	BaS-Net	[email protected]: 38.5
weakly-supervised-action-localization-on-4	BasNet	[email protected]: 27.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

Background Suppression Network for Weakly-supervised Temporal Action Localization | Papers | HyperAI