Kirill Gavrilyuk Amir Ghodrati Zhenyang Li Cees G. M. Snoek

Abstract
This paper strives for pixel-level segmentation of actors and their actions in video content. Different from existing works, which all learn to segment from a fixed vocabulary of actor and action pairs, we infer the segmentation from a natural language input sentence. This allows to distinguish between fine-grained actors in the same super-category, identify actor and action instances, and segment pairs that are outside of the actor and action vocabulary. We propose a fully-convolutional model for pixel-level actor and action segmentation using an encoder-decoder architecture optimized for video. To show the potential of actor and action video segmentation from a sentence, we extend two popular actor and action datasets with more than 7,500 natural language descriptions. Experiments demonstrate the quality of the sentence-guided segmentations, the generalization ability of our model, and its advantage for traditional actor and action segmentation compared to the state-of-the-art.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| referring-expression-segmentation-on-a2d | Gavriluyk el al. (Optical flow) | AP: 0.215 IoU mean: 0.426 IoU overall: 0.551 [email protected]: 0.5 [email protected]: 0.376 [email protected]: 0.231 [email protected]: 0.094 [email protected]: 0.004 |
| referring-expression-segmentation-on-a2d | Gavriluyk el al. | AP: 0.198 IoU mean: 0.421 IoU overall: 0.536 [email protected]: 0.475 [email protected]: 0.347 [email protected]: 0.211 [email protected]: 0.08 [email protected]: 0.002 |
| referring-expression-segmentation-on-j-hmdb | Gavrilyuk et al. | AP: 0.233 IoU mean: 0.542 IoU overall: 0.541 [email protected]: 0.699 [email protected]: 0.460 [email protected]: 0.173 [email protected]: 0.014 [email protected]: 0.000 |
| referring-expression-segmentation-on-j-hmdb | Gavrilyuk et al. (Optical flow) | AP: 0.267 IoU mean: 0.570 IoU overall: 0.555 [email protected]: 0.712 [email protected]: 0.518 [email protected]: 0.264 [email protected]: 0.030 [email protected]: 0.000 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.