AVSD Audio-Visual Scene Aware Dialogue Dataset

AVSD stands for The Audio Visual Scene-Aware Dialog (or DSTC7 Track 3) is an audio-visual dataset for understanding dialogue. The dataset aims to build a system and respond to the dialogue in the input video.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.