Action Classification On Kinetics Sounds
Metrics
Top 1 Accuracy
Top 5 Accuracy
Results
Performance results of various models on this benchmark
| Paper Title | |||
|---|---|---|---|
| Mirasol3B | 90.1 | - | Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities |
| MBT (AV) | 85 | 96.8 | Attention Bottlenecks for Multimodal Fusion |
0 of 2 row(s) selected.