Visual Question Answering on MMBench
Evaluation Metrics
GPT-3.5 score
Evaluation Results
Performance of each model on this benchmark
| Model | GPT-3.5 score | Paper Title |
|---|---|---|
| LLaVA-InternLM2-ViT + MoSLoRA | 73.8 | Mixture-of-Subspaces in Low-Rank Adaptation |
| CuMo-7B | 73.0 | CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts |
| LLaVA-LLaMA3-8B-ViT + MoSLoRA | 73.0 | Mixture-of-Subspaces in Low-Rank Adaptation |
| Video-LaVIT | 67.3 | Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization |
| DreamLLM-7B | 49.9 | DreamLLM: Synergistic Multimodal Comprehension and Creation |