UNO-Bench full-modal Evaluation Benchmark Dataset

Date

16 days ago

Size

9.71 GB

Organization

Meituan

Paper URL

2510.18915

License

MIT

UNO-Bench is the first unified full-modal evaluation benchmark, released by Meituan's LongCat team in 2025. The related paper is titled "UNO-Bench: A Unified Benchmark for Exploring the Compositional Law Between Uni-modal and Omni-modal in Omni Models". The goal is to efficiently assess both single-modal and multi-modal understanding capabilities.

This dataset contains 1,250 full-modal samples with 98% cross-modal solvability and 2,480 single-modal samples, covering 44 task types and 5 modality combinations. It also includes a general scoring model that supports automated evaluation of 6 question types, providing a unified evaluation standard for multimodal tasks. The full-modal samples were constructed by hand to closely resemble real-world applications and are especially suited to Chinese-language contexts; the single-modal samples supplement the basic cognitive and ability dimensions, making the overall evaluation more comprehensive.

Data Structures:

The data is stored in Parquet format, and each sample contains structured fields:

  • qid (sample ID), subset_name (subset name);
  • question (textual question) and answer (standard answer);
  • images / audios / videos (multimodal content; file paths are stored as a dictionary, or null if the modality is not present);
  • task (44 task tags), ability (ability type), source (data source), score_type (scoring method).
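
As a rough illustration of the fields listed above, the following Python sketch works out which modalities a sample carries and collects its file paths. It is not taken from the dataset's own tooling: the helper names (media_paths, modality_combination), the example record, and every value in it are hypothetical, and the code assumes that each modality dictionary maps some key to a file path string.

```python
from typing import Any, Dict, List

# Field names as listed in the dataset description.
MODALITY_FIELDS = ("images", "audios", "videos")


def media_paths(sample: Dict[str, Any]) -> Dict[str, List[str]]:
    """Collect file paths for every modality that is present in a sample.

    Assumption: each modality field is either None (null) or a dictionary
    whose values are file paths. The dictionary keys are not documented
    in the description, so they are ignored here.
    """
    paths: Dict[str, List[str]] = {}
    for field in MODALITY_FIELDS:
        value = sample.get(field)
        if value is None:
            continue  # modality not present in this sample
        paths[field] = [str(p) for p in value.values() if p is not None]
    return paths


def modality_combination(sample: Dict[str, Any]) -> str:
    """Return a label such as 'images+audios', or 'text-only' if no media."""
    present = [f for f in MODALITY_FIELDS if sample.get(f) is not None]
    return "+".join(present) if present else "text-only"


# Hypothetical record: all values are made up for illustration only.
# With pandas and a Parquet engine installed, real records could be read
# with pandas.read_parquet(<path>).to_dict("records"); the actual file
# path inside the archive is not stated in the description.
example = {
    "qid": "demo-0001",
    "subset_name": "demo",
    "question": "Which option matches the sound and the picture?",
    "answer": "A",
    "images": {"image_1": "images/demo-0001.jpg"},
    "audios": {"audio_1": "audios/demo-0001.wav"},
    "videos": None,
    "task": "demo-task",
    "ability": "demo-ability",
    "source": "demo-source",
    "score_type": "demo-score-type",
}

print(modality_combination(example))
print(media_paths(example))
```

Treating null as "modality absent" mirrors the field description; when records come from a Parquet reader, missing values may surface in other forms (for example NaN), so the None check may need adjusting.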
UNO-Bench.torrent
  • UNO-Bench/
    • README.md (1.97 KB)
    • README.txt (3.93 KB)
    • data/
      • UNO-Bench.zip (9.71 GB)
