HumanSense Benchmark is a human-centered perception evaluation benchmark dataset released in 2025 by Xi'an Jiaotong University in collaboration with Ant Group. The related research paper is titled "HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses through Reasoning MLLMs". Its goal is to comprehensively measure a model's real-world interactive capabilities under the fusion of multimodal information such as vision, audio, and text.

This dataset contains 3,291 video-based questions and 591 audio-based questions, covering 15 tasks of increasing difficulty. The tasks are organized as a four-layer pyramid (a minimal code sketch of this hierarchy follows the list):

  • L1–L2 Perception Layers: Fundamental and complex perceptual capabilities for vision, audio, and cross-modal perception;
  • L3 Understanding Layer: The ability to understand implicit relationships, emotions, and states based on interactive situations;
  • L4 Response Layer: Strategic and contextualized response capabilities in interactive scenarios.
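
For readers who want to script an evaluation over the benchmark, the sketch below shows one possible in-memory representation of this hierarchy in Python. Only the layer definitions come from the list above; the `Question` fields and the `group_by_layer` helper are illustrative assumptions rather than the benchmark's official schema.

```python
# A minimal sketch of how the HumanSense task hierarchy could be represented.
# Field names and the grouping helper are hypothetical; adapt them to the
# actual data files released with the benchmark.
from dataclasses import dataclass


@dataclass
class Question:
    qid: str            # unique question identifier (assumed)
    layer: str          # "L1", "L2", "L3", or "L4"
    task: str           # one of the 15 tasks
    modality: str       # "video" or "audio"
    prompt: str
    options: list[str]
    answer: str


# Four-layer pyramid, following the list above.
LAYERS = {
    "L1": "Fundamental perception (vision / audio / cross-modal)",
    "L2": "Complex perception (vision / audio / cross-modal)",
    "L3": "Understanding of implicit relationships, emotions, and states",
    "L4": "Strategic, context-aware responses in interactive scenarios",
}


def group_by_layer(questions: list[Question]) -> dict[str, list[Question]]:
    """Bucket questions by pyramid layer, e.g. for per-layer accuracy reporting."""
    buckets: dict[str, list[Question]] = {layer: [] for layer in LAYERS}
    for q in questions:
        buckets[q.layer].append(q)
    return buckets
```

Grouping by layer in this way makes it straightforward to report separate scores for perception (L1–L2), understanding (L3), and response (L4) capabilities.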


The questions are constructed from real videos, audio, and multimodal dialogues, drawn from various open-source datasets and real-world scene recordings. They cover a wide range of human-centered interaction tasks, from appearance recognition and emotion recognition to relationship understanding and psychological dialogue, making HumanSense one of the current multimodal evaluation benchmarks closest to real human communication scenarios.

Dataset distribution
