Mean Velocity Policy (MVP)
The Mean Velocity Policy (MVP) was jointly proposed by research teams from Tsinghua University (the School of Vehicle and Mobility and the School of Artificial Intelligence), the Berkeley Artificial Intelligence Research (BAIR) lab at the University of California, Berkeley, and the University of Hong Kong. The work was accepted as a conference paper at the International Conference on Learning Representations (ICLR 2026) and is presented in the paper "Mean Flow Policy with Instantaneous Velocity Constraint for One-step Action Generation".
MVP is a generative policy for reinforcement learning that produces actions in a single step by modeling an "average velocity field," eliminating the computational overhead of multi-step sampling. Because the average-velocity formulation lacks explicit boundary conditions, the team introduced an "instantaneous velocity constraint" (IVC), which improves learning accuracy and policy expressiveness. In practice, MVP substantially accelerates training and inference (average single-step inference takes only 10.93 milliseconds) and achieves a state-of-the-art average success rate of 0.88 on complex robot manipulation tasks in Robomimic and OGBench.
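The one-step idea can be illustrated with a toy sketch. In mean-flow models, the average velocity satisfies x_t = x_r + (t − r)·u(x_r, r, t), so a single evaluation of u maps noise directly to a sample, whereas ordinary flow models integrate the instantaneous velocity over many steps. Everything below is illustrative, not from the paper: the closed-form velocity field and function names are assumptions, and in MVP a neural network would predict the average velocity conditioned on the state.

```python
import numpy as np

def instantaneous_velocity(x, t):
    # Toy closed-form field v(x, t) = -x (stands in for a learned flow).
    return -x

def average_velocity(x_r, r, t, n=1000):
    # Average velocity u(x_r, r, t) = (x_t - x_r) / (t - r), obtained here by
    # numerically integrating the instantaneous field with Euler steps.
    # In MVP, a network predicts this quantity directly, so no loop is needed
    # at inference time.
    x, s, dt = x_r.copy(), r, (t - r) / n
    for _ in range(n):
        x = x + dt * instantaneous_velocity(x, s)
        s += dt
    return (x - x_r) / (t - r)

def one_step_action(noise):
    # One-step generation: a = z + (1 - 0) * u(z, 0, 1); a single evaluation
    # replaces the multi-step sampling loop.
    return noise + 1.0 * average_velocity(noise, 0.0, 1.0)

z = np.array([1.0, -2.0])       # "noise" sample
a = one_step_action(z)
# For v(x) = -x the exact flow is x_t = x_0 * exp(-t), so a ≈ z * e^(-1).
```

Here the integration loop lives inside `average_velocity` only to fake a ground-truth average; the point of MVP is that a trained network replaces it, leaving `one_step_action` as a single forward pass.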