HyperAIHyperAI

Command Palette

Search for a command to run...

CapRL Describes Reinforcement Learning

Date

3 days ago

Organization

The Chinese University of Hong Kong
University of Science and Technology of China
Shanghai Artificial Intelligence Laboratory

Paper URL

2509.22647

CapRL was proposed in September 2025 by a research team from the University of Science and Technology of China, the Chinese University of Hong Kong, and the Shanghai Artificial Intelligence Laboratory, among other institutions. The related research findings were published in the paper "...".CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning".

CapRL is a novel training framework that redefines description quality through practicality: high-quality descriptions should enable non-visual language models to accurately answer questions about the corresponding image. It employs a decoupled two-stage process where a large visual language model (LVLM) generates the description, while the objective reward derives from the accuracy of a separate, non-visual large language model (LLM) in answering multiple-choice questions based on that description. Pre-trained on the CapRL-5M description dataset annotated with CapRL-3B, CapRL achieves significant improvements across 12 benchmarks. Furthermore, within the Prism framework for description quality evaluation, its performance is comparable to Qwen2.5-VL-72B, outperforming the baseline by an average of 8.41 TP3T.

CapRL method

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp