HyperAI

Papers

Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

Build the Future of Artificial Intelligence

About

About Us Dataset Help

Products

News Papers Notebooks Datasets Wiki

Links

© HyperAI

GitHub Discord X (formerly Twitter)

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Dmitrii Stoianov, Danil Taranets, Olga Tsymboi, et al.

AutoGLM: Autonomous Foundation Agents for GUIs

Xiao Liu, Bo Qin, Dongzhu Liang, et al.

OpenGU: A Comprehensive Benchmark for Graph Unlearning

Machine Learning

Bowen Fan, Yuming Ai, Xunkai Li, et al.

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Reinforcement Learning

Charlie Zhang, Graham Neubig, Xiang Yue

DeepCode: Open Agentic Coding

Code Generation

Retrieval-Augmented Generation

Zongwei Li, Zhonghang Li, Zirui Guo, et al.

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Hongyuan Tao, Bencheng Liao, Shaoyu Chen, et al.

OmniPSD: Layered PSD Generation with Diffusion Transformer

Diffusion Model

Image Generation

Cheng Liu, Yiren Song, Haofan Wang, et al.

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

Minghui Lin, Pengxiang Ding, Shu Wang, et al.

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

Monishwaran Maheswaran, Rishabh Tiwari, Yuezhou Hu, et al.

Composing Concepts from Images and Videos via Concept-prompt Binding

Xianghao Kong, Zeyu Zhang, Yuwei Guo, et al.

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Video Generation

Ke Xing, Longfei Li, Yuyang Yin, et al.

Urania: Differentially Private Insights into AI Use

Daogao Liu, Edith Cohen, Badih Ghazi, et al.

Training LLMs for Honesty via Confessions

Supervised Fine-Tuning

Manas Joglekar, Jeremy Chen, Gabriel Wu, et al.

Measuring Agents in Production

Melissa Z. Pan, Negar Arabzadeh, Riccardo Cogo, et al.

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

Yiming Wang, Pei Zhang, Jialong Tang, et al.

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Long Lian, Sida Wang, Felix Juefei-Xu, et al.

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

Reinforcement Learning

Supervised Fine-Tuning

Salman Rahman, Sruthi Gorantla, Arpit Gupta, et al.

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Video Generation

Zhaochong An, Menglin Jia, Haonan Qiu, et al.

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Video Processing

Computer Vision

Zekai Luo, Zongze Du, Zhouhang Zhu, et al.

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Yuning Gong, Yifei Liu, Yifan Zhan, et al.

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Video Generation

Ruihang Chu, Yefei He, Zhekai Chen, et al.

Soft Adaptive Policy Optimization

Reinforcement Learning

Chang Gao, Chujie Zheng, Xiong-Hui Chen, et al.

Scaling Zero-Shot Reference-to-Video Generation

Video Generation

Zijian Zhou, Shikun Liu, Haozhe Liu, et al.

Voxify3D: Pixel Art Meets Volumetric Rendering

Yi-Chuan Huang, Jiewen Chan, Hao-Jen Chien, et al.

DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems

Ming Ma, Jue Zhang, Fangkai Yang, et al.

Unified Video Editing with Temporal Reasoner

Video Generation

Video Processing

Xiangpeng Yang, Ji Xie, Yiyuan Yang, et al.

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Xiaoran Liu, Yuerong Song, Zhigeng Liu, et al.

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Tong Wu, Yang Liu, Jun Bai, et al.

iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification

Zixun Xiong, Gaoyi Wu, Qingyang Yu, et al.

DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt

Supervised Fine-Tuning

Yitong Zhang, Jia Li, Liyi Cai, et al.

WorldGen: From Text to Traversable and Interactive 3D Worlds

Diffusion Model

Dilin Wang, Hyunyoung Jung, Tom Monnier, et al.

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Shalini Maiti, Amar Budhiraja, Bhavul Gauri, et al.
