Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Flow-OPD: On-Policy Distillation for Flow Matching Models































Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Flow-OPD: On-Policy Distillation for Flow Matching Models






























MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems
When to Trust Imagination: Adaptive Action Execution for World Action Models
RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation
MiA-Signature: Approximating Global Activation for Long-Context Understanding
Continuous Latent Diffusion Language Model
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction
MathNet: A GLOBAL MULTIMODAL BENCHMARK FOR MATHEMATICAL REASONING AND RETRIEVAL
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
ZAYA1-8B Technical Report
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
RLDX-1 Technical Report
Stream-T1: Test-Time Scaling for Streaming Video Generation
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe
AGENTIC-IMODELS: Evolving agentic interpretability tools via autoresearch
HEAVYSKILL: Heavy Thinking as the Inner Skill in Agentic Harness
WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments
Hallucinations Undermine Trust; Metacognition is a Way Forward
X2SAM: Any Segmentation in Images and Videos
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
ProgramBench: Can Language Models Rebuild Programs From Scratch?
Efficient Accelerated Graph Edit Distance Computation on GPU
LLM-based uncertainty assessment of social media situational signals for crisis reporting
Canonical LST: A Protocol-Native Liquid Staking Solution for Tezos
MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems
When to Trust Imagination: Adaptive Action Execution for World Action Models
RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation
MiA-Signature: Approximating Global Activation for Long-Context Understanding
Continuous Latent Diffusion Language Model
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction
MathNet: A GLOBAL MULTIMODAL BENCHMARK FOR MATHEMATICAL REASONING AND RETRIEVAL
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
ZAYA1-8B Technical Report
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
RLDX-1 Technical Report
Stream-T1: Test-Time Scaling for Streaming Video Generation
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe
AGENTIC-IMODELS: Evolving agentic interpretability tools via autoresearch
HEAVYSKILL: Heavy Thinking as the Inner Skill in Agentic Harness
WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments
Hallucinations Undermine Trust; Metacognition is a Way Forward
X2SAM: Any Segmentation in Images and Videos
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
ProgramBench: Can Language Models Rebuild Programs From Scratch?
Efficient Accelerated Graph Edit Distance Computation on GPU
LLM-based uncertainty assessment of social media situational signals for crisis reporting
Canonical LST: A Protocol-Native Liquid Staking Solution for Tezos