Command Palette
Search for a command to run...
Papers
Daily updated cutting-edge AI research papers to help you keep up with the latest AI trends

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Kairos: A Native World Model Stack for Physical AI

Guava: An Effective and Universal Harness for Embodied Manipulation

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

LifeSciBench: Evaluating Language Models on Realistic, Expert-Level Tasks in the Life Sciences

TRIAGE: Dialectical Reasoning for Explainable Risk Prediction on Irregularly Sampled Medical Time Series with LLMs

LectūraAgents: A Multi-Agent Framework for Adaptive Personalized AI-Assisted Learning and Embodied Teaching

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Predicting LLM Safety Before Release by Simulating Deployment

FastContext: Training Efficient Repository Explorer for Coding Agents

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

DreamX-World 1.0: A General-Purpose Interactive World Model

Geometric Action Model for Robot Policy Learning

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

dots.tts Technical Report

Deterministic Video Depth Estimation with Generative Priors

Galaxy Image Deconvolution for Weak Gravitational Lensing with Unrolled Plug-and-Play ADMM

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians

Agents of Chaos

HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

Orchestra-o1: Omnimodal Agent Orchestration

From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

APPO: Agentic Procedural Policy Optimization

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

Reinforcing Dual-Path Reasoning in Spatial Vision Language Models

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Kairos: A Native World Model Stack for Physical AI

Guava: An Effective and Universal Harness for Embodied Manipulation

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

LifeSciBench: Evaluating Language Models on Realistic, Expert-Level Tasks in the Life Sciences

TRIAGE: Dialectical Reasoning for Explainable Risk Prediction on Irregularly Sampled Medical Time Series with LLMs

LectūraAgents: A Multi-Agent Framework for Adaptive Personalized AI-Assisted Learning and Embodied Teaching

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Predicting LLM Safety Before Release by Simulating Deployment

FastContext: Training Efficient Repository Explorer for Coding Agents

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

DreamX-World 1.0: A General-Purpose Interactive World Model

Geometric Action Model for Robot Policy Learning

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

dots.tts Technical Report

Deterministic Video Depth Estimation with Generative Priors

Galaxy Image Deconvolution for Weak Gravitational Lensing with Unrolled Plug-and-Play ADMM

AI Must Embrace Specialization via Superhuman Adaptable Intelligence

Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians

Agents of Chaos

HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

Orchestra-o1: Omnimodal Agent Orchestration

From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

APPO: Agentic Procedural Policy Optimization