Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation

Video-Based Reward Modeling for Computer-Use Agents
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning
In-Context Reinforcement Learning for Tool Use in Large Language Models
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents
Flash-KMeans: Fast and Memory-Efficient Exact K-Means
OpenClaw-RL: Train Any Agent Simply by Talking
Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing
Believe Your Model: Distribution-Guided Confidence Calibration
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory
How Far Can Unsupervised RLVR Scale LLM Training?
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
DreamCAD: Scaling Multi-modal CAD Generation using Differentiable Parametric Surfaces
Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum
NOTAI.AI: Explainable Detection of Machine-Generated Text via Curvature and Feature Attribution
Safer Reasoning Traces: Measuring and Mitigating Chain-of-Thought Leakage in LLMs
RACAS: Controlling Diverse Robots With a Single Agentic System
Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models
ArtLLM: Generating Articulated Assets via 3D LLM
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images
RoboPocket: Improve Robot Policies Instantly with Your Phone