Papers
Cutting-edge AI research papers, updated daily, to help you keep up with the latest AI trends

Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models
LLM-in-Sandbox Elicits General Agentic Intelligence
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
HY-MT1.5 Technical Report
Scaling Laws for Code: Every Programming Language Matters
Qwen3-TTS Technical Report
Small Models, Big Results: Achieving Superior Intent Extraction through Decomposition
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents
DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution
Rethinking Video Generation Model for the Embodied World
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance
Agentic Reasoning for Large Language Models
PERSONAPLEX: VOICE AND ROLE CONTROL FOR FULL DUPLEX CONVERSATIONAL SPEECH MODELS
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning
MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer
Toward Efficient Agents: Memory, Tool learning, and Planning
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision
Building Production-Ready Probes For Gemini
LFM2 Technical Report
CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge