HyperAIHyperAI

Command Palette

Search for a command to run...

Deploying April-1.5-15b-Thinker Using vLLM + Open WebUI

Date

2 months ago

Size

3.7 MB

License

MIT

Paper URL

arxiv.org

1. Tutorial Introduction

Apriel-1.5-15b-Thinker is a multimodal inference model launched by ServiceNow in October 2025. This model does not employ reinforcement learning or preference optimization, nor is it trained from scratch. Instead, it relies on a carefully designed "mid-training" process, achieving outstanding performance comparable to top-tier closed-source models on both text and visual tasks. Despite having only 15 billion parameters, its performance on several authoritative benchmarks rivals mainstream models with more than 10 times the number of parameters (such as Deepseek R1 0528 and Gemini Flash), demonstrating extremely high inference efficiency and comprehensive capabilities. The related paper is titled "..."Apriel-1.5-15b-Thinker".

This tutorial uses a dual-card RTX 5090 setup.

Model Function

  • Text Generation
  • Image analysis
  • Logical reasoning
  • Mathematical Problem Solving
  • Code Generation
  • function call
  • Multi-step task processing
  • Scientific discourse
  • Knowledge Q&A

2. Project Examples

3. Operation steps

1. After starting the container, click the API address to enter the Web interface

2. After entering the webpage, you can start a conversation with the model

If "Model" is not displayed, it means the model is being initialized. Since the model is large, please wait about 2-3 minutes and refresh the page.

How to use

Citation Information

The citation information for this project is as follows:

@misc{radhakrishna2025apriel1515bthinker,
      title={Apriel-1.5-15b-Thinker}, 
      author={Shruthan Radhakrishna and Aman Tiwari and Aanjaneya Shukla and Masoud Hashemi and Rishabh Maheshwary and Shiva Krishna Reddy Malay and Jash Mehta and Pulkit Pattnaik and Saloni Mittal and Khalil Slimi and Kelechi Ogueji and Akintunde Oladipo and Soham Parikh and Oluwanifemi Bamgbose and Toby Liang and Ahmed Masry and Khyati Mahajan and Sai Rajeswar Mudumba and Vikas Yadav and Sathwik Tejaswi Madhusudhan and Torsten Scholak and Sagar Davasam and Srinivas Sunkara and Nicholas Chapados},
      year={2025},
      eprint={2510.01141},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2510.01141}, 
}

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp