Inference model · GEMMA

Gemma 4 26B A4B

The cheapest model in the catalog. 4B active parameters, 26B total. Built for volume.


Gemma 4 26B A4B is what we recommend when an agent will call inference hundreds of thousands of times a day and you can tolerate some per-call quality variance. The cost ceiling stays predictable; the quality floor stays useful.

When to pick it

Background research agents, daily-digest workflows
High-throughput document processing pipelines
Anything where you’d otherwise want to “use a cheap model and re-rank”
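
The "use a cheap model and re-rank" pattern in the last item can be sketched roughly as follows. This is a minimal illustration, not ScaLabs client code: `best_of_n`, `generate`, and `score` are hypothetical names, and the two stand-in functions at the bottom only exist so the sketch runs without a network call. In practice `generate` would wrap a call to the cheap model and `score` would be whatever re-ranker you trust (a heuristic, a classifier, or a stronger model).

```python
from typing import Callable, List, Tuple


def best_of_n(
    prompt: str,
    generate: Callable[[str], str],      # hypothetical: one call to the cheap model
    score: Callable[[str, str], float],  # hypothetical: re-ranker, higher is better
    n: int = 4,
) -> Tuple[str, List[Tuple[float, str]]]:
    """Sample n candidates and return the top-scoring one plus the full ranking."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates = [generate(prompt) for _ in range(n)]
    # Sort by score only, so ties keep their original sampling order.
    ranked = sorted(
        ((score(prompt, c), c) for c in candidates),
        key=lambda pair: pair[0],
        reverse=True,
    )
    return ranked[0][1], ranked


# Stand-ins for illustration only (assumptions, not real model calls).
_canned = iter(["short", "a longer summary", "mid length"])


def fake_generate(prompt: str) -> str:
    return next(_canned)


def length_score(prompt: str, candidate: str) -> float:
    return float(len(candidate))


best, ranking = best_of_n("Summarize the report", fake_generate, length_score, n=3)
print(best)
```

The trade-off is that cost scales with `n`, so the pattern only pays off while `n` cheap calls plus one scoring pass stay below the price of a single call to a larger model.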

When to look elsewhere

Strict output-quality floors → use a dense model (Qwen 3.6 27B or Gemma 4 31B)
Function-calling-heavy workflows that need very stable schemas → use Qwen 3.6 27B
