JetBrains Opens Mellum2 for AI Workflows

2026-06-03 · jetbrains

JetBrains is positioning Mellum2 as a practical model for the unglamorous but important parts of production AI. The focus is not general-purpose spectacle but speed, routing, and low-cost execution inside software workflows. That makes this release relevant to anyone building agent pipelines, IDE assistants, or private model stacks for engineering teams.

Key Features or Updates

Mellum2 is an open-source 12B-parameter model released under Apache 2.0 and designed from scratch for software engineering workflows. JetBrains says it is especially well suited to routing, summarization, Q&A, intermediate reasoning, and fast sub-agents.

Impact on Developers

The model is meant to solve latency, throughput, and cost problems that show up once AI systems move beyond demos. If it performs as advertised, teams can use it to split agent pipelines into smaller, more efficient steps instead of forcing a single large model to do everything.

How to Use It

Try Mellum2 in a local or self-hosted setup if you need private AI for code-heavy workflows. It is most useful when you want a fast routing layer, a lightweight sub-agent, or a model that can sit inside a larger orchestration system.
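
To make the routing-layer idea concrete, here is a minimal sketch in Python. It is not JetBrains' API or a documented Mellum2 interface: every name (Router, classify, handlers, the fake_* functions) is hypothetical, and the models are plain stand-in functions so the example runs without any model server. In a real setup, the classify callable would be replaced by a call to whatever endpoint serves your small model.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# A "model" here is just a callable: prompt text in, response text out.
# This is an assumption made for illustration, not a real Mellum2 interface.
ModelFn = Callable[[str], str]


@dataclass
class Router:
    classify: ModelFn             # small, fast model that labels each request
    handlers: Dict[str, ModelFn]  # label -> model that handles that kind of request
    default: str                  # label to use when the classifier output is unrecognized

    def route(self, request: str) -> str:
        """Return the handler label chosen for this request."""
        label = self.classify(request).strip().lower()
        return label if label in self.handlers else self.default

    def run(self, request: str) -> str:
        """Classify the request, then dispatch it to the matching handler."""
        return self.handlers[self.route(request)](request)


# Hypothetical stand-ins so the sketch is runnable on its own.
def fake_small_classifier(request: str) -> str:
    # Toy rule: long inputs get summarized, short ones are treated as questions.
    return "summarize" if len(request) > 80 else "qa"


def fake_small_model(request: str) -> str:
    return f"[small] {request[:20]}"


def fake_large_model(request: str) -> str:
    return f"[large] {request[:20]}"


router = Router(
    classify=fake_small_classifier,
    handlers={
        "qa": fake_small_model,
        "summarize": fake_small_model,
        "plan": fake_large_model,
    },
    default="plan",  # anything the classifier cannot label goes to the larger model
)

if __name__ == "__main__":
    print(router.run("What does this function return?"))
```

The design choice worth noting is the fallback: when the small model produces a label the router does not recognize, the request goes to the larger model rather than failing, so the cheap path is only taken when the classification is unambiguous.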
