Published on 6/2/2026

JetBrains Launches Mellum2: A Specialized 12B MoE Model for Engineering

Model Name

Mellum2

Architecture

Mixture-of-Experts (MoE)

Total Parameters

12B

Active Parameters

2.5B

License

Apache 2.0

Daily Highlight 1: Mellum2 Model Release

JetBrains has officially released Mellum2, a 12B parameter Mixture-of-Experts (MoE) model under the Apache 2.0 license. Designed specifically for software engineering, it features 2.5B active parameters per token and a 131k token context window, optimized for rapid code generation, debugging, and function calling.

Daily Highlight 2: Specialized Architecture

Unlike general-purpose models, Mellum2 is engineered as a 'focal model' intended to serve as a fast, specialized component within larger AI-driven development workflows. Its MoE architecture allows for high efficiency, making it suitable for real-time coding assistance without the overhead of massive frontier models.

Daily Highlight 3: Targeted Engineering Utility

The model was trained from scratch on approximately 10.6 trillion tokens focused exclusively on software engineering datasets. This ensures high accuracy in code-specific reasoning and syntax, addressing the need for reliable, specialized tools in production development environments.

Niche Project of the Day: Mellum2 Integration Kit

Developers can now pull the Mellum2 weights directly from open repositories to integrate into local IDE extensions or custom CI/CD pipelines. Its lightweight active parameter count makes it an ideal candidate for local inference on developer workstations, reducing latency in code-completion tasks.

Key Takeaways

JetBrains released Mellum2, a 12B MoE model (Apache 2.0) for software engineering.
Mellum2 is optimized for speed and specialized tasks like debugging and function calling.
The model features a 131k token context window and is designed for multi-model pipeline integration.

Verified Sources

🔗 JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks