← Back to News
Published on 6/2/2026

JetBrains Launches Mellum2: A Specialized 12B MoE Model for Engineering

JetBrains Launches Mellum2: A Specialized 12B MoE Model for Engineering
Model Name
Mellum2
Architecture
Mixture-of-Experts (MoE)
Total Parameters
12B
Active Parameters
2.5B
License
Apache 2.0

Daily Highlight 1: Mellum2 Model Release

JetBrains has officially released Mellum2, a 12B parameter Mixture-of-Experts (MoE) model under the Apache 2.0 license. Designed specifically for software engineering, it features 2.5B active parameters per token and a 131k token context window, optimized for rapid code generation, debugging, and function calling.

Daily Highlight 2: Specialized Architecture

Unlike general-purpose models, Mellum2 is engineered as a 'focal model' intended to serve as a fast, specialized component within larger AI-driven development workflows. Its MoE architecture allows for high efficiency, making it suitable for real-time coding assistance without the overhead of massive frontier models.

Daily Highlight 3: Targeted Engineering Utility

The model was trained from scratch on approximately 10.6 trillion tokens focused exclusively on software engineering datasets. This ensures high accuracy in code-specific reasoning and syntax, addressing the need for reliable, specialized tools in production development environments.

Niche Project of the Day: Mellum2 Integration Kit

Developers can now pull the Mellum2 weights directly from open repositories to integrate into local IDE extensions or custom CI/CD pipelines. Its lightweight active parameter count makes it an ideal candidate for local inference on developer workstations, reducing latency in code-completion tasks.

Key Takeaways

  • JetBrains released Mellum2, a 12B MoE model (Apache 2.0) for software engineering.
  • Mellum2 is optimized for speed and specialized tasks like debugging and function calling.
  • The model features a 131k token context window and is designed for multi-model pipeline integration.