JetBrains, the company behind a range of popular app development tools, has released its first “open” AI model for coding.
On Wednesday, JetBrains made Mellum, a code-generating model the company released for its various software development suites last year, openly available on the AI dev platform Hugging Face. Mellum, trained on more than 4 trillion tokens, weighs in at 4 billion parameters, and is designed specifically for code completion (i.e. completing code snippets based on the surrounding context).
Parameters roughly correspond to a model’s problem-solving skills, while tokens are the raw bits of data that a model processes. A million tokens is equivalent to ~30,000 lines of code.
“Designed for integration into professional developer tooling (e.g. intelligent code suggestions in integrated developer environments), AI-powered coding assistants, and research on code understanding and generation, Mellum is also well-suited for educational applications and fine-tuning experiments,” explains JetBrains in a technical report.
JetBrains says that it trained Mellum, which is Apache 2.0-licensed, on a collection of data sets including permissively licensed code from GitHub and English-language Wikipedia articles. Training took around 20 days on a cluster of 256 H200 Nvidia GPUs.
Mellum takes some work to get up and running. The base model can’t be used out of the box; it has to be fine-tuned first. While JetBrians has provided a few Mellum models fine-tuned for Python, the company cautions that they’re meant for “estimation about potential capabilities” — not deploying into a production environment.
AI-generated code is no doubt changing how software is built, but it’s also introducing new security challenges. More than 50% of organizations encounter security issues with AI-produced code sometimes or frequently, according to a late 2023 survey by developer security platform Synk.
Techcrunch event
Exhibit at TechCrunch Sessions: AI
Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.
Exhibit at TechCrunch Sessions: AI
Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.
Berkeley, CA
|
June 5
BOOK NOW
Indeed, JetBrains notes that Mellum may “reflect biases present in public codebases” (e.g. generating code similar in style to open source repositories), and that its code suggestions won’t necessarily be “secure or free of vulnerabilities.”
“This is just the beginning,” JetBrains wrote in a blog post. “We’re not chasing generality — we’re building focus. If Mellum sparks even one meaningful experiment, contribution, or collaboration, we would consider it a win.”
Topics
AI, JetBrains
Kyle Wiggers
AI Editor
Kyle Wiggers is TechCrunch’s AI Editor. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Manhattan with his partner, a music therapist.
View Bio
May 13, 2025
London, England
Get inside access to Europe’s top investment minds — with leaders from Monzo, Accel, Paladin Group, and more — plus top-tier networking at StrictlyVC London.
REGISTER NOW
Most Popular
Google launches AI tools for practicing languages through personalized lessons
Aisha Malik
Hugging Face releases a 3D-printed robotic arm starting at $100
Kyle Wiggers
An OpenAI researcher who worked on GPT-4.5 had their green card denied
Kyle Wiggers
Bezos-backed Slate Auto debuts analog EV pickup truck that is decidedly anti-Tesla
Sean O'Kane
Wait, how did a decentralized service like Bluesky go down?
Sarah Perez
Perplexity CEO says its browser will track everything users do online to sell ‘hyper personalized’ ads
Julie Bort
Government censorship comes to Bluesky, but not its third-party apps … yet
Sarah Perez