IBM just launched powerful new open source AI models – here’s what you need to know
Available under the Apache 2.0 license, IBM's Granite 3.0 models are trained on enterprise data and, the company says, can outperform the competition


IBM has launched the latest versions of its Granite AI models, claiming they match or surpass everything else currently available on the market.
The new models are open source, released under the Apache 2.0 license, and the Granite 3.0 8B and 2B language models in particular are being pitched as 'workhorse' models for tasks such as Retrieval Augmented Generation (RAG), classification, summarization, entity extraction, and tool use.
While many large language models (LLMs) are trained on publicly available data, IBM has gone for enterprise data, which it said can deliver better task-specific performance than rivals' larger models at a much lower cost.
The models can be fine-tuned with enterprise data and seamlessly integrated across a range of business environments or workflows, with IBM providing an IP indemnity for all Granite models on watsonx.ai for enterprise clients.
Meanwhile, the Granite Guardian 3.0 models allow application developers to implement safety guardrails by checking user prompts and LLM responses for a variety of risks.
"The Granite Guardian 3.0 8B and 2B models provide the most comprehensive set of risk and harm detection capabilities available in the market today," IBM claimed.
"In addition to harm dimensions such as social bias, hate, toxicity, profanity, violence, jailbreaking and more, these models also provide a range of unique RAG-specific checks such as groundedness, context relevance, and answer relevance."
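As a rough illustration of the guardrail pattern described here, the sketch below checks a prompt before generation and the response after it. It is not IBM's API: `guarded_generate`, `keyword_checker`, and the risk names are hypothetical, and the toy keyword matcher merely stands in for a call to a risk-detection model such as Granite Guardian. The RAG-specific checks are left out for brevity.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical checker signature: (text, risk_name) -> True if the risk is detected.
RiskChecker = Callable[[str, str], bool]

# Illustrative risk names taken from the article's list; not an official schema.
PROMPT_RISKS = ("social_bias", "hate", "toxicity", "profanity", "violence", "jailbreaking")
RESPONSE_RISKS = ("social_bias", "hate", "toxicity", "profanity", "violence")


@dataclass
class GuardrailResult:
    allowed: bool
    flagged: list[str]


def check(text: str, risks: tuple[str, ...], checker: RiskChecker) -> GuardrailResult:
    """Run every risk check against the text and collect the ones that fire."""
    flagged = [risk for risk in risks if checker(text, risk)]
    return GuardrailResult(allowed=not flagged, flagged=flagged)


def guarded_generate(prompt: str, generate: Callable[[str], str], checker: RiskChecker) -> str:
    """Check the prompt, call the LLM only if it passes, then check the response."""
    pre = check(prompt, PROMPT_RISKS, checker)
    if not pre.allowed:
        return f"Request blocked: {', '.join(pre.flagged)}"
    response = generate(prompt)
    post = check(response, RESPONSE_RISKS, checker)
    if not post.allowed:
        return f"Response withheld: {', '.join(post.flagged)}"
    return response


def keyword_checker(text: str, risk: str) -> bool:
    """Toy stand-in for a guardian model: flags a risk on simple keyword matches."""
    toy_terms = {
        "jailbreaking": ["ignore previous instructions"],
        "profanity": ["darn"],
    }
    return any(term in text.lower() for term in toy_terms.get(risk, []))
```

In a real deployment the `checker` argument would wrap a model call rather than a keyword list, while the surrounding control flow could stay much the same.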
The tech giant said the Granite 3.0 3B-A800M and Granite 3.0 1B-A400M mixture-of-experts (MoE) models deliver high inference efficiency with a minimal trade-off in performance.
The models were trained on more than 10 trillion tokens of data, and the company is pushing them for on-device applications, CPU servers, and situations requiring extremely low latency.
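To see why MoE designs can be efficient at inference time, consider the generic sketch below. It shows top-k expert routing in plain Python and is not a description of Granite's actual architecture: the function names, the four toy experts, and `k=2` are all assumptions for illustration. The point is that only the selected experts run for a given input, so the compute per token is a fraction of the model's total parameter count.

```python
import math
from typing import Callable, Sequence


def top_k_route(gate_scores: Sequence[float], k: int) -> list[tuple[int, float]]:
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    top = sorted(range(len(gate_scores)), key=lambda i: gate_scores[i], reverse=True)[:k]
    exps = [math.exp(gate_scores[i]) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_forward(
    x: float,
    experts: Sequence[Callable[[float], float]],
    gate_scores: Sequence[float],
    k: int = 2,
) -> float:
    """Evaluate only the routed experts and combine their outputs by weight."""
    routes = top_k_route(gate_scores, k)
    return sum(weight * experts[i](x) for i, weight in routes)


# Four toy "experts"; with k=2 only two of them are evaluated per input.
toy_experts = [lambda x: x * 1, lambda x: x * 2, lambda x: x * 3, lambda x: x * 4]
toy_scores = [0.1, 2.0, -1.0, 2.0]  # hypothetical gate outputs for one token
output = moe_forward(3.0, toy_experts, toy_scores, k=2)
```

Here half of the experts are skipped entirely for this input, which is the source of the efficiency gain; real MoE layers apply the same idea to neural network sub-blocks rather than simple functions.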
IBM doubles down on AI assistant commitments
As part of the announcement, IBM also promised new developments in AI assistants, saying it's paving the way for future AI agents that can self-direct, reflect, and perform complex tasks in dynamic business environments.
This sharpened focus will include the next generation of watsonx Code Assistant, powered by Granite code models, which will offer general-purpose coding assistance across languages such as C, C++, Go, Java, and Python, along with advanced application modernization capabilities for enterprise Java applications.
It's also planning to release new tools to help developers build, customize and deploy AI more efficiently via watsonx.ai, including agentic frameworks, integrations with existing environments and low-code automations for common use cases like RAG and agents.
Over the rest of the year, IBM said it will expand all model context windows to 128K tokens, improve multilingual support for 12 natural languages, and introduce multimodal image-in, text-out capabilities.
Emma Woollacott is a freelance journalist writing for publications including the BBC, Private Eye, Forbes, Raconteur and specialist technology titles.