Fractal has announced the launch of LLM Studio, an enterprise platform that helps organisations build and run language models tailored to their business.
It is designed for teams that want greater control over how models are governed, deployed and managed in production. LLM Studio will be demonstrated at NVIDIA GTC 2026, NVIDIA's premier AI and accelerated computing conference, at the San Jose McEnery Convention Center in California.
Enterprises are moving beyond one-size-fits-all, API-only large language models for high-value use cases. Today, business and technology leaders require clearer guardrails for governance, predictable costs and reliable performance. At the same time, industry research points to growing adoption of smaller, purpose-built models that can be tuned to specific functions and domains.
LLM Studio enables businesses to design, build, evaluate and operate domain-adapted language models using open-source models, powered by NVIDIA's AI infrastructure and software stack. It brings together two modules, AutoLLM and LLMOps.
AutoLLM helps businesses create smaller, specialised models for specific tasks or industries. It supports open-source model selection, synthetic data generation, model customisation, evaluation and performance benchmarking. LLMOps helps teams manage the full life cycle after a model is created, supporting deployment, monitoring and governance.
LLM Studio helps keep model responses tied to an organisation’s approved data and context, reducing hallucinations and improving the quality of reasoning. The resulting models remain proprietary to the organisation. Teams can then use these models in agents or other generative AI applications to deliver more reliable performance, often at a fraction of the cost of running larger foundation models.
LLM Studio, developed by the AI Client Services team at Fractal, is built on NVIDIA reference architectures, using NVIDIA NeMo for core model development workflows and NVIDIA NIM microservices for model hosting. This design helps businesses standardise how models are deployed and governed across major cloud environments, reducing the need for custom builds in each setup. Fractal plans to use NVIDIA Nemotron open models for development.
“Enterprises are past the experimentation phase with generative AI. They need solutions that are governed, cost predictable and reliable in production. With LLM Studio, we are giving organisations a practical way to build and operate domain-specific language models using open-source options, while taking advantage of NVIDIA AI infrastructure,” said Pranay Agrawal, Chief Executive Officer, Fractal.
LLM Studio supports a wide range of applications and is built for enterprise users, including teams with limited or no coding experience. Businesses can tailor models to their content, train them to follow specific instructions, or build models designed for tasks that require stronger reasoning.
