Why most enterprise AI agents never reach production and how Databricks plans to fix it

June 12, 2025

58

Table of Contents

Many enterprise AI agent improvement efforts by no means make it to manufacturing and it’s not as a result of the know-how isn’t prepared. The issue, based on Databricks, is that firms are nonetheless counting on handbook evaluations with a course of that’s sluggish, inconsistent and tough to scale.

In the present day on the Knowledge + AI Summit, Databricks launched Mosaic Agent Bricks as an answer to that problem. The know-how builds on and extends the Mosaic AI Agent Framework the corporate introduced in 2024. Merely put, it’s not adequate to only be capable to construct AI brokers with a purpose to have real-world affect.

The Mosaic Agent Bricks platform automates agent optimization utilizing a collection of research-backed improvements. Among the many key improvements is the mixing of TAO (Take a look at-time Adaptive Optimization), which offers a novel strategy to AI tuning with out the necessity for labeled information. Mosaic Agent Bricks additionally generates domain-specific artificial information, creates task-aware benchmarks and optimizes quality-to-cost steadiness with out handbook intervention.

Basically the aim of the brand new platform is to resolve a problem that Databricks customers had with current AI agent improvement efforts.

“They had been flying blind, that they had no solution to consider these brokers,” Hanlin Tang, Databricks’ Chief Know-how Officer of Neural Networks, advised VentureBeat. “Most of them had been counting on a type of handbook, handbook vibe monitoring to see if the agent sounds adequate, however this doesn’t give them the arrogance to enter manufacturing.”

From analysis innovation to enterprise AI manufacturing scale

Tang was beforehand the co-founder and CTO of Mosaic, which was acquired by Databricks in 2023 for $1.3 billion.

At Mosaic, a lot of the analysis innovation didn’t essentially have a direct enterprise affect. That every one modified after the acquisition.

“The massive mild bulb second for me was once we first launched our product on Databricks, and immediately, in a single day, we had, like hundreds of enterprise prospects utilizing it,” Tang stated.

In distinction, previous to the acquisition Mosaic would spend months attempting to get only a handful of enterprises to check out merchandise. The combination of Mosaic into Databricks has given Mosaic’s analysis crew direct entry to enterprise issues at scale and revealed new areas to discover.

This enterprise contact revealed new analysis alternatives.

“It’s solely when you may have contact with enterprise prospects, you’re employed with them deeply, that you just truly uncover type of attention-grabbing analysis issues to go after,” Tang defined. “Agent Bricks….is, in some methods, type of an evolution of every part that we’ve been engaged on at Mosaic now that we’re all totally, totally bricksters.”

Fixing the agentic AI analysis disaster

Enterprise groups face a pricey trial-and-error optimization course of. With out task-aware benchmarks or domain-specific take a look at information, each agent adjustment turns into an costly guessing sport. High quality drift, value overruns and missed deadlines comply with.

Agent Bricks automates your complete optimization pipeline. The platform takes a high-level process description and enterprise information. It handles the remaining robotically.

First, it generates task-specific evaluations and LLM judges. Subsequent, it creates artificial information that mirrors buyer information. Lastly, it searches throughout optimization methods to search out the most effective configuration.

“The shopper describes the issue at a excessive stage they usually don’t go into the low stage particulars, as a result of we deal with these,” Tang stated. “The system generates artificial information and builds customized LLM judges particular to every process.”

The platform gives 4 agent configurations:

Info Extraction: Converts paperwork (PDFs, emails) into structured information. One use case may very well be retail organizations that use it to drag product particulars from provider PDFs, even with advanced formatting.
Data Assistant: Gives correct, cited solutions from enterprise information. For instance, manufacturing technicians can get immediate solutions from upkeep manuals with out digging via binders.
Customized LLM: Handles textual content transformation duties (summarization, classification). For instance, healthcare organizations can customise fashions that summarize affected person notes for scientific workflows.
Multi-Agent Supervisor: Orchestrates a number of brokers for advanced workflows. One use case instance is monetary providers companies that may coordinate brokers for intent detection, doc retrieval and compliance checks.

Brokers are nice, however don’t overlook about information

Constructing and evaluating brokers is a core a part of making AI enterprise prepared, however it’s not the one half that’s wanted.

Databricks positions Mosaic Agent Bricks because the AI consumption layer sitting atop its unified information stack. On the Knowledge + AI Summit, Databricks additionally introduced the final availability of its Lakeflow information engineering platform, which was first previewed in 2024.

Lakeflow solves the information preparation problem. It unifies three vital information engineering journeys that beforehand required separate instruments. Ingestion handles getting each structured and unstructured information into Databricks. Transformation offers environment friendly information cleansing, reshaping and preparation. Orchestration manages manufacturing workflows and scheduling.

The workflow connection is direct: Lakeflow prepares enterprise information via unified ingestion and transformation, then Agent Bricks builds optimized AI brokers on that ready information.

“We assist get the information into the platform, after which you are able to do ML, BI and AI analytics,” Bilal Aslam, Senior Director of Product Administration at Databricks advised VentureBeat.

Going past information ingestion, Mosaic Agent Bricks additionally advantages from Databricks’ Unity Catalog’s governance options. That features entry controls and information lineage monitoring. This integration ensures that agent habits respects enterprise information governance with out extra configuration.

Agent Studying from Human Suggestions eliminates immediate stuffing

One of many widespread approaches to guiding AI brokers at present is to make use of a system immediate. Tang referred to the observe of ‘immediate stuffing’ the place customers shove all types of steering right into a immediate within the hope that the agent will comply with it.

Agent Bricks introduces a brand new idea referred to as – Agent Studying from Human Suggestions. This function robotically adjusts system parts based mostly on pure language steering. It solves what Tang calls the immediate stuffing downside. In response to Tang, the immediate stuffing strategy usually fails as a result of agent methods have a number of parts that want adjustment.

Agent Studying from Human Suggestions is a system that robotically interprets pure language steering and adjusts the suitable system parts. The strategy mirrors reinforcement studying from human suggestions (RLHF) however operates on the agent system stage slightly than particular person mannequin weights.

The system handles two core challenges. First, pure language steering may be imprecise. For instance, what does ‘respect your model’s voice’ truly imply? Second, agent methods include quite a few configuration factors. Groups wrestle to establish which parts want adjustment.

The system eliminates the guesswork about which agent parts want adjustment for particular behavioral adjustments.

“This we consider will assist brokers change into extra steerable,” Tang stated.

Technical benefits over current frameworks

There isn’t any scarcity of agentic AI improvement frameworks and instruments out there at present. Among the many rising checklist of vendor choices are instruments from Langchain, Microsoft and Google.

Tang argued that what makes Mosaic Agent Bricks totally different is the optimization. Quite than requiring handbook configuration and tuning, Agent Bricks incorporates a number of analysis methods robotically: TAO, in-context studying, immediate optimization and fine-tuning.

In the case of agent to agent communications, there are a number of choices out there at present, together with Google’s Agent2Agent protocol. In response to Tang, Databricks is at present exploring numerous agent protocols and hasn’t dedicated to a single normal.

At the moment, Agent Bricks handles agent-to-agent communication via two main strategies:

Exposing brokers as endpoints that may be wrapped in several protocols.
Utilizing a multi-agent supervisor that’s MCP (Mannequin Context Protocol) conscious.

Strategic implications for enterprise decision-makers

For enterprises seeking to paved the way in AI, it’s vital to have the appropriate applied sciences in place to guage high quality and effectiveness.

Deploying brokers with out analysis isn’t going to result in an optimum consequence and neither will having brokers and not using a stable information basis. When contemplating agent improvement applied sciences, it’s vital to have correct mechanisms to guage the most effective choices.

The Agent Studying from Human Suggestions strategy can also be noteworthy for enterprise choice makers because it helps to information agentic AI to the most effective consequence.

For enterprises seeking to lead in AI agent deployment, this improvement means analysis infrastructure is not a blocking issue. Organizations can focus assets on use case identification and information preparation slightly than constructing optimization frameworks.

Supply hyperlink

Tags
AI
AI News

Buy now

Why most enterprise AI agents never reach production and how Databricks plans to fix it

From analysis innovation to enterprise AI manufacturing scale

Fixing the agentic AI analysis disaster

Brokers are nice, however don’t overlook about information

Agent Studying from Human Suggestions eliminates immediate stuffing

Technical benefits over current frameworks

Strategic implications for enterprise decision-makers

Related Articles

My Sonos Arc Ultra faced an unexpected challenger – and the...

Tim Cook says Apple is open to M&A on the AI...

Top AI Tools of 2025: The Ultimate Guide for Innovators

Leave a Reply Cancel reply

Latest Articles

My Sonos Arc Ultra faced an unexpected challenger – and the...

Tim Cook says Apple is open to M&A on the AI...

Top AI Tools of 2025: The Ultimate Guide for Innovators

Thinking of buying an Arm-based Windows PC? These three issues might...

Equity Live: From $300M seed rounds to data center builds, AI...