ZenML

RAG-Powered Agent Assist Tool for Insurance Contact Centers

Allianz Direct 2024
View original source

Allianz Direct implemented a GenAI-powered agent assist tool using RAG to help contact center agents quickly and accurately answer customer questions about insurance policies. Built on the Databricks Data Intelligence Platform using Mosaic AI tools, the solution improved answer accuracy by 10-15% compared to their previous system, while allowing agents to focus more on customer relationships rather than searching through documentation.

Industry

Insurance

Technologies

Overview

Allianz Direct is a Munich-based online insurance company operating as a subsidiary of the global insurance leader Allianz Group. With a strategic mission to become “digitally unbeatable,” the company embarked on a generative AI journey to enhance customer touchpoints and improve the efficiency of their contact center operations. This case study examines their implementation of a RAG-based agent assist tool, which represents their first production GenAI use case built on the Databricks Data Intelligence Platform.

The fundamental business problem Allianz Direct aimed to solve was not about reducing call times or replacing human agents with automation. Instead, their CTO Des Field Corbett emphasized that the goal was to eliminate mundane back-office tasks that agents dislike, freeing them to spend more time in meaningful conversations with customers. This human-centric approach to AI deployment is notable and reflects a thoughtful strategy for technology adoption in customer service environments.

Technical Implementation and Architecture

The solution leverages Databricks Mosaic AI Agent Framework to build a RAG (Retrieval-Augmented Generation) based agent assist application. The system ingests all of Allianz Direct’s product terms and conditions documents and intelligently delivers the right information to contact center agents at the right time when customers call with policy questions. Common queries include questions like “Can my brother drive my car?” which require agents to quickly navigate complex policy documentation.

From an infrastructure perspective, Allianz Direct benefits from their existing Databricks foundation. According to Field Corbett, it was “pretty much zero effort” to get started since they already had Databricks as their data platform. The company uses AWS as their cloud provider and leverages Unity Catalog for unified governance of all data, analytics, and AI assets. This pre-existing infrastructure meant the team could focus on the GenAI application logic rather than building foundational data infrastructure from scratch.

The development workflow utilized Databricks Notebooks, which Field Corbett noted made it “very simple for our developers to make workflow changes.” This approach streamlined both the development and implementation process compared to their previous GenAI solution, though the case study does not provide specific details about what that prior solution entailed or how it was architecturally different.

Human-in-the-Loop Design and Compliance Considerations

A particularly noteworthy aspect of this implementation is the human-in-the-loop design pattern. Rather than having the GenAI system respond directly to customers, the tool provides answers to human agents who then relay the information during customer calls. This architectural decision provided multiple benefits from a compliance and governance perspective.

First, using publicly available terms and conditions as the knowledge base meant there were no concerns about exposing sensitive customer data or proprietary information. Second, having a human agent as the intermediary between the AI system and the customer provided an additional layer of verification and quality control. This approach gave Allianz Direct confidence to proceed with deployment while their understanding of GenAI governance and compliance continues to mature.

The Allianz Group established an AI Data Council to address questions around governance and ethics. Field Corbett acknowledged that “there’s quite a bit of learning still to be done for GenAI governance and compliance,” but they are making sufficient progress that it is not preventing them from pursuing opportunities. This candid assessment reflects the reality many enterprises face when deploying generative AI—the need to balance innovation velocity with appropriate risk management.

Development Process and Vendor Collaboration

The case study highlights the collaborative relationship between Allianz Direct and Databricks during implementation. The Databricks team was embedded in Allianz Direct’s Slack channels, engaging directly with their engineers and helping resolve issues quickly. This close partnership accelerated the proof of concept timeline, though specific metrics on development duration are not provided.

Field Corbett credited the working relationship with Databricks as a key factor in their platform decision, noting that the team “helped point us in the right direction with GenAI.” This suggests that vendor support and expertise played a significant role in the success of the implementation, which is an important consideration for organizations evaluating GenAI platforms.

Results and Measured Outcomes

The proof of concept demonstrated a 10-15% improvement in answer accuracy compared to the previous GenAI application. While this quantitative metric is valuable, Field Corbett emphasized that the more significant outcome was the trust the tool engendered among contact center agents. Instead of having to search multiple systems and second-guess their answers, agents could rely on the GenAI tool to quickly deliver accurate information.

This trust led to increased adoption and, importantly, to agents suggesting additional use cases where GenAI could help them work more effectively and spend more time with customers. This organic adoption pattern represents a positive indicator for the long-term success of the initiative, as agent buy-in is often cited as a critical success factor for contact center technology implementations.

Future Vision and Scalability

Looking ahead, Field Corbett’s team envisions expanding GenAI capabilities significantly. One specific future use case mentioned is providing agents with contextual information about a customer before they even begin the conversation. When a customer calls, the agent would already see why they are likely calling, understand the context, and be able to assist more quickly.

The broader strategic vision is not focused on one or two large use cases but rather on enabling “a flood of ways GenAI can impact the business.” Databricks is positioned to support this by enabling multiple GenAI projects with short timelines and low costs, allowing Allianz Direct to select the right models for different use cases.

The scalability of the Databricks lakehouse architecture was cited as an original reason for platform selection, and this scalability now extends to GenAI workloads. Field Corbett’s goal is to empower more business users across the company to adopt GenAI capabilities, leveraging Unity Catalog to ensure everyone has secure access to the data they need.

Critical Assessment

While the case study presents positive results, it is important to note several limitations in the available information. The 10-15% accuracy improvement is mentioned without details on how accuracy was measured or what the baseline performance was. The case study also does not provide information on cost considerations, latency metrics, or the scale of deployment in terms of number of agents or queries processed.

Additionally, this is a Databricks customer story, so it naturally emphasizes the positive aspects of the platform. The comparison to the “previous GenAI solution” lacks specificity, making it difficult to assess whether the improvements are attributable to the platform choice or other factors such as better prompt engineering, different model selection, or refined RAG implementation.

Despite these limitations, the case study offers a useful example of a thoughtful, compliance-aware approach to deploying GenAI in a regulated industry. The human-in-the-loop architecture, focus on agent empowerment rather than replacement, and iterative expansion strategy represent practices that other organizations in similar industries might consider emulating.

More Like This

AI-Powered Customer Service and Call Center Transformation with Multi-Agent Systems

Fastweb / Vodafone 2025

Fastweb / Vodafone, a major European telecommunications provider serving 9.5 million customers in Italy, transformed their customer service operations by building two AI agent systems to address the limitations of traditional customer support. They developed Super TOBi, a customer-facing agentic chatbot system, and Super Agent, an internal tool that empowers call center consultants with real-time diagnostics and guidance. Built on LangGraph and LangChain with Neo4j knowledge graphs and monitored through LangSmith, the solution achieved a 90% correctness rate, 82% resolution rate, 5.2/7 Customer Effort Score for Super TOBi, and over 86% One-Call Resolution rate for Super Agent, delivering faster response times and higher customer satisfaction while reducing agent workload.

customer_support chatbot question_answering +32

Building Economic Infrastructure for AI with Foundation Models and Agentic Commerce

Stripe 2025

Stripe, processing approximately 1.3% of global GDP, has evolved from traditional ML-based fraud detection to deploying transformer-based foundation models for payments that process every transaction in under 100ms. The company built a domain-specific foundation model treating charges as tokens and behavior sequences as context windows, ingesting tens of billions of transactions to power fraud detection, improving card-testing detection from 59% to 97% accuracy for large merchants. Stripe also launched the Agentic Commerce Protocol (ACP) jointly with OpenAI to standardize how agents discover and purchase from merchant catalogs, complemented by internal AI adoption reaching 8,500 employees daily using LLM tools, with 65-70% of engineers using AI coding assistants and achieving significant productivity gains like reducing payment method integrations from 2 months to 2 weeks.

fraud_detection chatbot code_generation +57

GenAI Governance in Practice: Access Control, Data Quality, and Monitoring for Production LLM Systems

Xomnia 2025

Martin Der, a data scientist at Xomnia, presents practical approaches to GenAI governance addressing the challenge that only 5% of GenAI projects deliver immediate ROI. The talk focuses on three key pillars: access and control (enabling self-service prototyping through tools like Open WebUI while avoiding shadow AI), unstructured data quality (detecting contradictions and redundancies in knowledge bases through similarity search and LLM-based validation), and LLM ops monitoring (implementing tracing platforms like LangFuse and creating dynamic golden datasets for continuous testing). The solutions include deploying Chrome extensions for workflow integration, API gateways for centralized policy enforcement, and developing a knowledge agent called "Genie" for internal use cases across telecom, healthcare, logistics, and maritime industries.

healthcare customer_support document_processing +31