Company
Intuit
Title
GenAI-Powered Dual-Loop System for Automated Documentation Management
Industry
Tech
Year
2024
Summary (short)
Intuit developed a sophisticated dual-loop GenAI system to address challenges in technical documentation management. The system combines an inner loop that continuously improves individual documents through analysis, enhancement, and augmentation, with an outer loop that leverages embeddings and semantic search to make knowledge more accessible. This approach not only improves document quality and maintains consistency but also enables context-aware information retrieval and synthesis.
## Overview

Intuit, a major financial software company known for products like TurboTax and QuickBooks, has developed a GenAI-powered pipeline to transform how they manage and utilize their internal technical documentation. The case study, published in March 2025, describes an architectural approach designed to address the perennial challenges organizations face with knowledge bases: variable documentation quality, outdated information, poor structure for information retrieval, and content that isn't written with specific audiences or use cases in mind.

The core innovation described is a "dual-loop system" that aims to continuously improve both the quality of documents in the knowledge base and the system's ability to extract relevant information to answer user queries. This represents an interesting production LLM use case focused on internal tooling and developer productivity rather than customer-facing applications.

## The Problem Space

The case study begins by framing the fundamental challenge: organizations have the tools to handle vast amounts of data, but finding the right information at the right time remains difficult, particularly for technical documentation. The more comprehensive documentation becomes, the harder it is to surface exactly the right information when engineers need it. This is a common pain point in large engineering organizations, where tribal knowledge and scattered documentation can significantly impact productivity.
Intuit identifies four key challenges with traditional documentation systems:

- Documentation quality varies significantly depending on the author
- It's difficult to determine if retrieved information is current
- Documents aren't necessarily structured for easy parsing of relevant information
- Documentation is often not written with specific target audiences and use cases in mind

## The Dual-Loop Architecture

The solution architecture employs what Intuit calls a "dual-loop system" - a design pattern that separates concerns between content preparation and content retrieval while enabling feedback between the two.

### Inner Loop: Document Enhancement Pipeline

The inner loop focuses on pre-processing and improving individual documents before they enter the knowledge base. This is a notable LLMOps pattern because it represents "intelligent pre-processing" - rather than simply indexing content as-is, the system actively transforms and improves it. The inner loop consists of five specialized GenAI plugins:

**Document Analyzer**: This component evaluates documents against a custom rubric, scoring them on structure, completeness, and comprehensibility. Documents that don't meet threshold scores are routed through subsequent enhancement plugins. This represents a quality-gate pattern that ensures minimum standards before content enters the knowledge base.

**Document Improvement Plugin**: Acting as what Intuit describes as "a skilled writer," this plugin restructures and enhances content for coherence and comprehensiveness. This addresses the variable-quality problem by normalizing content regardless of the original author's writing skills.

**Document Style Guide Plugin**: This ensures consistency across the knowledge base by modifying voice and style to adhere to organizational guidelines. This is an interesting application of LLMs for style transfer in technical writing contexts.
**Document Discoverability Plugin**: Perhaps the most interesting from an LLMOps perspective, this plugin modifies content to optimize for retrieval. It adds semantic context and links content to relevant user queries. This is essentially "SEO for RAG" - preparing documents not just for human readers but for the embedding and retrieval system that will later need to find them.

**Document Augmentation Plugin**: This plugin uses retrieval-augmented generation to pull in new, relevant information from other knowledge sources and update documents accordingly. This addresses the freshness problem by continuously enriching documents with current information.

### Outer Loop: Knowledge Retrieval

The outer loop handles the actual retrieval and synthesis of information in response to user queries. It consists of three components:

**Embedding Plugin**: Creates vector representations of documents for similarity search and content clustering. This is standard RAG infrastructure, enabling semantic rather than keyword-based retrieval.

**Search Plugin**: Uses semantic similarity to scan content and find the most relevant chunks for a given query. The description emphasizes context-appropriate results, suggesting some level of query understanding beyond simple vector similarity.

**Answer Plugin**: Synthesizes information from retrieved chunks into comprehensive answers. This is the generation component of the RAG pipeline.

## Feedback-Driven Improvement

One of the more sophisticated LLMOps patterns described is the feedback loop between query failures and system improvement. When the system fails to answer a query successfully, it triggers two types of updates:

- Updates to the Search Plugin to improve results for similar future queries
- Updates to base documents to add missing information

This represents a form of continuous learning or system adaptation that doesn't require model retraining.
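Since the case study shares no implementation details, the following is a minimal sketch of how the inner-loop quality gate and the failure-driven feedback updates might be orchestrated. All names, heuristics, and the threshold value are illustrative; in a real system, `analyze` and `enhance` would be LLM calls rather than stubs.

```python
from dataclasses import dataclass, field

THRESHOLD = 0.7  # minimum acceptable rubric score (illustrative value)

def analyze(doc: str) -> dict:
    """Document Analyzer stand-in: a production system would use an LLM
    judge to score structure, completeness, and comprehensibility."""
    return {
        "structure": 1.0 if doc.startswith("#") else 0.4,  # has a title heading?
        "completeness": min(len(doc.split()) / 50, 1.0),   # enough substance?
    }

def enhance(doc: str) -> str:
    """Stand-in for the improvement / style-guide / discoverability /
    augmentation plugins, each an LLM call in the real pipeline."""
    return "# Improved\n" + doc

def inner_loop(doc: str) -> str:
    """Quality gate: documents scoring below threshold on any rubric
    dimension are routed through the enhancement chain."""
    if min(analyze(doc).values()) < THRESHOLD:
        doc = enhance(doc)
    return doc

@dataclass
class FeedbackLoop:
    """Failure-driven updates: tune retrieval and flag docs for augmentation."""
    boosted_terms: dict = field(default_factory=dict)
    docs_to_augment: set = field(default_factory=set)

    def on_query_failure(self, query: str, doc_id: str) -> None:
        # Update path 1: bias the Search Plugin toward terms from failed queries.
        for term in query.lower().split():
            self.boosted_terms[term] = self.boosted_terms.get(term, 0) + 1
        # Update path 2: queue the nearest document for content augmentation.
        self.docs_to_augment.add(doc_id)
```

The key structural point the sketch captures is that both update paths are parameter and content changes, not weight updates to any model.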
Instead, the system evolves by modifying its retrieval parameters and enriching its content. This is a pragmatic approach to improvement that maintains the benefits of using foundation models while still allowing the system to become more effective over time.

The case study also mentions "context-aware enhancement" - the system improves its ability to answer questions in additional contexts, not just to add or update information. This suggests some form of query classification or intent understanding that influences how documents are tagged or how retrieval is parameterized.

## Technical Observations and Assessment

It's worth noting what the case study does and does not claim. The article is relatively light on specific technical details - there's no mention of which LLMs are used, what the vector store technology is, how the plugins are orchestrated, or what the latency characteristics of the pipeline are. There are also no quantitative results provided - no metrics on improvement in documentation quality, query success rates, or time saved by engineers. The case study positions itself more as an architectural pattern than a benchmarked solution.

The individual techniques mentioned - vector embeddings, RAG, LLM-based content analysis and generation - are indeed well-established. The contribution appears to be in their orchestration into an end-to-end pipeline that addresses the full lifecycle from document ingestion through query answering.

The dual-loop concept is interesting from an LLMOps perspective because it separates the real-time query path (outer loop) from the batch processing path (inner loop). This is a common pattern in production ML systems because it allows the computationally expensive enhancement work to happen asynchronously while keeping query latency low. However, the case study doesn't explicitly discuss how these loops are scheduled or triggered.

The feedback mechanism is also architecturally significant.
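Returning to the real-time query path for a moment, the outer loop's embed-search-synthesize flow can be sketched as follows. A toy bag-of-words vector stands in for a real embedding model, and the Answer Plugin's LLM synthesis is reduced to concatenation; everything here is an illustrative assumption, not Intuit's implementation.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Embedding Plugin stand-in: bag-of-words instead of a learned vector model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def search(query: str, chunks: list[str], k: int = 2) -> list[str]:
    # Search Plugin: rank document chunks by semantic similarity to the query.
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

def answer(query: str, chunks: list[str]) -> str:
    # Answer Plugin: in production this would be an LLM synthesis call over
    # the retrieved chunks; here we simply join the retrieved context.
    return " | ".join(search(query, chunks))
```

Because embeddings for stored chunks can be computed ahead of time in the batch path, only the query embedding and the similarity ranking sit on the latency-sensitive path.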
Many RAG implementations treat the knowledge base as static between manual updates. Building in automated feedback from query failures to both retrieval tuning and content updates represents a more sophisticated approach to maintaining system effectiveness over time. However, the case study doesn't address how query success or failure is determined, which is a non-trivial problem in itself.

## Broader Applicability

The case study concludes by noting that while the system was designed for technical documentation, the pattern could apply to other knowledge retrieval scenarios beyond software documentation. This is a reasonable claim - the components described are relatively domain-agnostic, though the specific rubrics and style guides would need customization for different content types.

From an LLMOps maturity perspective, this represents an organization moving beyond simple RAG implementations toward more sophisticated document lifecycle management. The emphasis on pre-processing quality, consistent styling, and discoverability optimization suggests lessons learned from operating simpler systems and encountering their limitations.

## Limitations and Considerations

Readers should note that this is a first-party case study from Intuit's engineering blog, so it naturally presents the solution in a favorable light. The lack of quantitative results means we cannot independently assess the effectiveness of the approach. The architectural concepts are sound, but the real-world performance, cost, and maintenance burden remain unspecified.

Additionally, systems that automatically modify documents raise governance questions: how are changes tracked, who reviews them, and how is the integrity of source documentation maintained? These operational concerns are not addressed in the case study but would be important considerations for organizations looking to implement similar approaches.
Overall, this case study provides a useful architectural reference for organizations looking to move beyond basic RAG implementations toward more comprehensive knowledge management solutions. The dual-loop pattern with feedback-driven improvement represents a mature approach to operating LLM-powered systems in production, even if the specific implementation details remain proprietary.
