ZenML Website

July 20, 2025

13 mins

LangGraph vs AutoGen: How are These LLM Workflow Orchestration Platforms Different?

In this LangGraph vs AutoGen article, we explain the differences between these platforms and when to use each one for the best results.

July 17, 2025

15 mins

LLMOps in Production: 287 More Case Studies of What Actually Works

The latest 287 curated summaries of LLMOps use cases in industry, spanning tech, healthcare, finance, and more. This blog post also highlights some of the trends observed across the case studies.

June 21, 2025

15 mins

We Tested 8 LangGraph Alternatives for Scalable Agent Orchestration

Discover the top 8 LangGraph alternatives for scalable agent orchestration.

March 10, 2025

5 mins

Chat With Your ML Pipelines: Introducing the ZenML MCP Server

Discover the new ZenML MCP Server, which brings conversational AI to ML pipelines. Learn how this implementation of the Model Context Protocol allows natural language interaction with your infrastructure, enabling querying, pipeline analytics, and run management through simple conversation. Explore the current features, engineering decisions, and future roadmap for this timely addition to the rapidly evolving MCP ecosystem.

January 20, 2025

45 mins

LLMOps in Production: 457 Case Studies of What Actually Works

A comprehensive overview of lessons learned from the world's largest database of LLMOps case studies (457 entries as of January 2025), examining how companies implement and deploy LLMs in production. Through nine thematic blog posts covering everything from RAG implementations to security concerns, this article synthesizes key patterns and anti-patterns in production GenAI deployments, offering practical insights for technical teams building LLM-powered applications.

January 13, 2025

7 mins

Optimizing LLM Performance and Cost: Squeezing Every Drop of Value

This comprehensive guide explores strategies for optimizing Large Language Model (LLM) deployments in production environments, focusing on maximizing performance while minimizing costs. Drawing from real-world examples and the LLMOps database, it examines three key areas: model selection and optimization techniques like knowledge distillation and quantization, inference optimization through caching and hardware acceleration, and cost optimization strategies including prompt engineering and self-hosting decisions. The article provides practical insights for technical professionals looking to balance the power of LLMs with operational efficiency.

December 2, 2024

9 mins

Building LLM Applications that Know What They're Talking About 🔓🧠

Explore real-world applications of Retrieval Augmented Generation (RAG) through case studies from leading companies in the ZenML LLMOps Database. Learn how RAG enhances LLM applications with external knowledge sources, examining implementation strategies, challenges, and best practices for building more accurate and informed AI systems.

December 2, 2024

6 mins

LLMOps Lessons Learned: Navigating the Wild West of Production LLMs 🚀

Explore key insights and patterns from 300+ real-world LLM deployments, revealing how companies are successfully implementing AI in production. This comprehensive analysis covers agent architectures, deployment strategies, data infrastructure, and technical challenges, drawing from ZenML's LLMOps Database to highlight practical solutions in areas like RAG, fine-tuning, cost optimization, and evaluation frameworks.

November 26, 2024

9 mins

Everything you ever wanted to know about LLMOps Maturity Models

As organizations rush to adopt generative AI, several major tech companies have proposed maturity models to guide this journey. While these frameworks offer useful vocabulary for discussing organizational progress, they should be viewed as descriptive rather than prescriptive guides. Rather than rigidly following these models, organizations are better served by focusing on solving real problems while maintaining strong engineering practices, building on proven DevOps and MLOps principles while adapting to the unique challenges of GenAI implementation.
