evaluation

The latest news, opinions and technical guides from ZenML.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

How I Built and Evaluated a Clinical RAG System with ZenML (and Why Custom Evaluation Matters)

On custom evaluation frameworks for clinical RAG systems, showing why domain-specific metrics matter more than plug-and-play solutions when trust and safety are non-negotiable.
Read post

The Annotated Guide to the Maven Evals Course (by way of the LLMOps Database)

Lessons from the Maven Evals course are combined with 50+ real-world case studies from ZenML's LLMOps Database to show how companies like Discord, GitHub, and Coursera implement the Three Gulfs model and Analyze-Measure-Improve lifecycle to transform failing LLM systems into production-ready applications.
Read post

LLMOps in Production: 287 More Case Studies of What Actually Works

287 latest curated summaries of LLMOps use cases in industry, from tech to healthcare to finance and more. This blog also highlights some of the trends observed across the case studies.
Read post
Oops, there are no matching results for your search.