
In this episode, we discuss how to monitor the performance of Large Language Models (LLMs) in production environments. We explore common enterprise approaches to LLM deployment and evaluate the importance of monitoring for LLM quality or the quality of LLM responses over time. We discuss strategies for "drift monitoring" — tracking changes in both input prompts and output responses — allowing for proactive troubleshooting and improvement via techniques like fine-tuning or augmenting data sources. Read the article by Fiddler AI and explore additional resources on how AI observability can help developers build trust into AI services.
Podzilla Summary coming soon
Sign up to get notified when the full AI-powered summary is ready.
Free forever for up to 3 podcasts. No credit card required.

The Anatomy of Agentic Observability

Agentic Observability: The AI Architect's Essential Blueprint

How to Identify ML Drift Before You Have a Problem

Industry’s Fastest Guardrails Now Native to NVIDIA NeMo
Free AI-powered recaps of Safe and Sound AI and your other favorite podcasts, delivered to your inbox.
Free forever for up to 3 podcasts. No credit card required.