Don't Let Model Changes Break Your AI
AI models update silently, and yesterday's perfect responses can become today's problems. We test your prompts daily to catch changes before your users do.
Stop Model Changes From Breaking Production
Models can change without notice, so every morning, we test your prompts against known inputs. If today's responses don't match yesterday's patterns, you'll know instantly - before it affects your users.


Keep Your AI Responses Consistent
What worked yesterday should work today. Our daily testing compares new responses against your known-good baseline, alerting you the moment behavior starts to drift.
See Model Drift Detection In Action
Visit drift.getlibretto.com to watch live as we track how major AI models change over time. See example prompts being tested for drift with an array of major models.

Stay Ahead with Drift Detection

FAQs
Discover how Drift Detection keeps your LLM models reliable and consistent over time.
Drift Detection is a feature that monitors your LLM prompts daily. It identifies any changes in model responses over time. This ensures you stay informed about the performance and reliability of your AI models.
At the outset, Drift Detection runs a bunch of test cases through your model 100 times each. Then on every subsequent day, Drift Detection runs the test cases again and analyzes discrepancies to detect any shifts in model behavior. We catch these changes before they affect your users.
We help you take action:
1. See exactly what changed in responses
2. Test alternative prompts or models
3. Update your prompts to work with the changes
4. Verify your fixes work consistently
There are two ways to experience it:
First you can check our public dashboard at drift.getlibretto.com , which tracks 10 real prompts against major AI models and shows historical changes and trends.
Second, you can try it with your own prompts. Drift detection starts automatically in Libretto after we gather baseline data.
We start monitoring as soon as we have:
° At least 24 hours of traffic
° Enough diverse inputs to establish patterns
° Clear baseline behavior for your prompt
This usually takes 1-2 days of normal traffic, but it can take longer for low traffic prompts.