Use Cases
Real-world scenarios where Assay shines.
Overview
Assay is designed for teams building production AI agents. Here are the most common use cases:
- CI Regression Gate: Catch breaking changes before they hit production. Every PR gets validated.
- Trace-Driven Debugging: Reproduce and diagnose production failures using recorded traces.
- Air-Gapped Enterprise: Run evaluations in secure environments with no external network access.
- Agent Self-Correction: Let agents validate their own actions before executing them.
Quick Comparison
| Use Case | Key Benefit | Typical User |
|---|---|---|
| CI Regression Gate | Zero-flake tests | DevOps, Platform |
| Trace-Driven Debugging | Fast root cause analysis | On-call Engineer |
| Air-Gapped Enterprise | Compliance, privacy | Security, FinTech |
| Agent Self-Correction | Runtime guardrails | Agent Developer |
By Industry
Financial Services
- Requirement: No data can leave the network
- Solution: Air-gapped deployment with local-only evaluation
- Metrics: Sequence validation (auth before transactions)
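To make "auth before transactions" concrete, here is a minimal sketch of a sequence check over a recorded list of tool calls. It is plain Python written for illustration, not Assay's API or configuration format; the function `first_violation` and the tool names (`authenticate`, `transfer_funds`, `withdraw`, `get_balance`) are all hypothetical.

```python
from typing import FrozenSet, Iterable, Optional


def first_violation(
    tool_calls: Iterable[str],
    required_first: str,
    guarded: FrozenSet[str],
) -> Optional[str]:
    """Return the first guarded tool called before `required_first`, or None."""
    seen_required = False
    for name in tool_calls:
        if name == required_first:
            seen_required = True
        elif name in guarded and not seen_required:
            # A guarded tool ran before the required step: ordering rule broken.
            return name
    return None


# Hypothetical tool names, for illustration only.
GUARDED = frozenset({"transfer_funds", "withdraw"})

good_trace = ["authenticate", "get_balance", "transfer_funds"]
bad_trace = ["get_balance", "transfer_funds", "authenticate"]

print(first_violation(good_trace, "authenticate", GUARDED))
print(first_violation(bad_trace, "authenticate", GUARDED))
```

The check needs only the recorded trace, so it involves no network call and gives the same answer on every run.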
Healthcare
- Requirement: HIPAA compliance, audit trails
- Solution: Trace recording + policy enforcement
- Metrics: Blocklist (no unauthorized data access)
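A blocklist check can be sketched in a few lines: scan the recorded tool calls and report any that appear on a deny list. This is an illustrative stand-in rather than Assay's implementation, and the helper `blocked_calls` and the tool names are assumptions made up for the example.

```python
from typing import FrozenSet, Iterable, List


def blocked_calls(tool_calls: Iterable[str], blocklist: FrozenSet[str]) -> List[str]:
    """Return every call in the trace that appears on the blocklist, in order."""
    return [name for name in tool_calls if name in blocklist]


# Hypothetical tool names, for illustration only.
BLOCKLIST = frozenset({"export_patient_records", "delete_audit_log"})

trace = ["lookup_appointment", "export_patient_records", "send_reminder"]

violations = blocked_calls(trace, BLOCKLIST)
if violations:
    print("blocked tools used:", violations)
else:
    print("trace is clean")
```

Returning the full list of offending calls, rather than a bare pass/fail flag, keeps the result usable as an audit record.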
E-commerce
- Requirement: Prevent pricing/discount errors
- Solution: Argument validation on business-critical tools
- Metrics: args_valid with min/max constraints
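The idea behind min/max argument constraints can be sketched as a bounds check on a tool call's arguments. The code below is a hedged illustration in plain Python and does not reproduce the `args_valid` configuration syntax; the `validate_args` helper, the constraint layout, and the `percent` field are hypothetical.

```python
from typing import Dict, List, Union

Number = Union[int, float]


def validate_args(
    args: Dict[str, Number],
    constraints: Dict[str, Dict[str, Number]],
) -> List[str]:
    """Return a list of human-readable errors; an empty list means valid."""
    errors = []
    for field, bounds in constraints.items():
        if field not in args:
            errors.append(f"{field}: missing")
            continue
        value = args[field]
        if "min" in bounds and value < bounds["min"]:
            errors.append(f"{field}: {value} is below min {bounds['min']}")
        if "max" in bounds and value > bounds["max"]:
            errors.append(f"{field}: {value} is above max {bounds['max']}")
    return errors


# Hypothetical rule: a discount tool may apply between 0 and 50 percent.
DISCOUNT_RULES = {"percent": {"min": 0, "max": 50}}

print(validate_args({"percent": 30}, DISCOUNT_RULES))
print(validate_args({"percent": 120}, DISCOUNT_RULES))
```

Collecting all errors instead of stopping at the first one lets a single run report every out-of-range argument in a call.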
SaaS Platforms
- Requirement: Fast iteration without breaking things
- Solution: CI gates on every PR
- Metrics: Full test suite in milliseconds
Getting Started
- Identify your pain point: Flaky tests? Slow CI? Compliance needs?
- Pick a use case: Start with one, expand later
- Follow the guide: Each use case has step-by-step instructions
- Measure results: Track time saved, failures caught