Troubleshooting¶

Common errors and how to fix them.

Exit codes: Configuration and input errors (e.g. trace not found, invalid YAML) produce exit code 2. Infrastructure and judge errors (e.g. API unavailable, rate limit) produce exit code 3. For precise handling, use reason_code in run.json or summary.json; see run reference for the full table and compatibility (v1 vs v2).

Configuration Errors (Exit Code 2)¶

Missing configVersion¶

fatal: ConfigError: missing required field 'configVersion'

Fix: Add configVersion: 1 at the top of your config:

configVersion: 1  # Add this line
suite: my_suite
tests:
  # ...

YAML Parse Error¶

fatal: ConfigError: failed to parse YAML: did not find expected node content at line 14 column 1, while parsing a flow node

Common causes:

Missing colon after key:

# Wrong
type args_valid

# Correct
type: args_valid

Incorrect indentation:

# Wrong (mixed tabs/spaces)
tests:
 - id: test1  # Tab character

# Correct (2 spaces)
tests:
  - id: test1

Unquoted special characters:

# Wrong
pattern: [a-z]+

# Correct
pattern: "[a-z]+"

Debug tip: Use a YAML validator like yamllint to find syntax errors.

Unknown Policy Type¶

fatal: ConfigError: unknown policy type 'custom_check' in test 'my_test'

Fix: Use one of the supported policy types:

args_valid
sequence_valid
tool_blocklist
regex_match

Duplicate Test ID¶

fatal: ConfigError: duplicate test id 'my_test'

Fix: Ensure all test IDs are unique within the suite.

Test Failures (Exit Code 1)¶

Missing Required Tool¶

❌ test_flow        failed: sequence_valid  (0.0s)
      Message: Missing required tool: notify_slack

What it means: Your config requires notify_slack to be called, but the trace doesn't contain that tool call.

Possible fixes:

Update the trace: Record a new trace that includes the tool call
Remove the requirement: If the tool is optional, remove the require rule
Check tool name spelling: Ensure the tool name matches exactly

Blocked Tool Called¶

❌ security_test        failed: tool_blocklist  (0.0s)
      Message: Blocked tool called: delete_users

What it means: The agent called a tool that's on your blocklist.

Possible fixes:

Fix the agent: The agent shouldn't call this tool
Update blocklist: If the tool is now allowed, remove it from blocked

Sequence Violation¶

❌ migration_flow        failed: sequence_valid  (0.0s)
      Message: Order violation: run_migration called before create_backup

What it means: Tools were called in the wrong order.

Possible fixes:

Fix the agent logic: Ensure tools are called in the correct order
Update the rule: If the order doesn't matter, remove the before rule

Schema Validation Failed¶

❌ deploy_test        failed: args_valid  (0.0s)
      Message: Argument validation failed for deploy_service:
        - port: expected integer, got string "8080"

What it means: The tool was called with arguments that don't match the schema.

Possible fixes:

Fix the agent: Ensure arguments have correct types
Loosen the schema: If string is acceptable, update the schema

Regex Not Matched¶

❌ output_test        failed: regex_match  (0.0s)
      Message: Output did not match pattern: "temperature is \d+ degrees"

What it means: The agent's output doesn't match the expected pattern.

Debug tip: Check the actual output in the trace file to see what was returned.

Trace Issues (Exit Code 2)¶

Trace and input errors use exit code 2 with reason code E_TRACE_NOT_FOUND (or similar) in run.json / summary.json.

Trace File Not Found¶

fatal: IOError: trace file not found: traces/golden.jsonl

Exit code: 2. Reason code: E_TRACE_NOT_FOUND (in run.json / summary.json).

Fix: Check the path and ensure the file exists:

ls -la traces/

Invalid Trace Format¶

fatal: TraceError: invalid JSON at line 42: expected ',' or '}'

Fix: Validate the JSONL file:

# Check for JSON errors
cat trace.jsonl | jq -c . > /dev/null

Empty Trace¶

fatal: TraceError: trace file is empty: traces/empty.jsonl

Fix: Ensure your recording captured events. Re-record if necessary.

Judge / API Unavailable (Exit Code 3)¶

When the judge service or provider is unreachable or returns an error, Assay exits with exit code 3 and a reason code such as E_JUDGE_UNAVAILABLE, E_RATE_LIMIT, E_PROVIDER_5XX, or E_TIMEOUT in run.json / summary.json.

Typical causes:

Judge/API endpoint down or returning 5xx
Rate limit hit (provider or judge)
Network timeout or connectivity issues

What to do:

Check the error message and reason_code in run.json or summary.json.
Retry after a short delay (rate limits) or verify the service is up.
For compatibility and the full reason code list, see run.md — Exit codes and compatibility.

Cache Issues¶

Unexpected Skips¶

Running 5 tests...
⏭️  test_1        skipped (fingerprint match)
⏭️  test_2        skipped (fingerprint match)
⏭️  test_3        skipped (fingerprint match)

What it means: Tests are being skipped because the trace fingerprint matches a previous run.

To force re-run:

# Option 1: Use fresh database
assay run --config eval.yaml --trace-file trace.jsonl --db :memory:

# Option 2: Delete the cache
rm -rf .assay/store.db

Cache Corruption¶

fatal: CacheError: failed to read cache: database disk image is malformed

Fix: Delete and rebuild the cache:

rm -rf .assay/
assay run --config eval.yaml --trace-file trace.jsonl

Migration Issues¶

External Policy Not Found¶

fatal: MigrationError: could not read policy file: policies/args.yaml

Fix: Ensure the policy file exists at the referenced path.

Already Migrated¶

warn: Config already has configVersion: 1, skipping migration

What it means: The config is already in v1 format. No action needed.

Python / SDK Issues¶

`pip install assay` vs `assay-it`¶

If you ran pip install assay, you installed an unrelated package.

Fix:

pip uninstall assay
pip install assay-it

Module Not Found¶

ModuleNotFoundError: No module named 'assay'

Fix: Ensure you have installed the package (it exposes the assay module):

pip install assay-it

Trace Recording Empty¶

If your trace file is created but has no events:

Ensure you call writer.write_trace() or use the context manager.
Check if record_chat_completions_with_tools actually ran.

CI/CD Issues¶

Non-Zero Exit in CI¶

Error: Process completed with exit code 1.

Meaning: One or more tests failed. Check the logs for specific failures.

Common CI fixes:

Ensure trace files are committed:

- uses: actions/checkout@v4
  with:
    lfs: true  # If using Git LFS for traces

Use correct paths:

- run: assay run --config ./path/to/eval.yaml --trace-file ./path/to/trace.jsonl

Install Assay in CI:

- name: Install Assay
  run: cargo install assay-cli

Permission Denied¶

fatal: IOError: permission denied: .assay/store.db

Fix: Ensure the runner has write permissions, or use in-memory mode:

assay run --config eval.yaml --trace-file trace.jsonl --db :memory:

Getting Help¶

If you're stuck:

Enable debug logging:

RUST_LOG=assay=debug assay run --config eval.yaml --trace-file trace.jsonl

Check the GitHub Issues: github.com/Rul1an/assay/issues
File a bug report with:
Assay version (assay --version)
Full error output
Minimal config to reproduce

Troubleshooting¶

Configuration Errors (Exit Code 2)¶

Missing configVersion¶

YAML Parse Error¶

Unknown Policy Type¶

Duplicate Test ID¶

Test Failures (Exit Code 1)¶

Missing Required Tool¶

Blocked Tool Called¶

Sequence Violation¶

Schema Validation Failed¶

Regex Not Matched¶

Trace Issues (Exit Code 2)¶

Trace File Not Found¶

Invalid Trace Format¶

Empty Trace¶

Judge / API Unavailable (Exit Code 3)¶

Cache Issues¶

Unexpected Skips¶

Cache Corruption¶

Migration Issues¶

External Policy Not Found¶

Already Migrated¶

Python / SDK Issues¶

pip install assay vs assay-it¶

Module Not Found¶

Trace Recording Empty¶

CI/CD Issues¶

Non-Zero Exit in CI¶

Permission Denied¶

Getting Help¶

`pip install assay` vs `assay-it`¶