Trust-Chain Trifecta: Experiment Synthesis (Q2 2026)¶

Status¶

All three experiments complete. This document synthesizes the cross-experiment findings.

The Three Experiments¶

Experiment	Perspective	What it tests
Memory Poisoning	Producer-side	Can poisoned state entries shift classification later?
Delegation Spoofing	Adapter-side	Can capability/provenance/identity claims deceive consumers?
Protocol Evidence Interpretation	Consumer-side	Can incomplete reads silently downgrade trust decisions?

Consolidated Results¶

Experiment	A (unprotected)	B (integrity/precedence)	C (full stack)	FPBR
Memory Poisoning	DASR 100%	DASR 25%	DASR 0%	0%
Delegation Spoofing	4/4 bypass	¼ bypass	0/4 bypass	0%
Protocol Evidence	CCAR 0%	CCAR 50%	CCAR 100%	0%

Each experiment uses the primary metric most appropriate to its perspective: activation rate for producer-side, bypass count for adapter-side, and canonical consumer agreement for consumer-side.

The Pattern¶

All three experiments independently confirm the same structural principle:

Integrity/precedence (Condition B) handles most attacks. The hardest attack in each experiment requires a deeper verification layer (Condition C). Zero false positives across all experiments and conditions.

What B blocks¶

Experiment	Vectors blocked by B	Mechanism
Memory Poisoning	V1, V2, V4	Content-addressed hash comparison
Delegation Spoofing	V1, V2, V4	Claim-vs-emission, provenance tracking
Protocol Evidence	V1, V3	Read-path precedence

What only C blocks¶

Experiment	Hardest vector(s)	C-layer mechanism
Memory Poisoning	V3 (context envelope)	Field provenance validation
Delegation Spoofing	V3 (identity spoofing)	Trust-domain verification
Protocol Evidence	V2 + V4 (precedence + projection)	Tier-1 enforcement + completeness validation

The meta-lesson¶

The hardest attacks are never broken data. They are: - Schema-valid: pass all structural checks - Protocol-plausible: could realistically occur in production - Semantically misleading: shift interpretation through ambiguity, not corruption

Hashes, integrity checks, and read-path awareness catch most threats. But the decisive defense against the hardest attacks is always about interpreting origin, provenance, and completeness correctly — not just checking that bytes are intact.

Design Principles (derived from all three experiments)¶

1. Field provenance is a security control¶

A field's presence must be validated against when it was produced, not just whether it passes schema validation. (Memory Poisoning V3)

2. Trust-domain verification must go beyond metadata¶

Source URN, protocol name, and capability claims are insufficient for trust. Adapter identity must be pinned or signed. (Delegation Spoofing V3)

3. Consumer read precedence must be binding¶

consumer_read_path and required_consumer_fields are not optional metadata. They are security-critical contract fields that determine classification correctness. (Protocol Evidence V2, V4)

4. Compat signals are trust signals¶

ConsumerPayloadState::CompatibilityFallback is not decorative. Treating it as equivalent to Converged creates a silent downgrade path. (Protocol Evidence V3)

5. Completeness must be enforced, not inferred¶

Missing required fields must trigger fail-closed or explicit fallback — never silent path degradation. (Protocol Evidence V4)

Recommendations¶

Preserve as invariants¶

Content-addressed hashing on replay baselines and state snapshots
Field provenance validation in context contract
Trust-domain verification for adapter identity
consumer_read_path as mandatory consumer contract
required_consumer_fields completeness enforcement
Deny convergence tier-1 precedence (decision_outcome_kind wins)

Consumer/SDK guidance¶

Fail-closed on missing required fields
Follow canonical 4-tier deny precedence, not legacy flags
Preserve all required fields in forwarding chains or explicitly signal loss
Treat CompatibilityFallback as distinct from Converged

What not to do¶

Do not add new consumer fields without updating required_consumer_fields
Do not treat provenance/compat signals as optional metadata
Do not bypass trust-domain checks for convenience

Practical Implications¶

For each layer in the trust chain: - Producers must preserve field provenance — when a field was produced matters as much as its value - Adapters must preserve trust-domain identity — source URN and metadata alone are not sufficient - Consumers must fail-closed on missing required fields and follow canonical tier precedence

Scope Boundary¶

This synthesis is bounded to deterministic structural attacks on trust interpretation. It does not claim coverage over semantic persuasion, multi-agent planning, or control-plane compromise. All attacks are schema-valid, protocol-plausible, and tested without LLM involvement.

Next Frontier¶

The next open question is not whether the trust chain works in isolation, but whether external integrations and SDK consumers preserve these invariants end-to-end.

Experiment Infrastructure (snapshot as of 2026-03-15)¶

Metric	Value
Total attack vectors	12 (4 per experiment)
Total benign controls	9 (3 per experiment)
Total conditions tested	9 (3 per experiment)
Total test runs	87 (21 per matrix + overhead)
Total tests	46 (unit + integration across all three)
False positives	0 across all experiments and conditions (within bounded benign controls)
LLM calls	0 (all deterministic structural testing)
Runtime pipeline changes	0