AAgentProof
Trial workspace · Sample improvement

Improvement cycle for Sample: Order-routing assistant

Three sample improvements with what-good-looks-like, evidence to collect, related Learn topics, and a clear reassessment trigger.

Improvement guidance

Recommended improvements with Learn links + evidence to collect

  • Pin a written action-authority scope

    Why it matters. Bounded action authority is the single biggest lever for a Zone 3 agent.

    What good looks like

    A one-page document that lists record types, operations, and a denylist. Signed by the agent owner and the reviewer.

    Evidence to collect

    Signed scope document + a screenshot of the agent refusing an out-of-scope action.

    Reassess after

    Re-run readiness assessment once the scope is signed.

  • Stand up a reviewer queue with SLA

    Why it matters. Drafts without a reviewer become customer-facing mistakes.

    What good looks like

    A queue with a 4-hour SLA, a documented escalation path, and one approved + one rejected draft on file.

    Evidence to collect

    Reviewer-queue screenshot + transcripts of one approve and one reject decision.

    Reassess after

    Re-run readiness once the queue has 7 days of activity.

  • Build an adversarial-prompt test suite

    Why it matters. If a refusal pattern silently breaks, you won't know.

    What good looks like

    10–20 adversarial prompts the agent must refuse, run on a schedule, with results archived.

    Evidence to collect

    Test-suite transcripts and a 30-day archive of audit traces.

    Reassess after

    Re-run readiness once the suite has 14 days of green results.

Where to go next