Rollout status
SHIPPED assess (#2235 pilot) ·
THIS PR review-pr ·
THIS PR audit-full
One shared engine (shared/rules/adversarial-refutation.md) + a thin per-skill adapter that binds it. All three evaluative skills now refute before they emit a headline score / merge verdict / severity report.
Per-skill bindings
Effort gate (when refutation runs)
| /effort | assess (Phase 2.5) | review-pr (Phase 4.5) | audit-full (STEP 3.5) |
|---|---|---|---|
| low / medium | skip | skip | skip |
| high | ≤4 single, advisory | ≤6 single, advisory | single, advisory |
| xhigh | 3-vote majority | 3 blocker / 2 HIGH | 3 CRITICAL / 2 HIGH |
Advisory = surfaced, never auto-applied. Quorum = majority with VERIFIED counter-evidence. A 1-of-N dissent never revises (records a caveat, drops confidence to low).