Eval coverage
95 / 103
Impact
+8
eval files
8→0
WARN
0
source changes
test:
no version bump
Scope — click a skill to see its derivation
Select a skill above to see how its
.eval.json is derived from test-cases.json.Derivation flow (per skill)
src/skills/<skill>/test-cases.json
──derive──▶
tests/evaluations/<skill>.eval.json
{ $schema, skill, version, evaluations[] } ──validate──▶ eval-completeness ✓ 0 WARN
{ $schema, skill, version, evaluations[] } ──validate──▶ eval-completeness ✓ 0 WARN
Execution order
- Read all 8
test-cases.json→ extract case ids + intent - Generate
<skill>.eval.json;expected_behavioraligned per case - Guard:
should_notmust not duplicaterules/(test rule #3) - Run
test:skills:functional→ assert WARN 8→0 - Branch
test/eval-coverage-8skills· playground · PR
Risk heatmap
| source code | none — additive test files only |
| expected_behavior alignment | medium — must match test-cases intent |
| should_not vs rules/ | medium — test #3 fails if duplicated |
| blast radius | low — new files, advisory gate |