{{TITLE}}

{{META}}
{{LEGEND}}

Summary

{{SUMMARY_ROWS}}
MetricModel AModel BWinner

Labelled quality

{{QUALITY_SECTION}}

Latency distribution (single + batch)

Single-query (warm) p50 / p95 / p99 (ms)

{{CHART_WARM}}

Batch latency p50 / p95 / p99 by concurrency (ms)

{{CHART_BATCH_LATENCY}}

Throughput vs concurrency

{{CHART_THROUGHPUT}}

Storage

{{CHART_STORAGE}}

Query-level detail

Show top-5 results per model side-by-side ({{QUERY_COUNT}} queries; click a query to expand) {{QUERY_DETAILS}}

Recommendation

The picker below uses fixed thresholds (no machine-learned scorer). Each axis has an independent winner; the global "no clear winner" verdict means axes disagree. Pick the model that wins your highest-weighted axis.

{{RECOMMENDATION_TABLE}}
Disclaimers