Remove duplicate rows by field value
Taprun ships as a single MCP server exposing a catalog of compiled taps. One-time setup on macOS / Linux:
brew install LeonTing1010/tap/taprun
tap mcp connect
Or drop this into your claude_desktop_config.json (works identically in Claude Code, Cursor, Cline, Windsurf — any MCP host):
{
"mcpServers": {
"tap": {
"command": "tap",
"args": ["mcp", "start"]
}
}
}
tap/dedupeTerminal, once installed:
tap run tap/dedupe
From the MCP host — exact same compiled plan, deterministic replay, zero LLM tokens:
tap.run({ site: "tap", name: "dedupe" })
This plan was forged once — the AI read tap, picked stable structural addresses (JSON-LD, ARIA, RSS, or declared API endpoints, in that priority order), and saved them to a .plan.json. Every replay since then has used zero LLM tokens. When tap ships a site change that breaks the extraction, tap verify surfaces it before your data goes stale — not after your pipeline silently writes garbage for a week.
| tap | description |
|---|---|
tap/table | Format rows as a human-readable table (outputs one row per line) |
tap/filter | Filter rows where field matches condition (gt, lt, eq, contains) |
tap/sort | Sort rows by field (numeric or alphabetic, asc or desc) |
tap/pick | Select specific columns from rows (projection) |
tap/limit | Take first N rows (head) or skip M then take N (offset+limit) |