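Recommended by AI Skill Hub: LLM Dark Pattern Detection Tool is a high-quality Agent workflow. Its AI composite score is 7.2, a solid result among comparable tools. If you are looking for a reliable Agent workflow solution, it is worth a closer look.
An open-source workflow focused on detecting and analyzing dark-pattern designs in large language models. Built on a Claude code analysis framework, it helps AI safety researchers and developers identify harmful patterns, biases, and inappropriate behavior in models, improving the safety and reliability of AI systems.
LLM Dark Pattern Detection Tool is a complete AI Agent automation workflow. Using visual node orchestration, it breaks complex multi-step tasks into clear automated flows that run unattended from start to finish. It supports integration with hundreds of external services and APIs, and suits data-processing pipelines, business automation, and AI-assisted decision systems.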
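# Clone the repository
git clone https://github.com/waitdeadai/llm-dark-patterns
cd llm-dark-patterns
# Read the installation instructions
cat README.md
# Install the dependencies as described in the README, then start using the tool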
# Show help
llm-dark-patterns --help
# Basic invocation
llm-dark-patterns [options] <input>
# For detailed usage, see the documentation
# https://github.com/waitdeadai/llm-dark-patterns
# llm-dark-patterns configuration
# Show the configuration options
llm-dark-patterns --config-example > config.yml
# Common configuration keys
# output_dir: ./output
# log_level: info
# workers: 4
# Environment variable (overrides the configuration file)
export LLM_DARK_PATTERNS_CONFIG="/path/to/config.yml"
A suite of single-purpose Claude Code hooks that suppress LLM dark-pattern defaults — sycophancy, paternalism, false-success, permission-loops, training-cutoff confidence, and compaction amnesia — at the textual boundary, so power-user operators can actually work.
This repo is the umbrella for a series of small hook repos, umbrella-only legacy hooks that still live here, and the research-grade closeout physics engine in waitdeadai/agent-closeout-bench. Each public standalone hook remains separately installable. The physics-backed lane uses one reproducible engine with per-category rule packs, fixtures, and decision JSON.
That does not collapse every hook into one generic detector. Each hook maps to its own category engine; the shared Rust binary is packaging for reproducible hashing, safe regex compilation, fixture testing, telemetry discipline, and paper-grade evaluation.
The shared architecture is out-of-band textual enforcement at Claude Code hook boundaries. The judge is deterministic code, not another LLM call. That means the model cannot modify the hook's code path from inside its closeout text; it does not mean the system is impossible to bypass, misconfigure, or evade by paraphrase.
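The idea of a deterministic judge at the hook boundary can be sketched in a few lines of Bash. This is an illustrative sketch, not the repo's code: the function name `judge_closeout` and the two phrases are hypothetical, and it assumes the Claude Code convention that a Stop hook exiting with status 2 blocks the stop and passes stderr back to the model.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of a deterministic closeout judge (not the repo's code).
# The phrase list below is an illustrative assumption, not a shipped rule pack.

judge_closeout() {
  local text="$1"
  local pattern
  # Fixed-string, case-insensitive matching: plain code decides, no LLM call.
  for pattern in "you're absolutely right" "should work now"; do
    if printf '%s' "$text" | grep -qiF -- "$pattern"; then
      printf 'blocked: closeout contains "%s"\n' "$pattern" >&2
      return 2   # assumed convention: status 2 from a Stop hook blocks the stop
    fi
  done
  return 0
}

# Example calls: the first passes, the second is blocked.
judge_closeout "Ran the test suite; output is pasted above." && echo "allow"
judge_closeout "This should work now!" || echo "block (status $?)"
```

A paraphrase such as "this ought to be fine" would pass this check, which illustrates the evasion-by-paraphrase limit stated above: the closeout text cannot alter the code path, but fixed patterns only catch what they name.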
| Phase | Surface | Status |
|---|---|---|
| Phase 1 — Locale loader + English pack | lib/packs.sh, packs/locale/en.txt | ✓ ships |
| Phase 2 — Spanish + Polish locale packs | packs/locale/{es,pl}.txt | ✓ ships |
| Phase 3 — Evidence binary allowlist (devops/k8s/cloud/database/system) | packs/evidence/binaries.txt (9 sections, 200+ binaries) | ✓ ships |
| Phase 4 — Destructive command surface packs (filesystem, container, git-protected, config-overwrite, cloud-prod, database, service) | packs/destructive/*.txt (7 surfaces, 56 patterns) | ✓ ships |
| Phase 5 — Bypass hardening (clause-local negation, evidence proximity + action-verb) | hooks/no-vibes.sh | ✓ ships |
| Phase 6 — Physics-backed closeout adapters | agentcloseout-physics v0.2, per-category rule packs, Claude Code wrappers, PreToolUse tamper guard | ✓ ships in AgentCloseoutBench |
Operators with a non-English session, a non-app-dev toolchain, or a load-bearing destructive surface (kubectl, terraform, redis FLUSHALL, force-push to main) can extend coverage without forking by dropping a .txt into ${XDG_CONFIG_HOME:-$HOME/.config}/llm-dark-patterns/packs/<subdir>/<name>.txt. See ROADMAP.md for the architecture spec.
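As a rough sketch of the drop-in mechanism described above, the following Bash helpers resolve the user-level pack path and write a pack file there. Several details are assumptions rather than facts from the source: the `destructive` subdirectory name is inferred from `packs/destructive/*.txt`, the pack name `my-team` is made up, and the one-pattern-per-line file format is a guess, so check ROADMAP.md for the real format before relying on it.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: add a user-level pack without forking the repo.
# Assumed, not confirmed by the source: the subdirectory name, the pack name,
# and a one-pattern-per-line file format.

user_pack_path() {
  local subdir="$1" name="$2"
  printf '%s/llm-dark-patterns/packs/%s/%s.txt\n' \
    "${XDG_CONFIG_HOME:-$HOME/.config}" "$subdir" "$name"
}

install_user_pack() {
  local target
  target="$(user_pack_path "$1" "$2")"
  mkdir -p "$(dirname "$target")"
  cat > "$target"          # pack contents are read from stdin
  printf '%s\n' "$target"  # report where the pack was written
}

# Example: write a one-line pack into a throwaway config directory.
demo_dir="$(mktemp -d)"
XDG_CONFIG_HOME="$demo_dir" install_user_pack destructive my-team <<'EOF'
redis-cli FLUSHALL
EOF
```

Setting `XDG_CONFIG_HOME` to a temporary directory keeps the example away from a real configuration; dropping that prefix would write under `$HOME/.config` instead.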
claude plugin marketplace add waitdeadai/claude-plugins
claude plugin install llm-dark-patterns@waitdeadai-plugins
This installs all 31 wired hooks across Stop, SubagentStop, TaskCreated, TaskCompleted, PreToolUse, PostToolUse, PreCompact, PostCompact, and SessionStart events. Each hook remains independently disablable by editing hooks.json after install.
The self-hosted marketplace at waitdeadai/claude-plugins is the canonical install path because the Anthropic community marketplace pipeline has stalled for many submitters since at least March 2026. This plugin shows as Published in the submissions dashboard since 2026-05-11 but does not appear in the live claude-plugins-community/marketplace.json (verified 2026-05-17 — zero matches across 1715 entries; last bulk sync to that file was 2026-05-13 with no new syncs since).
The same pattern is documented across at least eight open issues on anthropics/claude-plugins-official: #984 (since 2026-03-25, 11 comments), #1272 (closed without resolution, 23+ "same here" comments), #1474, #1512, #1834, #1841, #1870, #1887. Two sync PRs (#18, #21) have been stuck unmerged for 12-15 days.
If Anthropic's pipeline resumes, the community-marketplace path becomes a redundant install option, but until then the self-hosted route above is the only one that actually resolves:
```bash
claude plugin marketplace add anthropics/claude-plugins-community
claude plugin install llm-dark-patterns@claude-community
```
The public standalone repos are still the simplest daily-use path when you want a subset rather than the whole suite. Install the single-file hooks that already have standalone repos:
```bash
mkdir -p .claude/hooks
```
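A niche tool offering an innovative approach to AI safety problems. The code is concise and tightly focused, though community recognition is still limited. It is well suited to in-depth use by security researchers.
AI Skill Hub is a third-party content aggregation platform. The information on this page is compiled from public data and does not constitute any legal endorsement of the tool's functionality or quality.
We recommend validating the tool thoroughly in a sandbox or test environment before deploying it to production, and carrying out the necessary security assessment.
✅ Apache 2.0: a permissive open-source license. Commercial use is allowed, copyright notices and the NOTICE file must be retained, and the license includes a patent grant.
Overall, LLM Dark Pattern Detection Tool is a good-quality Agent workflow that is reasonably competitive among comparable tools. AI Skill Hub will keep tracking its updates; consider bookmarking it and adopting it when it fits your own use case.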
| Original name | llm-dark-patterns |
| Original description | Open-source AI workflow: Umbrella for the LLM Dark Patterns Hooks suite - single-purpose Claude Code Stop. ⭐10 · Shell |
| Topics | AI safety, dark-pattern detection, Claude, workflow automation, Bash scripts |
| GitHub | https://github.com/waitdeadai/llm-dark-patterns |
| License | Apache-2.0 |
| Language | Shell |
Listed: 2026-05-22 · Updated: 2026-05-22 · License: Apache-2.0 · AI Skill Hub does not legally endorse the accuracy of third-party content.