Capability tag: Agent workflow

PestPHP AI Evaluation Plugin

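Q: How do I install and start using pest-plugin-evals?

pest-plugin-evals is a PHP package, so no Python or Node.js runtime is needed. In a Laravel project that uses Pest, run composer require shipfastlabs/pest-plugin-evals --dev, place eval tests in tests/Evals/, and run them with pest --eval. The README in the GitHub repository has the full steps.

Q: Is pest-plugin-evals free? What license does it use?

pest-plugin-evals is free and open source under the MIT license, which permits use, modification, and distribution.

Q: Who is pest-plugin-evals for?

pest-plugin-evals is aimed at users with a technical background, such as developers, data analysts, and AI engineers.

Q: How active is the pest-plugin-evals community, and how well is the project maintained?

pest-plugin-evals has 11 stars and 2 forks on GitHub. It is a lightweight project that is updated as needed.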

PHP-based · LLM-as-judge evals for Laravel AI SDK agents

English name: pest-plugin-evals

⭐ 11 Stars 🍴 2 Forks 💻 PHP 📄 MIT 🏷 AI score 7.5

Topics: workflow, ai-sdk, evals, laravel, pest, test, php

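✦ Recommended by AI Skill Hub

After evaluation by AI Skill Hub, the PestPHP AI Evaluation Plugin is rated "Recommended". It scores well on feature completeness, community activity, and ease of use, with an AI score of 7.5, and suits users with some technical background.

📚 In-depth analysis

The PestPHP AI Evaluation Plugin is a testing tool for AI agents built with the Laravel AI SDK. Agent output is non-deterministic, so exact-match assertions alone are often not enough. The plugin adds LLM-as-judge scoring and embedding-based semantic similarity alongside deterministic matchers such as toContain and toMatch.

Setup is small: install the package with Composer, optionally publish config/eval.php, and set the scoring and embedding provider and model. Eval tests live in tests/Evals/ and are excluded from normal pest runs, so they only execute when pest --eval is used.

Before relying on an eval, run it 3 to 5 times in a test environment and check that the results match expectations. AI Skill Hub gives the plugin a score of 7.5.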

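📋 Tool overview

A PestPHP-based evaluation plugin for the Laravel AI SDK that uses an LLM as the judge and supports semantic evaluation.

It is a testing plugin rather than a visual workflow builder: evals are written as Pest tests in PHP and combine LLM-as-judge scoring, semantic similarity, and deterministic matchers behind a native Pest expect() API.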

GitHub Stars	⭐ 11
Forks	2
Language	PHP
Supported platforms	Windows / macOS / Linux
Maintenance status	Lightweight project, updated as needed
License	MIT
AI overall score	7.5
Tool type	Agent workflow

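📖 Documentation

The following content was compiled automatically by AI Skill Hub from project information. For the complete original documentation, follow the GitHub link in the source section at the bottom of the page.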

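📌 Key features

LLM-as-judge scoring with custom criteria (toPassJudge)
Embedding-based semantic similarity against a reference answer (toBeSimilar)
Relevance, safety, and factuality checks (toBeRelevant, toBeSafe, toBeFactual)
Tool-call and trajectory validation (toHaveToolCalls, toFollowTrajectory)
Faked mode and ->repeat(N) for fast iteration and repeated runs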

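🎯 Main use cases

Checking that an agent's answers stay on topic and explain the expected policy or fact
Fact-checking agent responses against a reference answer
Verifying that an agent calls the expected tools with the expected arguments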

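The install commands below are taken from the project README, which remains the authoritative source.

Install commands

# Add the plugin as a development dependency
composer require shipfastlabs/pest-plugin-evals --dev

# Publish the config file (optional)
php artisan vendor:publish --tag=eval-config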

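📋 Installation steps

Run composer require shipfastlabs/pest-plugin-evals --dev in your Laravel project
Optionally publish the config with php artisan vendor:publish --tag=eval-config
Set the scoring and embedding provider and model in .env, and keep API keys out of version control
Place your eval tests in tests/Evals/
Run pest --eval and confirm the evals pass before relying on them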

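The usage examples below cover the commands documented in the README.

Common commands

# Run eval tests (targets tests/Evals, falls back to --group=eval)
pest --eval

# Run the regular test suite (tests/Evals is excluded)
pest

# For detailed usage, see the documentation
# https://github.com/shipfastlabs/pest-plugin-evals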

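The configuration example below uses the environment variables read by config/eval.php, with the defaults from the README. Adjust the values to match the official documentation and your provider.

Configuration example

# .env
# Model used for LLM-based scoring
EVAL_SCORING_PROVIDER=openai
EVAL_SCORING_MODEL=gpt-4.1-mini

# Model used for embedding-based similarity
EVAL_EMBEDDING_PROVIDER=openai
EVAL_EMBEDDING_MODEL=text-embedding-3-small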

📑 README analysis · documentation completeness 70/100

The following content was parsed directly from the GitHub README, preserving code blocks, tables, and lists.

Introduction

Agent instance (with constructor dependencies)

it('evaluates a pre-configured agent', function () {
    $agent = new RefundAgent($user);

    expectAgent($agent, 'Can I return a damaged laptop?')
        ->toContain('refund')
        ->toPassJudge('Response explains the refund policy clearly');
});

You can also use Laravel's ::make() method:

it('evaluates agent created with make()', function () {
    expectAgent(RefundAgent::make(user: $user), 'Can I return a damaged laptop?')
        ->toContain('refund');
});

Installation

composer require shipfastlabs/pest-plugin-evals --dev

Publish the config (optional):

php artisan vendor:publish --tag=eval-config

Quick Start

use function ShipFastLabs\PestEval\expectAgent;

it('answers refund questions accurately', function () {
    expectAgent(RefundAgent::class, 'Can I return a damaged laptop?')
        ->toContain('refund')
        ->toContain('return')
        ->toPassJudge('Response explains the refund policy clearly')
        ->toBeRelevant(0.8);
});

Run your evals:

pest --eval

Eval tests are excluded from normal test runs automatically. Place your eval tests in tests/Evals/ — when you run pest without --eval, the plugin excludes that directory so evals never pollute your regular test suite.

pest --eval targets the tests/Evals directory. If it does not exist, it falls back to --group=eval.
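The targeting rule above can be sketched as a small function. This is only an illustration of the documented behaviour, not the plugin's own code; resolveEvalArguments and the temporary project directory are hypothetical names introduced for the example.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical helper: choose the extra Pest arguments for an eval run.
 * It mirrors the documented rule only and is not taken from the plugin.
 *
 * @return list<string>
 */
function resolveEvalArguments(string $projectRoot): array
{
    $evalDir = $projectRoot . '/tests/Evals';

    // Prefer the dedicated directory when it exists.
    if (is_dir($evalDir)) {
        return [$evalDir];
    }

    // Otherwise fall back to tests tagged with the "eval" group.
    return ['--group=eval'];
}

// A throwaway project that has tests/ but no tests/Evals yet.
$root = sys_get_temp_dir() . '/eval-demo-' . uniqid();
mkdir($root . '/tests', 0777, true);
$withoutDir = resolveEvalArguments($root);

// Once tests/Evals exists, the directory is targeted instead.
mkdir($root . '/tests/Evals');
$withDir = resolveEvalArguments($root);
```

Keeping the rule in one place like this makes the fallback easy to reason about: the directory wins when present, and the group filter is used only when it is missing.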

Usage Examples

Configuration

// config/eval.php
return [
    'ai' => [
        'scoring' => [
            'provider' => env('EVAL_SCORING_PROVIDER', 'openai'),
            'model' => env('EVAL_SCORING_MODEL', 'gpt-4.1-mini'),
        ],
        'embedding' => [
            'provider' => env('EVAL_EMBEDDING_PROVIDER', 'openai'),
            'model' => env('EVAL_EMBEDDING_MODEL', 'text-embedding-3-small'),
        ],
    ],
];
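Each key in config/eval.php reads an environment variable and falls back to the default shown. A minimal sketch of that fallback in plain PHP follows. envOr is a hypothetical stand-in for Laravel's env() helper, and my-judge-model is a placeholder rather than a real model name.

```php
<?php

declare(strict_types=1);

/** Hypothetical stand-in for Laravel's env() helper, built on getenv(). */
function envOr(string $key, string $default): string
{
    $value = getenv($key);

    // getenv() returns false when the variable is not set.
    return $value === false ? $default : $value;
}

// With the variable unset, the default from config/eval.php applies.
putenv('EVAL_SCORING_MODEL');
$defaultModel = envOr('EVAL_SCORING_MODEL', 'gpt-4.1-mini');

// With the variable set, its value overrides the default.
putenv('EVAL_SCORING_MODEL=my-judge-model');
$overriddenModel = envOr('EVAL_SCORING_MODEL', 'gpt-4.1-mini');
```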

Faked mode (fast iteration, no agent API calls)

it('eval pipeline works with faked responses', function () {
    expectAgent(
        RefundAgent::class,
        'What is your return policy?',
        fake: ['Our return policy allows returns within 30 days.'],
    )->toContain('30 days')
        ->toMatch('/\d+ days/');
});
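The two deterministic matchers in this example can be approximated with plain PHP string functions. The sketch below assumes toContain behaves like a substring check and toMatch like a regular-expression match; it does not show how the plugin implements them.

```php
<?php

declare(strict_types=1);

// The canned reply that the faked agent would return.
$response = 'Our return policy allows returns within 30 days.';

// toContain('30 days'), sketched as a substring check.
$containsPhrase = str_contains($response, '30 days');

// toMatch('/\d+ days/'), sketched as a regular-expression match.
$matchesPattern = preg_match('/\d+ days/', $response) === 1;

var_dump($containsPhrase, $matchesPattern);
```

Because the response is fixed, both checks are deterministic and need no model or network access, which is what makes faked mode suitable for fast iteration.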

Factuality check against reference

it('answers factually', function () {
    expectAgent(CapitalCityAgent::class, 'What is the capital of Japan?')
        ->toBeFactual(expected: 'Tokyo');
});

Custom Expectations Reference

Expectation	Description	Scorer used
`->toBeRelevant(0.7)`	Checks if response is on-topic	`Relevance`
`->toBeSafe(0.7)`	Evaluates for harmful content	`Safety`
`->toBeFactual(0.7, expected: '...')`	Fact-checks against reference	`Factuality`
`->toPassJudge('criteria', 0.7)`	Custom LLM evaluation	`LlmJudge`
`->toBeSimilar('ref', 0.7)`	Embedding cosine similarity	`SemanticSimilarity`
`->toHaveToolCalls([...])`	Validates tool calls/arguments	`ToolCallMatch`
`->toFollowTrajectory([...])`	Validates tool call sequence	`AgentTrajectory`
`->toPassScorer($scorer, 0.7)`	Use any custom `Scorer` instance	Any

All thresholds default to 0.7 and represent the minimum score (0.0-1.0) required to pass.
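To make the threshold rule concrete, here is a minimal sketch of an embedding cosine-similarity check in plain PHP. It assumes a score passes when it is at least the threshold; cosineSimilarity, passesThreshold, and the toy three-dimensional vectors are hypothetical, and real embeddings would come from the configured embedding model.

```php
<?php

declare(strict_types=1);

/**
 * Cosine similarity between two equal-length vectors.
 *
 * @param list<float> $a
 * @param list<float> $b
 */
function cosineSimilarity(array $a, array $b): float
{
    if (count($a) !== count($b) || $a === []) {
        throw new InvalidArgumentException('Vectors must be non-empty and of equal length.');
    }

    $dot = 0.0;
    $normA = 0.0;
    $normB = 0.0;

    foreach ($a as $i => $value) {
        $dot += $value * $b[$i];
        $normA += $value ** 2;
        $normB += $b[$i] ** 2;
    }

    if ($normA === 0.0 || $normB === 0.0) {
        return 0.0; // A zero vector has no direction to compare.
    }

    return $dot / (sqrt($normA) * sqrt($normB));
}

/** Hypothetical pass rule: the score must reach the minimum threshold. */
function passesThreshold(float $score, float $threshold = 0.7): bool
{
    return $score >= $threshold;
}

// Toy vectors standing in for real embeddings.
$response = [0.9, 0.1, 0.3];
$reference = [0.8, 0.2, 0.4];
$unrelated = [0.0, 1.0, 0.0];

$close = cosineSimilarity($response, $reference); // roughly 0.98
$far = cosineSimilarity($response, $unrelated);   // roughly 0.10
```

With the default threshold of 0.7, the first comparison would pass and the second would fail.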

`expectAgent()` API

expectAgent(
    string|Closure|Agent $agent, // Agent class name, closure, or instance
    string $prompt,              // The input prompt
    array $fake = [],            // Fake responses (bypasses agent execution)
    array $attachments = [],     // Files to pass to the agent (Document, Image)
): Expectation

// Chain ->repeat(N) for multiple runs:
->repeat(5)                  // Run agent 5 times, all assertions checked on every output
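The repeat semantics described above (every assertion checked on every output) can be sketched without the plugin. passesOnEveryRun and the canned replies are hypothetical; the stand-in agent cycles through fixed strings instead of calling a model.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical helper: call $agent $times times and require every
 * assertion to hold for every output.
 *
 * @param callable(): string           $agent
 * @param list<callable(string): bool> $assertions
 */
function passesOnEveryRun(callable $agent, int $times, array $assertions): bool
{
    for ($run = 0; $run < $times; $run++) {
        $output = $agent();

        foreach ($assertions as $assertion) {
            if (!$assertion($output)) {
                return false; // One failing run fails the whole eval.
            }
        }
    }

    return true;
}

// Stand-in agent that cycles through canned replies.
$replies = ['You can get a refund.', 'A refund is available.', 'Please contact support.'];
$calls = 0;
$agent = function () use (&$calls, $replies): string {
    return $replies[$calls++ % count($replies)];
};

$mentionsRefund = fn (string $output): bool => str_contains($output, 'refund');

// The first two replies both mention a refund.
$twoRuns = passesOnEveryRun($agent, 2, [$mentionsRefund]);

// Restart the cycle; the third reply does not mention a refund.
$calls = 0;
$threeRuns = passesOnEveryRun($agent, 3, [$mentionsRefund]);
```

Requiring every run to pass is the strict reading of "all assertions checked on every output": a single off-topic reply is enough to fail the eval.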

Pest Plugin Eval

A PestPHP plugin for evaluating Laravel AI SDK agents. Build evals with LLM-as-judge, semantic similarity, and deterministic matchers — all with a native Pest expect() API.

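🎯 aiskill88 AI review · Grade A · 2026-05-24

This is a PestPHP-based evaluation plugin for the Laravel AI SDK. It supports semantic evaluation and LLM-based evaluation, and is suited to assessing the performance of AI agents.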

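📚 Practical guide (long-tail questions)

Who it is for

Agent developers, including those building multi-agent systems, who work with the Laravel AI SDK
Teams building knowledge-base or RAG applications who need to check answer quality

Best practices

Use faked mode first to check the eval pipeline without agent API calls, then run against the real agent

Common mistakes

Committing API keys to the git repository (keep them in .env and add it to .gitignore)

Deployment

The plugin is installed as a development dependency (--dev), so it runs locally or in CI rather than on a hosting platform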

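👥 Target audience

Automation engineers and operations staff
Project managers and business analysts
Professionals who want to reduce repetitive work
Digital transformation teams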

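⚖️ Pros and cons

✅ Pros

+MIT license, free for commercial use
+Native Pest expect() API, so evals read like ordinary tests
+Faked mode allows fast iteration without agent API calls
+Extensible through custom Scorer instances

⚠️ Cons

−Initial configuration and debugging take some time
−LLM-judged and embedding-based checks depend on an external AI provider
−Complex scenarios require some technical background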

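⚠️ Usage notice

AI Skill Hub is a third-party content aggregation platform. The information on this page is compiled from public data and is not a legal endorsement of the tool's functionality or quality.

Validate the tool thoroughly in a sandbox or test environment before deploying it to production, and carry out any necessary security assessment.

📄 License

Released under the MIT license.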

❓ FAQ

What is pest-plugin-evals?

pest-plugin-evals is an open-source tool written in PHP: a PestPHP plugin for evaluating Laravel AI SDK agents, with support for semantic evaluation and LLM-as-judge evaluation. It has 11 stars on GitHub.

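💡 AI Skill Hub review

AI Skill Hub review: the core functionality of the PestPHP AI Evaluation Plugin is complete and of good quality. For automation engineers and operations staff, it is worth adding to a personal toolkit. Try it in a non-production environment first, then roll it out gradually.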

🌐 Original information

Original name	`pest-plugin-evals`
Original description	Open-source AI workflow: A PestPHP plugin for evaluating Laravel AI SDK agents with LLM-as-judge, semanti. ⭐11 · PHP
Topics	`workflow`, `ai-sdk`, `evals`, `laravel`, `pest`, `test`, `php`
GitHub	https://github.com/shipfastlabs/pest-plugin-evals
License	MIT
Language	PHP

🔗 Original source

🐙 GitHub repository https://github.com/shipfastlabs/pest-plugin-evals

Listed: 2026-05-24 · Updated: 2026-05-30 · License: MIT · AI Skill Hub does not legally vouch for the accuracy of third-party content.
