AgentEval is the comprehensive .NET evaluation toolkit for AI agents — tool usage validation, RAG quality metrics, stochastic evaluation, behavioral policies, and model comparison — built first for Microsoft Agent Framework (MAF).
What RAGAS and DeepEval do for Python, AgentEval does for .NET.
| Repository | Description |
|---|---|
| AgentEval | The core evaluation toolkit — metrics, assertions, tracing, benchmarking |
| AgentEval.Cli | Command-line interface — evaluate from your terminal |
| AgentEval.Gatekeeper | AI-powered PR quality gate — every agent earns its merge |
```shell
dotnet add package AgentEval
```
```csharp
// Evaluate whether your AI agent calls the right tools
result.ToolUsage!.Should()
    .HaveCalledTool("SearchFlights")
    .BeforeTool("BookFlight")
    .And()
    .HaveNoErrors();
```

We welcome contributions! Check out each repository's CONTRIBUTING.md for guidelines.
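The quick-start snippet asserts against a `result` object. A minimal sketch of how such a result might be produced — note that `AgentEvaluator`, `EvaluateAsync`, and `EvaluationResult` are hypothetical, illustrative names, not confirmed AgentEval API:

```csharp
using AgentEval; // hypothetical namespace for this sketch

// Wrap the agent under test and run a single scenario, capturing
// tool calls and errors into an evaluation result to assert against.
var evaluator = new AgentEvaluator(myAgent);               // hypothetical type
EvaluationResult result = await evaluator.EvaluateAsync(
    "Book me a flight to Oslo next Friday");               // hypothetical method

// The fluent assertions shown above then run against this result.
```

The exact entry point will differ; see the AgentEval repository's documentation for the real API.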
Stop guessing if your AI agent works. Start proving it.
MIT Licensed · Built with ❤️ for the .NET community
