Multi-Agent Testing Environment — AI Agent Quality Testing Platform
mate connects to AI agents, runs automated evaluation suites against them, tracks quality over time, and red-teams them for adversarial vulnerabilities. It supports Microsoft Copilot Studio, Azure AI Foundry*, generic HTTP agents*, and Parloa* out of the box, with a modular architecture for custom connectors, judges, and red-team providers.
* Roadmap items
Register multiple agents and execute repeatable test suites against each target.
Combine deterministic rubrics, LLM-based scoring, CopilotStudioJudge Mode with rubrics, and hybrid evaluation strategies.
Probe for jailbreaks, prompt injection, hallucinations, privacy leaks, and more.
Run locally with Docker Compose or deploy with Bicep to Azure Container Apps.
Created by Holger Imbery. Connect on GitHub or LinkedIn to learn more.
The latest release package and changelog are available on GitHub.