mate365

Multi-Agent Testing Environment — AI Agent Quality Testing Platform

.NET 9 · Blazor Server · Azure Container Apps
Project Overview

mate connects to AI agents, runs automated evaluation suites against them, tracks quality over time, and red-teams them for adversarial vulnerabilities. It supports Microsoft Copilot Studio out of the box; connectors for Azure AI Foundry*, generic HTTP agents*, and Parloa* are planned. A modular architecture allows custom connectors, judges, and red-team providers.

* Roadmap items

At a Glance
8 Attack Categories
4 Connector Types
4 Judge Modes
API: OpenAPI + Scalar

Multi-agent evaluation

Register multiple agents and execute repeatable test suites against each target.

Pluggable judge modules

Combine deterministic rubrics, LLM scoring, a CopilotStudioJudge mode with rubrics, and hybrid evaluation strategies.
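
As a rough illustration of what a hybrid strategy can look like, the sketch below blends a deterministic rubric score with a score from an LLM-style judge. It is not mate's actual API: `RubricCheck`, `rubric_score`, `hybrid_score`, and the `llm_judge` callable are hypothetical names chosen for this example, and the LLM judge is stubbed with a fixed value.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RubricCheck:
    """Hypothetical deterministic check: the response must contain `phrase`."""
    phrase: str
    weight: float = 1.0


def rubric_score(response: str, checks: list[RubricCheck]) -> float:
    """Weighted fraction of rubric phrases found in the response (0.0 to 1.0)."""
    total = sum(c.weight for c in checks)
    if total == 0:
        return 0.0
    text = response.lower()
    hit = sum(c.weight for c in checks if c.phrase.lower() in text)
    return hit / total


def hybrid_score(
    response: str,
    checks: list[RubricCheck],
    llm_judge: Callable[[str], float],
    rubric_weight: float = 0.5,
) -> float:
    """Blend the deterministic rubric score with an LLM-style score."""
    deterministic = rubric_score(response, checks)
    # Clamp the judge output so a misbehaving judge cannot leave the 0..1 range.
    llm = min(1.0, max(0.0, llm_judge(response)))
    return rubric_weight * deterministic + (1.0 - rubric_weight) * llm


# Usage: a stub stands in for a real LLM call (assumption for this sketch).
checks = [RubricCheck("refund", weight=2.0), RubricCheck("30 days", weight=1.0)]
stub_judge = lambda response: 0.8
score = hybrid_score("You can request a refund at any time.", checks, stub_judge)
```

The rubric half stays reproducible across runs, while the weighted LLM half captures qualities that keyword checks miss; `rubric_weight` controls the trade-off.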

Red teaming (roadmap)

Probe for jailbreaks, prompt injection, hallucinations, privacy leaks, and more.

Docker and Azure ready

Run locally with Docker Compose or deploy with Bicep to Azure Container Apps.

Latest Release

Open the newest release package and changelog directly on GitHub.

Explore the project
Use the repository for source code and releases, or open the wiki for setup and architecture docs.
GitHub Repository · Documentation Wiki · Latest Release