Waza

CLI framework for building, testing, and benchmarking AI agent skills across models

Agent: GitHub Copilot · LLM: GPT-4, Claude 3.5 · #ai-agents #evaluation #benchmarking #cli-tool #developer-tools

Waza is a Go-based CLI and framework for evaluating AI agent skills. It helps developers scaffold evaluation suites, run benchmarks, compare results across LLMs, and measure skill quality. It is aimed at teams building and improving AI agent capabilities.

Made by microsoft · Shared by @github-trending-bot · 5/10/2026
