Learning and using Agentic QE Fleet

01

Why it was built

The problem every team hits: tests that don’t keep up with code.

Before: chaos of broken tests, red X marks, coverage gaps. After: organized green checkmarks, golden shield reading 96 percent, clean and confident. — Before Agentic QE: manual testing can't keep up. After: 60 AI agents auto-generate, analyze, and gate quality across the full surface.

Before: manual testing can't keep up with shipping speed. After: 60 AI agents auto-generate, analyze, and gate quality across the full surface.

Every team ships faster than their tests can follow. New features land daily; test suites fall behind. Coverage gaps hide in the cracks. Flaky tests erode trust until the team stops reading CI results.

Dragan Spiridonov built Agentic QE because existing solutions attacked one slice of the problem — generation or coverage or flakiness — but never the whole surface. A coordinated fleet of AI agents, each a specialist in one QE domain, can cover the full lifecycle: generate, analyze, execute, learn, and gate.

The project is open-source (MIT), free to use, fork, and contribute to. It is based on the Agentic QE Framework created by Dragan Spiridonov.

View the source → github.com/proffesor-for-testing/agentic-qe

02

What problem it solves

Tests fall behind, coverage rots, flaky tests erode trust.

Six problems that plague every team's quality process, each addressed by a dedicated domain of specialized AI agents.

Modern codebases grow faster than any human team can test. Agentic QE attacks the full quality surface at once, not just one slice. Each problem above maps to a dedicated agent domain with specialists trained for that exact task.

03

Why now

Coding agents exist — now QE needs agents too.

Three technological forces converging: coding agents accelerate dev, MCP enables tool interop, free local models cut costs to zero for routine tasks.

Coding agents like Claude Code, Cursor, and Copilot accelerate development. Code output has increased 10x for many teams. But quality engineering hasn’t kept pace. The Model Context Protocol (MCP) standard now lets AI tools share context, and free local models make routine test generation cost $0.

AQE plugs into all three: one aqe init --auto and your coding agent has 60 QE specialists on call. No new IDE, no new workflow. It meets developers where they already work.

04

How it works

A fleet of 60 specialized agents, coordinated by a Queen.

The Queen Coordinator decomposes your request, fans out to domain specialists running in parallel, then synthesizes results with quality gates.

Domain	Agents	What they do
Test Generation	4	Generate tests, test-driven development (TDD) workflows, mutation testing, property testing
Test Execution	3	Run tests in parallel, handle retries, integration testing
Coverage Analysis	2	Find untested code, prioritize by risk
Quality Assessment	4	Go/no-go gates, risk scoring, adversarial review
Defect Intelligence	4	Predict bugs, find root causes, fix flaky tests
Requirements	2	Validate testability, generate behavior-driven scenarios
Code Intelligence	4	Knowledge graphs, semantic search, change impact
Security	3	Static & dynamic security analysis, compliance audits, exploit validation
Contracts	2	API contracts, GraphQL schema testing
Visual & Accessibility	3	Visual regression, accessibility compliance, viewport testing
Chaos & Performance	3	Fault injection, load testing, performance validation
Learning	4	Cross-project learning, pattern discovery, metrics
Enterprise	7	SAP systems, legacy web services, message queues (Kafka, RabbitMQ), enterprise integrations

Three tiers of intelligent cost routing: a small green lane for simple tasks (fast and cheap), a medium amber lane for moderate tasks, and a large copper lane for critical reasoning — a sorting funnel directs each task to the right tier. — TinyDancer sorts tasks by complexity and routes them to the cheapest model that can handle them — from free local models to full Opus reasoning.

TinyDancer scores task complexity (0–100) and routes to the cheapest model that can handle it. Free-tier mode uses local Ollama for routine tasks at $0.

05

What solved looks like

Ask for tests and get them — generated, validated, gated.

A glowing golden terminal showing test results: green checkmarks flowing down, a circular progress gauge reading 96 percent in warm amber, and neat test file cards below. — What "solved" looks like: ask for tests in plain English, get 48 tests at 96% coverage, quality-gated and ready to commit.

A real interaction: the Queen spawns 4 specialists in parallel, each handling their domain. Result: 48 tests at 96.2% coverage, quality-gated.

75 Skills, Trust-Tiered

75 skills rated by trust tier. 49 are fully verified with test suites. Tier-0 untested skills are excluded by policy.

06

Platform support

One server, 11 coding agent platforms.

One MCP server connects to 11 coding agent platforms. Set up all at once or add platforms one by one.

# Set up all platforms at once
aqe init --auto --with-all-platforms

# Or add a platform later
aqe platform setup cursor
aqe platform list       # show install status
aqe platform verify cursor  # validate config

LLM Providers

AQE’s HybridRouter auto-detects available providers. Set one or more API keys and it picks the best route:

Claude (ANTHROPIC_API_KEY) — default
OpenAI (OPENAI_API_KEY)
Gemini (GOOGLE_AI_API_KEY) — free tier available
OpenRouter (OPENROUTER_API_KEY) — 300+ models
Ollama — local, free, offline
Azure OpenAI, Bedrock

07

How to start

Three commands and you’re running.

Three commands: install, initialize, use. MCP tools are available immediately in your coding agent.

Quick Start

# Install globally from npm (public, free)
npm install -g agentic-qe

# Initialize your project (auto-detects your tech stack and connects to your coding agent)
cd your-project && aqe init --auto

# That's it — 60 QE agents are available immediately in Claude Code

All packages are public on npm: npmjs.com/package/agentic-qe

Alternative: Claude Code Plugin

# From a local checkout
git clone https://github.com/proffesor-for-testing/agentic-qe.git
claude --plugin-dir ./agentic-qe/plugins/agentic-qe-fleet

Development Setup

git clone https://github.com/proffesor-for-testing/agentic-qe.git
cd agentic-qe && npm install && npm run build && npm test -- --run

Full source on GitHub → github.com/proffesor-for-testing/agentic-qe

08

Use cases

From test generation to chaos engineering to security audits.

Test Generation

Generate a full test suite

qe-test-architect generates unit, integration, property-based, and behavior-driven tests across 15+ frameworks.

"Create tests for PaymentService with 95% coverage"

Coverage Analysis

Find coverage gaps by risk

qe-coverage-specialist finds untested code and ranks it by risk, not line count.

"Find coverage gaps in src/ and prioritize"

Flaky Tests

Hunt and fix flaky tests

qe-flaky-hunter uses ML detection, root-cause analysis, and stabilization fixes.

"Stabilize flaky tests in tests/integration/"

Security

Security audit

qe-security-scanner: static and dynamic security analysis, dependency scanning, API security, and chaos testing.

"Run security scan on the auth module"

TDD

Red-Green-Refactor

qe-tdd-specialist coordinates 5 subagents through the full TDD cycle.

"Implement UserAuth with full TDD cycle"

Chaos Engineering

Fault injection

qe-chaos-engineer injects network partitions, latency spikes, resource exhaustion.

"Inject network partitions into order flow"

09

Get it

All links are public. No login, no paywall, no lock-in.

One download splitting into two halves: For You (a person reading a primer and watching a video) and For Your AI (a glowing brain with a magnifying glass searching it). — Everything is public: source on GitHub, package on npm, releases freely downloadable. No login, no paywall, no lock-in.

Everything about Agentic QE is publicly available. No Vercel login, no gated downloads, no auth walls. Every link below goes to a public URL you can access right now.

Source Code

Command-Line Reference

aqe init [--auto]              # Initialize project
aqe agent list                 # List available agents
aqe fleet status               # Fleet health
aqe learning stats             # Learning statistics
aqe brain export/import        # Portable intelligence
aqe platform list/setup/verify # Manage platforms
aqe health                     # System health
aqe code index src/            # Index codebase
aqe code search "auth"         # Semantic search

Quality is
the product.

Why it was built

What problem it solves

Why now

How it works

What solved looks like

75 Skills, Trust-Tiered

Platform support

LLM Providers

How to start

Quick Start

Alternative: Claude Code Plugin

Development Setup

Use cases

Generate a full test suite

Find coverage gaps by risk

Hunt and fix flaky tests

Security audit

Red-Green-Refactor

Fault injection

Get it

GitHub Repository

npm Package

v3.11.1

Platform Setup Guide

Issues & Discussions

Contributors

Command-Line Reference

Quality isthe product.

Why it was built

What problem it solves

Why now

How it works

What solved looks like

75 Skills, Trust-Tiered

Platform support

LLM Providers

How to start

Quick Start

Alternative: Claude Code Plugin

Development Setup

Use cases

Generate a full test suite

Find coverage gaps by risk

Hunt and fix flaky tests

Security audit

Red-Green-Refactor

Fault injection

Get it

GitHub Repository

npm Package

v3.11.1

Platform Setup Guide

Issues & Discussions

Contributors

Command-Line Reference

Quality is
the product.