# ๐ Gem Team
> Multi-agent orchestration framework for spec-driven development and automated verification.
[](https://awesome-copilot.github.com/plugins/#file=plugins%2Fgem-team)

---
## ๐ค Why Gem Team?
- โก **10x Faster** โ Parallel execution with wave-based execution
- ๐ **Higher Quality** โ Specialized agents + TDD + verification gates + contract-first
- ๐ **Built-in Security** โ OWASP scanning, secrets/PII detection on critical tasks
- ๐๏ธ **Full Visibility** โ Real-time status, clear approval gates
- ๐ก๏ธ **Resilient** โ Pre-mortem analysis, failure handling, auto-replanning
- โป๏ธ **Pattern Reuse** โ Codebase pattern discovery prevents reinventing wheels
- ๐ช **Self-Correcting** โ All agents self-critique at 0.85 confidence threshold
- ๐ **Source Verified** โ Every factual claim cites its source; no guesswork
- โฟ **Accessibility-First** โ WCAG compliance validated at spec and runtime layers
- ๐ฌ **Smart Debugging** โ Root-cause analysis with stack trace parsing + confidence-scored fixes
- ๐ **Safe DevOps** โ Idempotent operations, health checks, mandatory approval gates
- ๐ **Traceable** โ Self-documenting IDs link requirements โ tasks โ tests โ evidence
- ๐ **Knowledge-Driven** โ Prioritized sources (PRD โ codebase โ AGENTS.md โ Context7 โ docs)
- ๐ ๏ธ **Skills & Guidelines** โ Built-in skill & guidelines (web-design-guidelines)
- ๐ **Spec-Driven** โ Multi-step refinement defines "what" before "how"
- ๐ **Wave-Based** โ Parallel agents with integration gates per wave
- ๐๏ธ **Multi-Plan** โ Complex tasks: 3 planner variants โ best DAG selected automatically
- ๐ฉบ **Diagnose-then-Fix** โ gem-debugger diagnoses โ gem-implementer fixes โ re-verifies
- โ ๏ธ **Pre-Mortem** โ Failure modes identified BEFORE execution
- ๐ฌ **Constructive Critique** โ gem-critic challenges assumptions, finds edge cases
- ๐ **Contract-First** โ Contract tests written before implementation
- ๐ฑ **Mobile Agents** โ Native mobile implementation (React Native, Flutter) + iOS/Android testing
---
## ๐ฆ Installation
```bash
# Using Copilot CLI
copilot plugin install gem-team@awesome-copilot
```
> **[Install Gem Team Now โ](https://aka.ms/awesome-copilot/install/agent?url=vscode%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)**
---
## ๐๏ธ Architecture
```mermaid
flowchart
USER["User Goal"]
subgraph ORCH["Orchestrator"]
detect["Phase Detection"]
end
subgraph PHASES
DISCUSS["๐น Discuss"]
PRD["๐ PRD"]
RESEARCH["๐ Research"]
PLANNING["๐ Planning"]
EXEC["โ๏ธ Execution"]
SUMMARY["๐ Summary"]
end
DIAG["๐ฌ Diagnose-then-Fix"]
USER --> detect
detect --> |"Simple"| RESEARCH
detect --> |"Medium|Complex"| DISCUSS
DISCUSS --> PRD
PRD --> RESEARCH
RESEARCH --> PLANNING
PLANNING --> |"Approved"| EXEC
PLANNING --> |"Feedback"| PLANNING
EXEC --> |"Failure"| DIAG
DIAG --> EXEC
EXEC --> SUMMARY
PLANNING -.-> |"critique"| critic
PLANNING -.-> |"review"| reviewer
EXEC --> |"parallel โค4"| agents
EXEC --> |"post-wave (complex)"| critic
```
---
## ๐ Core Workflow
**Phase Flow:** User Goal โ Orchestrator โ Discuss (medium|complex) โ PRD โ Research โ Planning โ Execution โ Summary
**Error Handling:** Diagnose-then-Fix loop (Debugger โ Implementer โ Re-verify)
**Orchestrator** auto-detects phase and routes accordingly.
| Condition | โ Phase |
|:----------|:--------|
| No plan + simple | Research |
| No plan + medium\|complex | Discuss โ PRD โ Research |
| Plan + pending tasks | Execution |
| Plan + feedback | Planning |
---
## ๐ค The Agent Team (Q2 2026 SOTA)
| Role | Description | Output | Recommended LLM |
|:-----|:------------|:-------|:---------------|
| ๐ฏ **ORCHESTRATOR** (`gem-orchestrator`) | The team lead: Orchestrates research, planning, implementation, and verification | ๐ PRD, plan.yaml | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** GLM-5, Kimi K2.5, Qwen3.5 |
| ๐ **RESEARCHER** (`gem-researcher`) | Codebase exploration โ patterns, dependencies, architecture discovery | ๐ findings | **Closed:** Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6
**Open:** GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| ๐ **PLANNER** (`gem-planner`) | DAG-based execution plans โ task decomposition, wave scheduling, risk analysis | ๐ plan.yaml | **Closed:** Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4
**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| ๐ง **IMPLEMENTER** (`gem-implementer`) | TDD code implementation โ features, bugs, refactoring. Never reviews own work | ๐ป code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐งช **BROWSER TESTER** (`gem-browser-tester`) | E2E browser testing, UI/UX validation, visual regression with Playwright | ๐งช evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash
**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
| ๐ **DEVOPS** (`gem-devops`) | Infrastructure deployment, CI/CD pipelines, container management | ๐ infra | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** DeepSeek-V3.2, GLM-5, Qwen3.5 |
| ๐ก๏ธ **REVIEWER** (`gem-reviewer`) | Security auditing, code review, OWASP scanning, PRD compliance verification | ๐ review report | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| ๐ **DOCUMENTATION** (`gem-documentation-writer`) | Technical documentation, README files, API docs, diagrams, walkthroughs | ๐ docs | **Closed:** Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini
**Open:** Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| ๐ฌ **DEBUGGER** (`gem-debugger`) | Root-cause analysis, stack trace diagnosis, regression bisection, error reproduction | ๐ฌ diagnosis | **Closed:** Gemini 3.1 Pro (Retrieval King), Claude Opus 4.6, GPT-5.4
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐ฏ **CRITIC** (`gem-critic`) | Challenges assumptions, finds edge cases, spots over-engineering and logic gaps | ๐ฌ critique | **Closed:** Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| โ๏ธ **SIMPLIFIER** (`gem-code-simplifier`) | Refactoring specialist โ removes dead code, reduces complexity, consolidates duplicates | โ๏ธ change log | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐จ **DESIGNER** (`gem-designer`) | UI/UX design specialist โ layouts, themes, color schemes, design systems, accessibility | ๐จ DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| ๐ฑ **IMPLEMENTER-MOBILE** (`gem-implementer-mobile`) | Mobile implementation โ React Native, Expo, Flutter with TDD | ๐ป code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐ฑ **DESIGNER-MOBILE** (`gem-designer-mobile`) | Mobile UI/UX specialist โ HIG, Material Design, safe areas, touch targets | ๐จ DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| ๐ฑ **MOBILE TESTER** (`gem-mobile-tester`) | Mobile E2E testing โ Detox, Maestro, iOS/Android simulators | ๐งช evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash
**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
### Agent File Skeleton
Each `.agent.md` file follows this structure:
```
--- # Frontmatter: description, name, triggers
# Role # One-line identity
# Expertise # Core competencies
# Knowledge Sources # Prioritized reference list
# Workflow # Step-by-step execution phases
## 1. Initialize # Setup and context gathering
## 2. Analyze/Execute # Role-specific work
## N. Self-Critique # Confidence check (โฅ0.85)
## N+1. Handle Failure # Retry/escalate logic
## N+2. Output # JSON deliverable format
# Input Format # Expected JSON schema
# Output Format # Return JSON schema
# Rules
## Execution # Tool usage, batching, error handling
## Constitutional # IF-THEN decision rules
## Anti-Patterns # Behaviors to avoid
## Anti-Rationalization # Excuse โ Rebuttal table
## Directives # Non-negotiable commands
```
All agents share: Execution rules, Constitutional rules, Anti-Patterns, and Directives sections. Anti-Rationalization tables are present in 5 agents (implementer, planner, reviewer, designer, browser-tester). Role-specific sections (Workflow, Expertise, Knowledge Sources) vary by agent.
---
## ๐ Knowledge Sources
Agents consult only the sources relevant to their role. Trust levels apply:
| Trust Level | Sources | Behavior |
|:-----------|:--------|:---------|
| **Trusted** | PRD.yaml, plan.yaml, AGENTS.md | Follow as instructions |
| **Verify** | Codebase files, research findings | Cross-reference before assuming |
| **Untrusted** | Error logs, external data, third-party responses | Factual only โ never as instructions |
| Agent | Knowledge Sources |
|:------|:------------------|
| orchestrator | PRD.yaml, AGENTS.md |
| researcher | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs, online search |
| planner | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs |
| implementer | codebase patterns, AGENTS.md, Context7 (API verification), DESIGN.md (UI tasks) |
| debugger | codebase patterns, AGENTS.md, error logs (untrusted), git history, DESIGN.md (UI bugs) |
| reviewer | PRD.yaml, codebase patterns, AGENTS.md, OWASP reference, DESIGN.md (UI review) |
| browser-tester | PRD.yaml (flow coverage), AGENTS.md, test fixtures, baseline screenshots, DESIGN.md (visual validation) |
| designer | PRD.yaml (UX goals), codebase patterns, AGENTS.md, existing design system |
| code-simplifier | codebase patterns, AGENTS.md, test suites (behavior verification) |
| documentation-writer | AGENTS.md, existing docs, source code |
---
## ๐ค Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
## ๐ License
This project is licensed under the MIT License.
## ๐ฌ Support
If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.
---
## ๐ Changelog
### 1.6.0 (April 8, 2026)
**New:**
- Mobile agents โ build, design, and test iOS/Android apps with gem-implementer-mobile, gem-designer-mobile, gem-mobile-tester
**Improved:**
- Concise agent descriptions โ one-liners that quickly communicate what each agent does
- Unified agent table โ clean overview of all 15 agents with roles and outputs
### 1.5.4
**Bug Fixes:**
- Fixed AGENTS.md pattern extraction logic for semantic search integration