[gem-team] token, tool call and request optimziations (#1625)

* feat: move to xml top tags for ebtter llm parsing and structure - Orchestrator is now purely an orchestrator - Added new calrify phase for immediate user erequest understanding and task parsing before workflow - Enforce review/ critic to plan instea dof 3x plan generation retries for better error handling and self-correction - Add hins to all agents - Optimize defitons for simplicity/ conciseness while maintaining clarity * feat(critic): add holistic review and final review enhancements * chore: bump marketplace version to 1.10.0 - Updated `.github/plugin/marketplace.json` to version 1.10.0. - Revised `agents/gem-browser-tester.agent.md` to improve the BROWSER TESTER role documentation with a clearer structure, explicit role header, and organized knowledge sources section. * refactor: streamline verification and self‑critique steps across browser‑tester, code‑simplifier, critic, and debugger agents * feat(researcher): improve mode selection workflow and research implementation details - Refine **Clarify** mode description to emphasize minimal research for detecting ambiguities. - Reorder steps and clarify intent detection (`continue_plan`, `modify_plan`, `new_task`). - Add explicit sub‑steps for presenting architectural and task‑specific clarifications. - Update **Research** mode section with clearer initialization workflow. - Simplify and reformat the confidence calculation comments for readability. - Minor formatting tweaks and added blank lines for visual separation. * Update gem-orchestrator.agent.md * docs(gem-browser-tester): enhance BROWSER TESTER role description and clarify workflow steps- Expanded the BROWSER TESTER role with explicit responsibilities and constraints - Reformatted the Knowledge Sources list using consistent numbered items for readability- Updated the Workflow section to detail initialization, execution, and teardown steps more clearly- Refined the Output Format and Research Format Guide structures to use proper markdown syntax - Improved overall formatting and consistency of documentation for better maintainability * docs: fix typo in delegation description * feat(metadata): bump marketplace version to 1.15.0 and enrich agent documentation The marketplace plugin metadata has been updated to reflect the newer self‑learning multi‑agent orchestration description and the version hasbeen upgraded from 1.13.0 to 1.15.0. Documentation for the following agents has been expanded with new sections: - **gem-browser-tester.agent.md** – added an “Output” section outlining strict JSON output rules and a new “I/O Optimization” section covering parallel batch operations, read efficiency, and scoping techniques. - **gem-code-simplifier.agent.md** – similarly added “Output” and “I/O Optimization” sections describing concisely formatted JSON, parallel I/O, and batch processing best practices. - **gem-reviewer.agent.md** – updated its output format and added detailed guidance on review scope, anti‑patterns, and I/O strategies. These changes provide clearer usage instructions and performance‑focused recommendations for the agents while aligning the marketplace metadata with the updated version. * feat(plugin): add agents list and README for gem-team plugin * docs: update readme * chore: match version with gem-team * docs: standardize execution order and output format sections in agent documentation * docs: fix typo in agent documentation files * refactor: replace "framework" with "harness" in gem‑team marketplace, plugin, and README descriptions
2026-06-24 00:17:35 +00:00 · 2026-05-06 05:01:10 +05:00
parent 2231c315d0
commit ef40bff1da
19 changed files with 693 additions and 338 deletions
@@ -1,220 +1,181 @@
-# 💎 Gem Team
->
-> Multi-agent orchestration framework for spec-driven development and automated verification.
->
-> **Turning Model Quality into System Quality.**
->
+# Gem Team

-![VS Code](https://img.shields.io/badge/VS_Code-5A6D7C?style=flat)
-![VS Code Insiders](https://img.shields.io/badge/VS_Code_Insiders-5A6D7C?style=flat)
-![Copilot CLI](https://img.shields.io/badge/Copilot_CLI-5A6D7C?style=flat)
-![Cursor](https://img.shields.io/badge/Cursor-5A6D7C?style=flat)
-![OpenCode](https://img.shields.io/badge/OpenCode-5A6D7C?style=flat)
-![Claude Code](https://img.shields.io/badge/Claude_Code-5A6D7C?style=flat)
-![Windsurf](https://img.shields.io/badge/Windsurf-5A6D7C?style=flat)
+Self-Learning Multi-agent orchestration harness for spec-driven development and automated verification.
+
+[![Support Me](https://img.shields.io/badge/patreon-000000?logo=patreon&logoColor=FFFFFF&style=flat)](https://patreon.com/mubaidr)
+
+## Quick Start
+
+See [all supported installation options](#installation) below.

 ---

-## 🚀 Quick Start
+## Contents

-See [all installation options](#-installation) below.
+- [Quick Start](#quick-start)
+- [Why Gem Team?](#why-gem-team)
+- [Harness Architecture](#harness-architecture)
+- [Installation](#installation)
+- [The Agent Team](#the-agent-team)
+- [Knowledge Sources](#knowledge-sources)
+- [Contributing](#contributing)

 ---

-## 🤔 Why Gem Team?
+## Why Gem Team?

- ⚡ **4x Faster** — Parallel execution with wave-based execution
- 🏆 **Higher Quality** — Specialized agents + TDD + verification gates + contract-first
- 🔒 **Built-in Security** — OWASP scanning, secrets/PII detection on critical tasks
- 👁️ **Full Visibility** — Real-time status, clear approval gates
- 🛡️ **Resilient** — Pre-mortem analysis, failure handling, auto-replanning
- ♻️ **Pattern Reuse** — Codebase pattern discovery prevents reinventing wheels
- 📏 **Established Patterns** — Uses library/framework conventions over custom implementations
- 🪞 **Self-Correcting** — All agents self-critique at 0.85 confidence threshold
- 🧠 **Context Scaffolding** — Maps large-scale dependencies _before_ the model reads code, preventing context-loss in legacy repos
- ⚖️ **Intent vs. Compliance** — Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
- 📋 **Source Verified** — Every factual claim cites its source; no guesswork
- ♿ **Accessibility-First** — WCAG compliance validated at spec and runtime layers
- 🔬 **Smart Debugging** — Root-cause analysis with stack trace parsing + confidence-scored fixes
- 🚀 **Safe DevOps** — Idempotent operations, health checks, mandatory approval gates
- 🔗 **Traceable** — Self-documenting IDs link requirements → tasks → tests → evidence
- 📚 **Knowledge-Driven** — Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
- 🛠️ **Skills & Guidelines** — Built-in skill & guidelines (web-design-guidelines)
- 📐 **Spec-Driven** — Multi-step refinement defines "what" before "how"
- 🌊 **Wave-Based** — Parallel agents with integration gates per wave
- 🗂️ **Verified-Plan** — Complex tasks: Plan → Verification → Critic
- 🔎 **Final Review** — Optional user-triggered comprehensive review of all changed files
- 🩺 **Diagnose-then-Fix** — gem-debugger diagnoses → gem-implementer fixes → re-verifies
- ⚠️ **Pre-Mortem** — Failure modes identified BEFORE execution
- 💬 **Constructive Critique** — gem-critic challenges assumptions, finds edge cases
- 📝 **Contract-First** — Contract tests written before implementation
- 📱 **Mobile Agents** — Native mobile implementation (React Native, Flutter) + iOS/Android testing
+### Performance

-### 🚀 The "System-IQ" Multiplier
+- **4x Faster** — Parallel execution with wave-based execution
+- **Pattern Reuse** — Codebase pattern discovery prevents reinventing wheels

-Raw reasoning isn't enough in single-pass chat. Gem-Team wraps your preferred LLM in a rigid, verification-first loop, fundamentally boosting its effective capability on SWE-benchmarks:
+### Quality & Security

- **For Small Models (e.g., Qwen 1.7B - 8B):** The framework provides the "executive brain." Task decomposition and isolated 50-line chunks can up to **double** their localized debugging success rates.
- **For Reasoning Models (e.g., DeepSeek 3.2):** TDD loops and parallel research stabilize their native file I/O fragility, yielding up to a **+25% lift** in execution reliability.
- **For SOTA Models (e.g., GLM 5.1, Kimi K2.5):** The `gem-reviewer` acts as a noise-filter, pruning verbosity and enforcing strict PRD compliance to prevent over-engineering.
+- **Higher Quality** — Specialized harness agents + TDD + verification gates + contract-first
+- **Built-in Security** — OWASP scanning, secrets/PII detection on critical tasks
+- **Resilient** — Pre-mortem analysis, failure handling, auto-replanning
+- **Accessibility-First** — WCAG compliance validated at spec and runtime layers
+- **Safe DevOps** — Idempotent operations, health checks, mandatory approval gates
+- **Constructive Critique** — gem- critic challenges assumptions, finds edge cases

-### 🎨 Design Support
+### Intelligence

-Gem Team includes specialized design agents with **anti-"AI slop" guidelines** for distinctive, modern aesthetics:
+- **Established Patterns** — Uses library/harness conventions over custom implementations
+- **Source Verified** — Every factual claim cites its source; no guesswork
+- **Knowledge-Driven** — Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
+- **Continuous Learning** — Memory tool persists patterns, gotchas, user preferences across sessions
+- **Auto-Skills** — Agents extract reusable SKILL.md files from successful tasks (high confidence: auto, medium: confirm)
+- **Skills & Guidelines** — Built-in skill & guidelines (web-design-guidelines)

-| Agent | Focus | Key Capabilities |
-|:------|:------|:-----------------|
-| **DESIGNER** | Web UI/UX | Layouts, themes, design systems, accessibility (WCAG), 7 design movements (Brutalism → Maximalism), 5-level elevation system |
-| **DESIGNER-MOBILE** | Mobile UI/UX | iOS HIG, Material 3, safe areas, haptics, platform-specific adaptations of design movements |
+### Process

-**Anti-AI Slop Principles:**
- Distinctive fonts (Cabinet Grotesk, Satoshi, Clash Display — never Inter/Roboto defaults)
- 60-30-10 color strategy with sharp accents
- Break predictable layouts (asymmetric grids, overlap, bento patterns)
- Purposeful motion with orchestrated page loads
- Design movement library: Brutalism, Neo-brutalism, Glassmorphism, Claymorphism, Minimalist Luxury, Retro-futurism, Maximalism
+- **Spec-Driven** — Multi-step refinement defines "what" before "how"
+- **Verified-Plan** — Complex tasks: Plan → Verification → Critic
+- **Traceable** — Self-documenting IDs link requirements → tasks → tests → evidence
+- **Intent vs. Compliance** — Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
+- **Diagnose-then-Fix** — gem-debugger diagnoses → gem-implementer fixes → re-verifies
+- **Pre-Mortem** — Failure modes identified BEFORE execution
+- **Contract-First** — Contract tests written before implementation

-Both agents include quality checklists for generating unique, memorable designs.
+### Token Efficiency
+
+Optimized for reduced LLM token consumption without quality loss:
+
+- **Concise Output** — No preamble, no meta commentary, no verbose explanations
+- **Strict Formats** — JSON/YAML exactly matching schemas — eliminates parse errors and retries
+- **Empty is OK** — Skip empty arrays, nulls, verbose fields where not needed
+- **File-Based** — Researcher/Planner save to YAML files (not all in JSON output)
+- **Learnings** — Empty patterns/conventions unless critical
+
+> **Result:** ~40-60% reduction on output tokens while maintaining quality.
+
+### Design
+
+- **Design Agents** — Dedicated agents for web and mobile UI/UX with anti-"AI slop" guidelines for distinctive aesthetics
+- **Mobile Agents** — Native mobile implementation (React Native, Flutter) + iOS/Android testing

 ---

-## 🔄 Core Workflow
+## Core Concepts

-**Phase Flow:** User Goal → Orchestrator → Discuss (medium|complex) → PRD → Research → Planning → Plan Review (medium|complex) → Execution → Summary → (Optional) Final Review
+### The "System- IQ" Multiplier

-**Error Handling:** Diagnose-then-Fix loop (Debugger → Implementer → Re-verify)
+Raw reasoning isn't enough in single-pass chat. Gem-Team wraps your preferred LLM in a rigid harness with verification-first loops, fundamentally boosting its effective capability on SWE tasks.

-**Orchestrator** auto-detects phase and routes accordingly. Any feedback or steer message is handled to re-plan.
+### Design Support

-| Condition | Phase | Outcome |
-|:----------|:------|:--------|
-| No plan + simple | Research → Planning | Quick execution path |
-| No plan + medium\|complex | Discuss → PRD → Research | Spec-driven approach |
-| Plan + pending tasks | Execution | Wave-based implementation |
-| Plan + feedback | Planning | Replan with steer |
-| Plan + completed | Summary | User decision (feedback / final review / approve) |
-| User requests final review | Final Review | Parallel review by gem-reviewer + gem-critic |
+Gem Team includes specialized design agents with anti-"AI slop" guidelines for distinctive, modern and unique aesthetics with accessibility compliance.
+
+### Triple Learning System
+
+| Type            | Storage        | 1-liner                               |
+| :-------------- | :------------- | :------------------------------------ |
+| **Memory**      | `/memories/`   | Facts & user preferences (auto- save) |
+| **Skills**      | `docs/skills/` | Procedures with code examples         |
+| **Conventions** | `AGENTS.md`    | Static rules (requires approval)      |

 ---

-## 📦 Installation
+## Harness Architecture

-| Method | Command / Link | Docs |
-|:-------|:---------------|:-----|
-| **Code** | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)** | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
-| **Code Insiders** | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode-insiders%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)** | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
-| **APM <br/> (All AI coding agents)** | `apm install mubaidr/gem-team` | [APM Docs](https://microsoft.github.io/apm/) |
-| **Copilot CLI (Marketplace)** | `copilot plugin install gem-team@awesome-copilot` | [CLI Docs](https://github.com/github/copilot-cli) |
-| **Copilot CLI (Direct)** | `copilot plugin install gem-team@mubaidr` | [CLI Docs](https://github.com/github/copilot-cli) |
-| **Windsurf** | `codeium agent install mubaidr/gem-team` | [Windsurf Docs](https://docs.codeium.com/windsurf) |
-| **Claude Code** | `claude plugin install mubaidr/gem-team` | [Claude Docs](https://docs.anthropic.com/en/docs/claude-code) |
-| **OpenCode** | `opencode plugin install mubaidr/gem-team` | [OpenCode Docs](https://opencode.ai/docs/) |
-| **Manual <br/> (Copy agent files)** | VS Code: `~/.vscode/agents/` <br/> VS Code Insiders: `~/.vscode-insiders/agents/` <br/> GitHub Copilot: `~/.github/copilot/agents/` <br/> GitHub Copilot (project): `.github/plugin/agents/` <br/> Windsurf: `~/.windsurf/agents/` <br/> Claude: `~/.claude/agents/` <br/> Cursor: `~/.cursor/agents/` <br/> OpenCode: `~/.opencode/agents/` | — |
-
---
-
-## 🏗️ Architecture
-
-```mermaid
-flowchart
-    USER["User Goal"]
-
-    subgraph ORCH["Orchestrator"]
-        detect["Phase Detection"]
-    end
-
-    subgraph PHASES
-        DISCUSS["🔹 Discuss"]
-        PRD["📋 PRD"]
-        RESEARCH["🔍 Research"]
-        PLANNING["📝 Planning"]
-        EXEC["⚙️ Execution"]
-        SUMMARY["📊 Summary"]
-        FINAL["🔎 Final Review"]
-    end
-
-    DIAG["🔬 Diagnose-then-Fix"]
-
-    USER --> detect
-
-    detect --> |"Simple"| RESEARCH
-    detect --> |"Medium|Complex"| DISCUSS
-
-    DISCUSS --> PRD
-    PRD --> RESEARCH
-    RESEARCH --> PLANNING
-    PLANNING --> |"Approved"| EXEC
-    PLANNING --> |"Feedback"| PLANNING
-    EXEC --> |"Failure"| DIAG
-    DIAG --> EXEC
-    EXEC --> SUMMARY
-    SUMMARY --> |"Review files"| FINAL
-    FINAL --> |"Clean"| SUMMARY
-
-    PLANNING -.-> |"critique"| critic
-    PLANNING -.-> |"review"| reviewer
-
-    EXEC --> |"parallel ≤4"| agents
-    EXEC --> |"post-wave (complex)"| critic
+```text
+User Goal → Orchestrator → [Simple: Research/Plan] or [Complex: Discuss → PRD → Research → Plan → Approve] → Execute (waves) → Summary → Final Review
+                ↓
+            Diagnose → Fix → Re- verify
 ```

 ---

-## 🤖 The Agent Team (Q2 2026 SOTA)
+## Installation

-| Role | Description | Output | Recommended LLM |
-|:-----|:------------|:-------|:---------------|
-| 🎯 **ORCHESTRATOR** | The team lead: Orchestrates research, planning, implementation, and verification | 📋 PRD, plan.yaml | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** GLM-5, Kimi K2.5, Qwen3.5 |
-| 🔍 **RESEARCHER** | Codebase exploration — patterns, dependencies, architecture discovery | 🔍 findings | **Closed:** Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6<br>**Open:** GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
-| 📋 **PLANNER** | DAG-based execution plans — task decomposition, wave scheduling, risk analysis | 📄 plan.yaml | **Closed:** Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
-| 🔧 **IMPLEMENTER** | TDD code implementation — features, bugs, refactoring. Never reviews own work | 💻 code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
-| 🧪 **BROWSER TESTER** | E2E browser testing, UI/UX validation, visual regression with Playwright | 🧪 evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
-| 🚀 **DEVOPS** | Infrastructure deployment, CI/CD pipelines, container management | 🌍 infra | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3.5 |
-| 🛡️ **REVIEWER** | **Zero-Hallucination Filter** — Security auditing, code review, OWASP scanning, PRD compliance verification | 📊 review report | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, DeepSeek-V3.2 |
-| 📝 **DOCUMENTATION** | Technical documentation, README files, API docs, diagrams, walkthroughs | 📝 docs | **Closed:** Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini<br>**Open:** Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
-| 🔬 **DEBUGGER** | Root-cause analysis, stack trace diagnosis, regression bisection, error reproduction | 🔬 diagnosis | **Closed:** Gemini 3.1 Pro (Retrieval King), Claude Opus 4.6, GPT-5.4<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
-| 🎯 **CRITIC** | Challenges assumptions, finds edge cases, spots over-engineering and logic gaps | 💬 critique | **Closed:** Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
-| ✂️ **SIMPLIFIER** | Refactoring specialist — removes dead code, reduces complexity, consolidates duplicates | ✂️ change log | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
-| 🎨 **DESIGNER** | UI/UX design specialist — layouts, themes, color schemes, design systems, accessibility | 🎨 DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
-| 📱 **IMPLEMENTER-MOBILE** | Mobile implementation — React Native, Expo, Flutter with TDD | 💻 code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
-| 📱 **DESIGNER-MOBILE** | Mobile UI/UX specialist — HIG, Material Design, safe areas, touch targets | 🎨 DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
-| 📱 **MOBILE TESTER** | Mobile E2E testing — Detox, Maestro, iOS/Android simulators | 🧪 evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
+| Method                               | Command / Link                                                                                                                                                                                                                                                                                                                                | Docs                                                                                              |
+| :----------------------------------- | :-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :------------------------------------------------------------------------------------------------ |
+| **Code**                             | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.agents)**                                                                                                                                   | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
+| **Code Insiders**                    | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode-insiders%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.agents)**                                                                                                                          | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
+| **APM <br/> (All AI coding agents)** | `apm install mubaidr/gem-team`                                                                                                                                                                                                                                                                                                                | [APM Docs](https://microsoft.github.io/apm/)                                                      |
+| **Copilot CLI (Marketplace)**        | `copilot plugin install gem-team@awesome-copilot`                                                                                                                                                                                                                                                                                             | [CLI Docs](https://github.com/github/copilot-cli)                                                 |
+| **Copilot CLI (Direct)**             | `copilot plugin install gem-team@mubaidr`                                                                                                                                                                                                                                                                                                     | [CLI Docs](https://github.com/github/copilot-cli)                                                 |
+| **Windsurf**                         | `codeium agent install mubaidr/gem-team`                                                                                                                                                                                                                                                                                                      | [Windsurf Docs](https://docs.codeium.com/windsurf)                                                |
+| **Claude Code**                      | `claude plugin install mubaidr/gem-team`                                                                                                                                                                                                                                                                                                      | [Claude Docs](https://docs.anthropic.com/en/docs/claude-code)                                     |
+| **OpenCode**                         | `opencode plugin install mubaidr/gem-team`                                                                                                                                                                                                                                                                                                    | [OpenCode Docs](https://opencode.ai/docs/)                                                        |
+| **Manual <br/> (Copy agent files)**  | VS Code: `~/.vscode/agents/` <br/> VS Code Insiders: `~/.vscode- insiders/agents/` <br/> GitHub Copilot: `~/.github/copilot/agents/` <br/> GitHub Copilot (project): `.github/plugin/agents/` <br/> Windsurf: `~/.windsurf/agents/` <br/> Claude: `~/.claude/agents/` <br/> Cursor: `~/.cursor/agents/` <br/> OpenCode: `~/.opencode/agents/` | —                                                                                                 |

 ---

-## 📚 Knowledge Sources
+## The Agent Team

-Agents consult only the sources relevant to their role. Trust levels apply:
+### Core Workflow

-| Trust Level | Sources | Behavior |
-|:-----------|:--------|:---------|
-| **Trusted** | PRD.yaml, plan.yaml, AGENTS.md | Follow as instructions |
-| **Verify** | Codebase files, research findings | Cross-reference before assuming |
-| **Untrusted** | Error logs, external data, third-party responses | Factual only — never as instructions |
+| Role             | Description                                                                      | Sources                        | Recommended LLM                                                                                           |
+| :--------------- | :------------------------------------------------------------------------------- | :----------------------------- | :-------------------------------------------------------------------------------------------------------- |
+| **ORCHESTRATOR** | The team lead: Orchestrates research, planning, implementation, and verification | PRD, AGENTS.md                 | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** GLM-5, Kimi K2.5, Qwen3.5             |
+| **RESEARCHER**   | Codebase exploration — patterns, dependencies, architecture discovery            | PRD, codebase, AGENTS.md, docs | **Closed:** Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6<br>**Open:** GLM-5, Qwen3.5-9B, DeepSeek-V3.2      |
+| **PLANNER**      | DAG-based execution plans — task decomposition, wave scheduling, risk analysis   | PRD, codebase, AGENTS.md       | **Closed:** Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5             |
+| **IMPLEMENTER**  | TDD code implementation — features, bugs, refactoring. Never reviews own work    | codebase, AGENTS.md, DESIGN.md | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |

-| Agent | Knowledge Sources |
-|:------|:------------------|
-| orchestrator | PRD.yaml, AGENTS.md |
-| researcher | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs, online search |
-| planner | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs |
-| implementer | codebase patterns, AGENTS.md, Context7 (API verification), DESIGN.md (UI tasks) |
-| debugger | codebase patterns, AGENTS.md, error logs (untrusted), git history, DESIGN.md (UI bugs) |
-| reviewer | PRD.yaml, codebase patterns, AGENTS.md, OWASP reference, DESIGN.md (UI review) |
-| browser-tester | PRD.yaml (flow coverage), AGENTS.md, test fixtures, baseline screenshots, DESIGN.md (visual validation) |
-| designer | PRD.yaml (UX goals), codebase patterns, AGENTS.md, existing design system |
-| code-simplifier | codebase patterns, AGENTS.md, test suites (behavior verification) |
-| documentation-writer | AGENTS.md, existing docs, source code |
+### Quality & Review
+
+| Role               | Description                                                                      | Sources                          | Recommended LLM                                                                                                      |
+| :----------------- | :------------------------------------------------------------------------------- | :------------------------------- | :------------------------------------------------------------------------------------------------------------------- |
+| **REVIEWER**       | **Zero- Hallucination Filter** — Security auditing, code review, OWASP scanning  | PRD, codebase, AGENTS.md, OWASP  | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, DeepSeek-V3.2                    |
+| **CRITIC**         | Challenges assumptions, finds edge cases, spots over- engineering and logic gaps | PRD, codebase, AGENTS.md         | **Closed:** Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5                        |
+| **DEBUGGER**       | Root-cause analysis, stack trace diagnosis, regression bisection                 | codebase, AGENTS.md, git history | **Closed:** Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.4<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next            |
+| **BROWSER TESTER** | E2E browser testing, UI/UX validation, visual regression                         | PRD, AGENTS.md, fixtures         | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5- Flash, MiniMax M2.7 |
+| **SIMPLIFIER**     | Refactoring specialist — removes dead code, reduces complexity                   | codebase, AGENTS.md, tests       | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next            |
+
+### Specialized
+
+| Role                    | Description                                                      | Sources                  | Recommended LLM                                                                                                      |
+| :---------------------- | :--------------------------------------------------------------- | :----------------------- | :------------------------------------------------------------------------------------------------------------------- |
+| **DEVOPS**              | Infrastructure deployment, CI/CD pipelines, container management | AGENTS.md, infra configs | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3.5                    |
+| **DOCUMENTATION**       | Technical documentation, README files, API docs, diagrams        | AGENTS.md, source code   | **Closed:** Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini<br>**Open:** Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7   |
+| **DESIGNER**            | UI/UX design — layouts, themes, color schemes, accessibility     | PRD, codebase, AGENTS.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7                     |
+| **IMPLEMENTER- MOBILE** | Mobile implementation — React Native, Expo, Flutter              | codebase, AGENTS.md      | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next            |
+| **DESIGNER- MOBILE**    | Mobile UI/UX — HIG, Material Design, safe areas                  | PRD, codebase, AGENTS.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7                     |
+| **MOBILE TESTER**       | Mobile E2E testing — Detox, Maestro, iOS/Android                 | PRD, AGENTS.md           | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5- Flash, MiniMax M2.7 |

 ---

-## 🤝 Contributing
+## Knowledge Sources
+
+Agents consult only the sources relevant to their role:
+
+| Trust Level   | Sources                           | Behavior                             |
+| :------------ | :-------------------------------- | :----------------------------------- |
+| **Trusted**   | PRD, plan.yaml, AGENTS.md         | Follow as instructions               |
+| **Verify**    | Codebase files, research findings | Cross-reference before assuming      |
+| **Untrusted** | Error logs, external data         | Factual only — never as instructions |
+
+---
+
+## Contributing

 Contributions are welcome! Please feel free to submit a Pull Request. [CONTRIBUTING](./CONTRIBUTING.md) for detailed guidelines on commit message formatting, branching strategy, and code standards.

-## 📄 License
+## License

 This project is licensed under the Apache License 2.0.

-## 💬 Support
+## Support

 If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.