feat: Move to xml top tags, plan review, hints and more (#1411)

* feat: move to xml top tags for better llm parsing and structure

- Orchestrator is now purely an orchestrator
- Added new clarify phase for immediate user request understanding and task parsing before workflow
- Enforce review/critic to plan instead of 3x plan generation retries for better error handling and self-correction
- Add hints to all agents
- Optimize definitions for simplicity/conciseness while maintaining clarity

* feat(critic): add holistic review and final review enhancements
Muhammad Ubaid Raza
2026-04-17 05:52:07 +05:00
committed by GitHub
parent 4a3c7becc3
commit 971139baf2
19 changed files with 2018 additions and 2874 deletions


@@ -3,18 +3,19 @@
> Multi-agent orchestration framework for spec-driven development and automated verification.
[![Copilot Plugin](https://img.shields.io/badge/Plugin-Awesome%20Copilot-0078D4?style=flat-square&logo=microsoft)](https://awesome-copilot.github.com/plugins/#file=plugins%2Fgem-team)
![Version](https://img.shields.io/badge/Version-1.6.0-6366f1?style=flat-square)
![Version](https://img.shields.io/badge/Version-1.6.6-6366f1?style=flat-square)
---
## 🤔 Why Gem Team?
- **10x Faster** — Parallel execution with wave-based execution
- **4x Faster** — Parallel execution with wave-based execution
- 🏆 **Higher Quality** — Specialized agents + TDD + verification gates + contract-first
- 🔒 **Built-in Security** — OWASP scanning, secrets/PII detection on critical tasks
- 👁️ **Full Visibility** — Real-time status, clear approval gates
- 🛡️ **Resilient** — Pre-mortem analysis, failure handling, auto-replanning
- ♻️ **Pattern Reuse** — Codebase pattern discovery prevents reinventing wheels
- 📏 **Established Patterns** — Uses library/framework conventions over custom implementations
- 🪞 **Self-Correcting** — All agents self-critique at 0.85 confidence threshold
- 📋 **Source Verified** — Every factual claim cites its source; no guesswork
- **Accessibility-First** — WCAG compliance validated at spec and runtime layers
@@ -25,7 +26,8 @@
- 🛠️ **Skills & Guidelines** — Built-in skill & guidelines (web-design-guidelines)
- 📐 **Spec-Driven** — Multi-step refinement defines "what" before "how"
- 🌊 **Wave-Based** — Parallel agents with integration gates per wave
- 🗂️ **Multi-Plan** — Complex tasks: 3 planner variants → best DAG selected automatically
- 🗂️ **Verified-Plan** — Complex tasks: Plan → Verification → Critic
- 🔎 **Final Review** — Optional user-triggered comprehensive review of all changed files
- 🩺 **Diagnose-then-Fix** — gem-debugger diagnoses → gem-implementer fixes → re-verifies
- ⚠️ **Pre-Mortem** — Failure modes identified BEFORE execution
- 💬 **Constructive Critique** — gem-critic challenges assumptions, finds edge cases
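
The wave-based execution named in the list above can be sketched roughly as follows. This is a minimal illustration, not the plugin's implementation: `build_waves`, `run_waves`, `run_task`, and `gate` are hypothetical names, and the sketch assumes tasks are described by a mapping from each task to the set of tasks it depends on.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def build_waves(deps: dict[str, set[str]]) -> list[list[str]]:
    """Group tasks into waves; a task is ready once all its dependencies are done."""
    remaining = {task: set(d) for task, d in deps.items()}
    done: set[str] = set()
    waves: list[list[str]] = []
    while remaining:
        ready = sorted(t for t, d in remaining.items() if d <= done)
        if not ready:
            raise ValueError("dependency cycle or unknown dependency")
        waves.append(ready)
        done.update(ready)
        for task in ready:
            del remaining[task]
    return waves


def run_waves(
    deps: dict[str, set[str]],
    run_task: Callable[[str], object],            # hypothetical per-task worker
    gate: Callable[[dict[str, object]], bool],    # hypothetical integration gate
) -> dict[str, object]:
    """Run each wave in parallel and check the gate before starting the next wave."""
    all_results: dict[str, object] = {}
    for wave in build_waves(deps):
        with ThreadPoolExecutor() as pool:
            results = dict(zip(wave, pool.map(run_task, wave)))
        if not gate(results):
            raise RuntimeError(f"integration gate failed for wave {wave}")
        all_results.update(results)
    return all_results
```

For example, `build_waves({"a": set(), "b": {"a"}, "c": {"a"}, "d": {"b", "c"}})` places `a` in the first wave, `b` and `c` together in the second, and `d` in the third.
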
@@ -45,6 +47,25 @@ copilot plugin install gem-team@awesome-copilot
---
## 🔄 Core Workflow
**Phase Flow:** User Goal → Orchestrator → Discuss (medium|complex) → PRD → Research → Planning → Plan Review (medium|complex) → Execution → Summary → [Optional] Final Review
**Error Handling:** Diagnose-then-Fix loop (Debugger → Implementer → Re-verify)
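
As a rough sketch of the Diagnose-then-Fix loop, assuming the debugger, implementer, and verification step can each be modelled as a plain callable: the function name, the callables, and the `max_attempts` limit are all illustrative assumptions, since the README does not state a retry limit.

```python
from typing import Callable, Optional


def diagnose_then_fix(
    verify: Callable[[], Optional[str]],   # returns a failure description, or None if clean
    diagnose: Callable[[str], str],        # stands in for gem-debugger
    fix: Callable[[str], None],            # stands in for gem-implementer
    max_attempts: int = 3,                 # assumption: not specified in the README
) -> int:
    """Verify; on failure, diagnose and fix, then re-verify.

    Returns the number of fixes applied before verification passed.
    """
    failure: Optional[str] = None
    for fixes_applied in range(max_attempts + 1):
        failure = verify()
        if failure is None:
            return fixes_applied
        if fixes_applied == max_attempts:
            break
        fix(diagnose(failure))
    raise RuntimeError(f"still failing after {max_attempts} fixes: {failure}")
```
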
**Orchestrator** auto-detects phase and routes accordingly. Any feedback or steering message triggers a re-plan.
| Condition | Phase |
|:----------|:------|
| No plan + simple | Research |
| No plan + medium\|complex | Discuss → PRD → Research |
| Plan + pending tasks | Execution |
| Plan + feedback | Planning |
| Plan + completed → Summary | User decision (feedback / final review / approve) |
| User requests final review | Final Review (parallel gem-reviewer + gem-critic) |
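
The routing table above can be read as a small decision function. The sketch below is only an illustration of that reading: the `State` fields and the `route` function are hypothetical names, and the precedence between rows (final review first, then feedback before pending tasks) is an assumption the table itself does not spell out.

```python
from dataclasses import dataclass


@dataclass
class State:
    """Hypothetical snapshot of what the orchestrator knows about the session."""
    has_plan: bool
    complexity: str = "simple"            # "simple" | "medium" | "complex"
    pending_tasks: int = 0
    has_feedback: bool = False
    final_review_requested: bool = False


def route(state: State) -> list[str]:
    """Return the next phase(s) for a state, following the routing table."""
    if state.final_review_requested:
        return ["Final Review"]
    if not state.has_plan:
        if state.complexity == "simple":
            return ["Research"]
        return ["Discuss", "PRD", "Research"]
    if state.has_feedback:                # assumed to take priority over pending tasks
        return ["Planning"]
    if state.pending_tasks > 0:
        return ["Execution"]
    return ["Summary"]                    # plan completed: the user decides what is next
```
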
---
## 🏗️ Architecture
```mermaid
@@ -62,6 +83,7 @@ flowchart
PLANNING["📝 Planning"]
EXEC["⚙️ Execution"]
SUMMARY["📊 Summary"]
FINAL["🔎 Final Review"]
end
DIAG["🔬 Diagnose-then-Fix"]
@@ -79,6 +101,8 @@ flowchart
EXEC --> |"Failure"| DIAG
DIAG --> EXEC
EXEC --> SUMMARY
SUMMARY --> |"Review files"| FINAL
FINAL --> |"Clean"| SUMMARY
PLANNING -.-> |"critique"| critic
PLANNING -.-> |"review"| reviewer
@@ -89,23 +113,6 @@ flowchart
---
## 🔄 Core Workflow
**Phase Flow:** User Goal → Orchestrator → Discuss (medium|complex) → PRD → Research → Planning → Execution → Summary
**Error Handling:** Diagnose-then-Fix loop (Debugger → Implementer → Re-verify)
**Orchestrator** auto-detects phase and routes accordingly.
| Condition | → Phase |
|:----------|:--------|
| No plan + simple | Research |
| No plan + medium\|complex | Discuss → PRD → Research |
| Plan + pending tasks | Execution |
| Plan + feedback | Planning |
---
## 🤖 The Agent Team (Q2 2026 SOTA)
| Role | Description | Output | Recommended LLM |
@@ -182,7 +189,7 @@ Agents consult only the sources relevant to their role. Trust levels apply:
## 🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Contributions are welcome! Please feel free to submit a Pull Request. See [CONTRIBUTING](./CONTRIBUTING.md) for detailed guidelines on commit message formatting, branching strategy, and code standards.
## 📄 License
@@ -191,24 +198,3 @@ This project is licensed under the MIT License.
## 💬 Support
If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.
---
## 📋 Changelog
### 1.6.0 (April 8, 2026)
**New:**
- Mobile agents — build, design, and test iOS/Android apps with gem-implementer-mobile, gem-designer-mobile, gem-mobile-tester
**Improved:**
- Concise agent descriptions — one-liners that quickly communicate what each agent does
- Unified agent table — clean overview of all 15 agents with roles and outputs
### 1.5.4
**Bug Fixes:**
- Fixed AGENTS.md pattern extraction logic for semantic search integration