feat: Move to xml top tags, plan review, hints and more (#1411)

* feat: move to XML top tags for better LLM parsing and structure
  - Orchestrator is now purely an orchestrator
  - Added new clarify phase for immediate user request understanding and task parsing before workflow
  - Enforce review/critic on the plan instead of 3x plan generation retries for better error handling and self-correction
  - Add hints to all agents
  - Optimize definitions for simplicity/conciseness while maintaining clarity
* feat(critic): add holistic review and final review enhancements
2026-06-13 03:23:30 +00:00 · 2026-04-17 05:52:07 +05:00
parent 4a3c7becc3
commit 971139baf2
19 changed files with 2018 additions and 2874 deletions
@@ -3,18 +3,19 @@
 > Multi-agent orchestration framework for spec-driven development and automated verification.

 [![Copilot Plugin](https://img.shields.io/badge/Plugin-Awesome%20Copilot-0078D4?style=flat-square&logo=microsoft)](https://awesome-copilot.github.com/plugins/#file=plugins%2Fgem-team)
-![Version](https://img.shields.io/badge/Version-1.6.0-6366f1?style=flat-square)
+![Version](https://img.shields.io/badge/Version-1.6.6-6366f1?style=flat-square)

 ---

 ## 🤔 Why Gem Team?

- ⚡ **10x Faster** — Parallel execution with wave-based execution
+- ⚡ **4x Faster** — Parallel execution with wave-based execution
 - 🏆 **Higher Quality** — Specialized agents + TDD + verification gates + contract-first
 - 🔒 **Built-in Security** — OWASP scanning, secrets/PII detection on critical tasks
 - 👁️ **Full Visibility** — Real-time status, clear approval gates
 - 🛡️ **Resilient** — Pre-mortem analysis, failure handling, auto-replanning
 - ♻️ **Pattern Reuse** — Codebase pattern discovery prevents reinventing wheels
+- 📏 **Established Patterns** — Uses library/framework conventions over custom implementations
 - 🪞 **Self-Correcting** — All agents self-critique at 0.85 confidence threshold
 - 📋 **Source Verified** — Every factual claim cites its source; no guesswork
 - ♿ **Accessibility-First** — WCAG compliance validated at spec and runtime layers
@@ -25,7 +26,8 @@
 - 🛠️ **Skills & Guidelines** — Built-in skill & guidelines (web-design-guidelines)
 - 📐 **Spec-Driven** — Multi-step refinement defines "what" before "how"
 - 🌊 **Wave-Based** — Parallel agents with integration gates per wave
- 🗂️ **Multi-Plan** — Complex tasks: 3 planner variants → best DAG selected automatically
+- 🗂️ **Verified-Plan** — Complex tasks: Plan → Verification → Critic
+- 🔎 **Final Review** — Optional user-triggered comprehensive review of all changed files
 - 🩺 **Diagnose-then-Fix** — gem-debugger diagnoses → gem-implementer fixes → re-verifies
 - ⚠️ **Pre-Mortem** — Failure modes identified BEFORE execution
 - 💬 **Constructive Critique** — gem-critic challenges assumptions, finds edge cases
@@ -45,6 +47,25 @@ copilot plugin install gem-team@awesome-copilot

 ---

+## 🔄 Core Workflow
+
+**Phase Flow:** User Goal → Orchestrator → Discuss (medium|complex) → PRD → Research → Planning → Plan Review (medium|complex) → Execution → Summary → [Optional] Final Review
+
+**Error Handling:** Diagnose-then-Fix loop (Debugger → Implementer → Re-verify)
+
+**Orchestrator** auto-detects phase and routes accordingly. Any feedback or steering message triggers re-planning.
+
+| Condition | Phase |
+|:----------|:------|
+| No plan + simple | Research |
+| No plan + medium\|complex | Discuss → PRD → Research |
+| Plan + pending tasks | Execution |
+| Plan + feedback | Planning |
+| Plan + completed → Summary | User decision (feedback / final review / approve) |
+| User requests final review | Final Review (parallel gem-reviewer + gem-critic) |
+
+---
+
 ## 🏗️ Architecture

 ```mermaid
@@ -62,6 +83,7 @@ flowchart
        PLANNING["📝 Planning"]
        EXEC["⚙️ Execution"]
        SUMMARY["📊 Summary"]
+        FINAL["🔎 Final Review"]
    end

    DIAG["🔬 Diagnose-then-Fix"]
@@ -79,6 +101,8 @@ flowchart
    EXEC --> |"Failure"| DIAG
    DIAG --> EXEC
    EXEC --> SUMMARY
+    SUMMARY --> |"Review files"| FINAL
+    FINAL --> |"Clean"| SUMMARY

    PLANNING -.-> |"critique"| critic
    PLANNING -.-> |"review"| reviewer
@@ -89,23 +113,6 @@ flowchart

 ---

-## 🔄 Core Workflow
-
-**Phase Flow:** User Goal → Orchestrator → Discuss (medium|complex) → PRD → Research → Planning → Execution → Summary
-
-**Error Handling:** Diagnose-then-Fix loop (Debugger → Implementer → Re-verify)
-
-**Orchestrator** auto-detects phase and routes accordingly.
-
-| Condition | → Phase |
-|:----------|:--------|
-| No plan + simple | Research |
-| No plan + medium\|complex | Discuss → PRD → Research |
-| Plan + pending tasks | Execution |
-| Plan + feedback | Planning |
-
---
-
 ## 🤖 The Agent Team (Q2 2026 SOTA)

 | Role | Description | Output | Recommended LLM |
@@ -182,7 +189,7 @@ Agents consult only the sources relevant to their role. Trust levels apply:

 ## 🤝 Contributing

-Contributions are welcome! Please feel free to submit a Pull Request.
+Contributions are welcome! Please feel free to submit a Pull Request. See [CONTRIBUTING](./CONTRIBUTING.md) for detailed guidelines on commit message formatting, branching strategy, and code standards.

 ## 📄 License

@@ -191,24 +198,3 @@ This project is licensed under the MIT License.
 ## 💬 Support

 If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.
-
---
-
-## 📋 Changelog
-
-### 1.6.0 (April 8, 2026)
-
-**New:**
-
- Mobile agents — build, design, and test iOS/Android apps with gem-implementer-mobile, gem-designer-mobile, gem-mobile-tester
-
-**Improved:**
-
- Concise agent descriptions — one-liners that quickly communicate what each agent does
- Unified agent table — clean overview of all 15 agents with roles and outputs
-
-### 1.5.4
-
-**Bug Fixes:**
-
- Fixed AGENTS.md pattern extraction logic for semantic search integration
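
The phase-routing table that this commit adds under Core Workflow can be read as a small decision function. The sketch below is illustrative only and is not taken from the gem-team agent definitions: the `WorkflowState` fields, the `route` function, and the precedence among the plan-related conditions (final review, then feedback, then pending tasks) are all assumptions, since the README table does not state an order.

```python
from dataclasses import dataclass


@dataclass
class WorkflowState:
    """Hypothetical snapshot of what the orchestrator knows (names are assumptions)."""
    has_plan: bool
    complexity: str = "simple"          # "simple", "medium", or "complex"
    pending_tasks: int = 0
    has_feedback: bool = False
    final_review_requested: bool = False


def route(state: WorkflowState) -> list[str]:
    """Return the next phase(s), mirroring the README routing table.

    The order in which plan-related conditions are checked is an assumption.
    """
    if not state.has_plan:
        if state.complexity == "simple":
            return ["Research"]
        # medium | complex: clarify requirements before research
        return ["Discuss", "PRD", "Research"]
    if state.final_review_requested:
        return ["Final Review"]
    if state.has_feedback:
        return ["Planning"]
    if state.pending_tasks > 0:
        return ["Execution"]
    # Plan completed: summarize, then wait for the user's decision
    return ["Summary"]


if __name__ == "__main__":
    print(route(WorkflowState(has_plan=False, complexity="complex")))
    print(route(WorkflowState(has_plan=True, pending_tasks=3)))
```

Checking feedback before pending tasks reflects the README's statement that feedback leads back to Planning; a real orchestrator may weigh these conditions differently.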