136 Commits

Author SHA1 Message Date
Dan Velton
eb7d223446 Add doublecheck plugin: three-layer verification pipeline for AI output (#978)
* Add doublecheck plugin: three-layer verification pipeline for AI output

Adds a new plugin that helps users verify AI-generated content before
acting on it. Designed for sensitive contexts (legal, medical, financial,
compliance) where hallucinations carry real consequences.

Three verification layers:
- Self-Audit: extracts verifiable claims, checks internal consistency
- Source Verification: web searches per claim, produces URLs for human review
- Adversarial Review: assumes errors exist, checks hallucination patterns

Supports persistent mode (auto-verifies every factual response inline)
and one-shot mode (full report on specific text). Confidence ratings:
VERIFIED, PLAUSIBLE, UNVERIFIED, DISPUTED, FABRICATION RISK.

Includes:
- Skill (skills/doublecheck/) with bundled report template
- Agent (agents/doublecheck.agent.md) for interactive verification
- Plugin package (plugins/doublecheck/) bundling both

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Address review: fix tools YAML format, remove materialized artifacts

- Fix tools frontmatter in agents/doublecheck.agent.md to use standard
  YAML list format instead of flow sequence with trailing comma
- Remove plugins/doublecheck/agents/ and plugins/doublecheck/skills/
  from tracking; these paths are in .gitignore as CI-materialized
  artifacts that should not be committed

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-03-12 12:35:43 +11:00
PrimedPaul
623083f7b1 Adds the 'Oracle-to-PostgreSQL Migration Expert' Custom Agent, Asociated Skills, and Plugin Manifest (#950)
* Add the 'Oracle-to-PostgreSQL Migration Expert' Custom Agent, its associated skills, plugin manifest

* Update READMEs using 'npm run build'

* Resolve PR comments:
- Fix BOM characters
- Rerun 'npm run build'
- Clarify timestampz date kind
- Remove consufing text for SELECT INTO exception
- Remove dangerous VB.NET example

* Update README and refcursor handling documentation for clarity and consistency

* Update skills/creating-oracle-to-postgres-master-migration-plan/SKILL.md

Add .slnx to discovery of projects

Co-authored-by: Aaron Powell <me@aaron-powell.com>

---------

Co-authored-by: TCPrimedPaul <paul.delannoy@tc.gc.ca>
Co-authored-by: Aaron Powell <me@aaron-powell.com>
2026-03-11 10:46:06 +11:00
Aymen
15d1720375 Add Python Notebook Sample Builder agent (#945) 2026-03-10 10:43:28 +11:00
Muhammad Ubaid Raza
9239e8e320 feat: Support mulitple browser tools envrionment (#893)
- Make browser tester generic to support for chrome devotols mcp, playwright, agentic browser tools.
- Add Team lead and energetci peronsality to Orchestrator
- Add progress updates between phases/ waves
2026-03-06 08:10:34 +11:00
Muhammad Ubaid Raza
f522ca8a08 feat: (gem-team) PRD/ Steer Support (#868)
* feat: Prd/ steer support

- Add supprot for PRD
- Vscode steer/ queue support
- Consistent artifacts
- Improved parallel running; for researchers too

* chore: improve prd update support

* chore: Make reviewer use prd for compaince

* chore: imrpvoe websearch in researcher

* fix(gem-team): revert gem-team plugin version from 1.5.0 to 1.2.0
2026-03-05 15:43:28 +11:00
Niels Laute
dcc2ee1086 Add WinUI 3 Expert (#865)
* WinUI agent

* Improvements

* Fixes

* Add WinUI 3 unit testing guidance across agent, instructions, and skill

- Expanded Testing section in agent with Unit Test App project setup,
  [TestMethod] vs [UITestMethod] attributes, and code examples
- Added Testing rules to instructions file
- Added Testing Migration section to migration guide skill with
  UWP-to-WinUI 3 test project mapping and updated checklist

Reference: https://learn.microsoft.com/en-us/windows/apps/winui/winui3/testing/create-winui-unit-test-project

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Update agents/winui3-expert.agent.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Copilot tweaks

* Update agents/winui3-expert.agent.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update agents/winui3-expert.agent.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2026-03-04 10:04:56 +11:00
connerlambden
2a180a107b Add Scientific Paper Research agent (BGPT MCP) (#808)
* Add Scientific Paper Research agent (BGPT MCP)

* Run npm run build to update README.agents.md

---------

Co-authored-by: connerlambden <connerlambden12@gmail.com>
2026-03-03 15:36:08 +11:00
François
306a2d30e6 feat: add mentoring-juniors skill for Socratic mentoring methodology (#825)
feat: add Sensei Socratic mentor for junior developers

feat: add Sensei Junior Mentor skill for Socratic mentoring of junior developers
2026-03-03 10:06:32 +11:00
Anders Eide
0164092b2f Azure cloud development import infrastructure (#844)
* Modified tools to make the agent more efficient

* Added skill for importing Azure Infrastructure as Code

* Updated readme

* Some wording improvements

* Added / to skills path endings

* ran npm start
2026-03-02 19:32:24 +11:00
dwizzle204
3c670cdab6 Add Terratest module testing agent 2026-02-27 14:29:53 -06:00
Geoffrey Casaubon
2c4cd8b828 feat: add Nuxt and Vue.js expert agents
Add two new agents for the Vue.js ecosystem:

- nuxt-expert.agent.md: Expert Nuxt Developer agent covering Nuxt 3,
  Nitro, rendering modes, data fetching, and legacy Nuxt 2 compatibility.
- vuejs-expert.agent.md: Expert Vue.js Frontend Engineer agent covering
  Vue 3 Composition API, Pinia, Vue Router, TypeScript integration, and
  legacy Vue 2/Options API compatibility.

Both agents use Claude Sonnet 4.5 and follow existing agent conventions.
README.agents.md regenerated via npm run build.
2026-02-26 12:03:35 +01:00
Aaron Powell
81f7996e9b Merge pull request #768 from v-rperez030/markdown-accessibility-assistant
Add Markdown Accessibility Assistant agent
2026-02-26 09:58:57 +11:00
Muhammad Ubaid Raza
30ead89753 chore: remove conlciting subagent instruction from orchestrator 2026-02-25 02:04:34 +05:00
Muhammad Ubaid Raza
59d26c54fa chore: remove conflicting artifact instruction 2026-02-25 01:50:33 +05:00
Muhammad Ubaid Raza
4dea294547 chore: Add evidence to browser tester 2026-02-24 19:56:31 +05:00
Muhammad Ubaid Raza
bf33085e18 docs(agent): prefer accessibility_snapshot for element identification
Updated agent guidelines to specify the use of accessibility_snapshot over visual screenshots for element identification. Accessibility snapshots provide structured DOM/ARIA data, which is more reliable for browser automation than pixel-based visual analysis.
2026-02-24 04:21:41 +05:00
Muhammad Ubaid Raza
cc9061e4c1 chore(agents): remove duplicate config in gem-orchestrator
Remove redundant `<operating_rules>`, agent-specific parameters, and
`<delegation_validation>` sections. This removes duplicate content
to ensure a single source of truth for the agent's behavior protocols
and configuration.
2026-02-23 21:51:00 +05:00
Muhammad Ubaid Raza
5db9683096 refactor: standardize agent workflows and artifact paths
- Refined `gem-browser-tester` workflow to separate initialization from
  execution and enforce an Observation-First loop.
- Added retry logic for transient failures (e.g., timeouts, network issues)
  in browser automation tasks.
- Standardized artifact generation paths to `docs/plan/{plan_id}/`
  across multiple agents.
- Updated failure actions to specify evidence capture locations
  (logs, network) for improved debugging and traceability.
2026-02-23 16:40:51 +05:00
Muhammad Ubaid Raza
c91c374d47 refactor: standardize browser tester agent structure
Introduce explicit sections for input, output, and verification criteria.
Define structured JSON output including detailed evidence paths and error counts.
Update workflow to reference new guides and move Observation-First loop to operating rules.
Clarify verification steps with specific pass/fail conditions for console, network, and accessibility checks.
2026-02-23 02:10:15 +05:00
Muhammad Ubaid Raza
213d15ac83 refactor: update agent workflows and orchestrator logic
- Remove redundant `<mission>` section from gem-browser-tester
- Add "Reflect" step to gem-documentation-writer for self-review on high-priority or failed tasks
- Refactor gem-orchestrator completion phase to generate a walkthrough markdown file instead of a review
- Update orchestrator rules to allow direct execution for creating walkthrough files
2026-02-22 00:55:07 +05:00
Muhammad Ubaid Raza
432241e197 Merge branch 'main' into remove-conflict 2026-02-19 23:02:05 +05:00
Muhammad Ubaid Raza
21507bf644 fix: invlaid file references 2026-02-19 22:59:27 +05:00
Roberto Perez
21fec15360 Add Markdown Accessibility Assistant agent 2026-02-19 16:40:53 +00:00
Imran Siddique
33b7464774 feat: add agent-safety instructions and governance reviewer agent
- instructions/agent-safety.instructions.md: Guidelines for building safe,
  governed AI agent systems (tool access controls, content safety,
  multi-agent safety, audit patterns, framework-specific notes)
- agents/agent-governance-reviewer.agent.md: Expert agent that reviews code
  for governance gaps and helps implement policy enforcement

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-02-18 13:30:30 -08:00
Muhammad Ubaid Raza
63cdc6c14b fix: remove conlciting solid from implementer 2026-02-18 14:01:03 +05:00
Aaron Powell
35cd212801 Merge pull request #733 from mubaidr/chrome-to-browser
refactor: rename gem-chrome-tester to gem-browser-tester
2026-02-18 13:59:41 +11:00
Muhammad Ubaid Raza
d477f8745f chore: add think before act 2026-02-18 03:10:15 +05:00
Muhammad Ubaid Raza
b0d59d8f78 refactor: standardize agent operating rules across gem agents
Remove "detailed thinking on" directive and consolidate operating_rules sections for consistency. Both gem-browser-tester.agent.md and gem-devops.agent.md now share standardized rules: unified tool activation phrasing ("Always activate tools before use"), merged context-efficient reading instructions, and removed agent-specific variations. This simplifies maintenance and ensures consistent behavior across different agent types while preserving core functionality like evidence storage, error handling, and output constraints.
2026-02-17 16:37:34 +05:00
Jan Krivanek
a6550b1025 Remove overspecifications 2026-02-16 16:23:26 +01:00
Jan Krivanek
1cd34d5d25 Add polygot test agent 2026-02-16 15:51:07 +01:00
Muhammad Ubaid Raza
448ad46e72 refactor: rename gem-chrome-tester to gem-browser-tester
Rename the Chrome-specific testing agent to a browser-agnostic version
to support multiple automation tools (Playwright, Chrome DevTools, etc.).
Updates all references in orchestrator and planner configurations, and
broadens the description and execution workflow to be tool-flexible.
Evidence storage rule clarified to apply primarily on failures.
2026-02-16 13:42:29 +05:00
Aaron Powell
ff2123f759 Merge pull request #721 from mubaidr/update-gem-team
Update agent defitions to match new settings user-invokable > user-invocable
2026-02-16 11:24:14 +11:00
Muhammad Ubaid Raza
6ec3104cdc chore: planner to avoid overengineering 2026-02-16 01:41:03 +05:00
Muhammad Ubaid Raza
6ce691e19e chore: planner to avoid overengineering 2026-02-16 01:34:02 +05:00
Muhammad Ubaid Raza
acad3f5953 chore: researcher use skim technique to reduce context 2026-02-16 01:07:49 +05:00
Muhammad Ubaid Raza
cfe75537ae chore: researcher doc type 2026-02-16 01:00:18 +05:00
Muhammad Ubaid Raza
82a54652ed chore: researcher doc type 2026-02-15 23:56:30 +05:00
Muhammad Ubaid Raza
ffbef1aa74 chore: explicit suer interaction 2026-02-15 13:37:37 +05:00
Muhammad Ubaid Raza
93d4e723b5 chore: orchestrator now valdiates if research findings exists or not 2026-02-15 01:11:39 +05:00
Muhammad Ubaid Raza
7484058642 chore: orchestrator now valdiates if research findings exists or not 2026-02-15 00:49:23 +05:00
Muhammad Ubaid Raza
0355730828 chore: orchestrator now valdiates if research findings exists or not 2026-02-15 00:12:19 +05:00
Muhammad Ubaid Raza
dba425d9d2 chore: orchestrator now valdiates if research findings exists or not 2026-02-14 22:03:36 +05:00
Muhammad Ubaid Raza
63f8cd9803 chore: Enforce silent execution 2026-02-14 01:47:50 +05:00
Muhammad Ubaid Raza
f09df95320 Fix typo in agent configuration: change 'user-invokable' to 'user-invocable' 2026-02-14 00:45:41 +05:00
Ivan Charapanau
a5b2a29aa2 Merge branch 'main' into add-rug-swe-qa-agents-collection 2026-02-13 09:57:46 +01:00
Muhammad Ubaid Raza
d193446afe chore: enforce breifness 2026-02-13 00:30:18 +05:00
Muhammad Ubaid Raza
e3c9760b6b chore: enforce batch tool calls 2026-02-12 23:31:13 +05:00
Muhammad Ubaid Raza
d67f5d7bb9 fix: update tldr and task description fields to use literal scalars for better formatting 2026-02-12 21:19:26 +05:00
Ivan Charapanau
35561ea60e Fix codespell reuse typo in RUG agent 2026-02-12 16:00:42 +01:00
Ivan Charapanau
21ebb31a72 Add RUG/SWE/QA agents and workflow collection 2026-02-12 15:55:28 +01:00