3 Commits

Author SHA1 Message Date
github-actions[bot]
6c1edcdc1b chore: publish from staged 2026-03-12 03:28:13 +00:00
Dan Velton
f391bc9c30 doublecheck: auto-escalate to full verification report for high-risk content (#980)
* doublecheck: auto-escalate to full report for high-risk content

In active (persistent) mode, doublecheck previously used abbreviated
inline verification for all content types, with the full three-layer
report only available if the user explicitly requested it. Most users
did not know the command existed, so high-stakes content (legal
analysis, regulatory guidance) and responses with DISPUTED or
FABRICATION RISK findings were presented with only a summary table.

Changes:
- Auto-escalate to the full verification report when any claim rates
  DISPUTED or FABRICATION RISK during inline verification
- Always use the full report for legal analysis, regulatory
  interpretation, compliance guidance, and content with case citations
  or statutory references
- Update the activation message to mention the 'full report' command
- Add a discoverability footer to inline verification sections
- Update README.md to document the new behavior
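
The escalation rule in the changes above can be sketched roughly as follows. This is an illustration only, not the plugin's implementation: `needs_full_report`, `HIGH_STAKES_TYPES`, and the content-type labels are hypothetical names chosen for the example, and only the rating names come from the commit message.

```python
from enum import Enum


class Rating(Enum):
    """Confidence ratings named in the doublecheck commit messages."""
    VERIFIED = "VERIFIED"
    PLAUSIBLE = "PLAUSIBLE"
    UNVERIFIED = "UNVERIFIED"
    DISPUTED = "DISPUTED"
    FABRICATION_RISK = "FABRICATION RISK"


# Hypothetical labels for the content types that always get the full report.
HIGH_STAKES_TYPES = {
    "legal_analysis",
    "regulatory_interpretation",
    "compliance_guidance",
    "case_citations",
    "statutory_references",
}

# Ratings that trigger auto-escalation during inline verification.
ESCALATING_RATINGS = {Rating.DISPUTED, Rating.FABRICATION_RISK}


def needs_full_report(content_types, claim_ratings):
    """Return True when the full report should replace inline verification."""
    # High-stakes content rule: always use the full report.
    if HIGH_STAKES_TYPES & set(content_types):
        return True
    # Auto-escalation rule: any single flagged claim is enough.
    return any(rating in ESCALATING_RATINGS for rating in claim_ratings)
```

Under these assumptions, a response tagged `legal_analysis` gets the full report even if every claim rates VERIFIED, while general content escalates only once a claim rates DISPUTED or FABRICATION RISK.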

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Address review feedback: fix table reference, reconcile bullets, clarify callout placement, add trigger phrase

- Table row now references 'high-stakes content rule' (not 'auto-escalation
  rule') and lists all content types that match the later rule text
- README first bullet no longer claims legal content gets inline
  verification, resolving the contradiction with the next bullet
- Clarify that the 'Heads up' callout goes at the top of the full report
  when auto-escalation applies, not in a nonexistent inline section
- Add 'full report' to the explicit trigger phrase list in the 'Offer
  full verification on request' rule

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Update README.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update README.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2026-03-12 14:27:52 +11:00
Dan Velton
eb7d223446 Add doublecheck plugin: three-layer verification pipeline for AI output (#978)
* Add doublecheck plugin: three-layer verification pipeline for AI output

Adds a new plugin that helps users verify AI-generated content before
acting on it. Designed for sensitive contexts (legal, medical, financial,
compliance) where hallucinations carry real consequences.

Three verification layers:
- Self-Audit: extracts verifiable claims, checks internal consistency
- Source Verification: runs web searches per claim, produces URLs for human review
- Adversarial Review: assumes errors exist, checks hallucination patterns

Supports persistent mode (auto-verifies every factual response inline)
and one-shot mode (full report on specific text). Confidence ratings:
VERIFIED, PLAUSIBLE, UNVERIFIED, DISPUTED, FABRICATION RISK.
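
As a rough sketch of how per-claim ratings might be recorded and tallied for a summary, assuming a simple record type: the `Claim` class, its `sources` field, and `summarize` are hypothetical names for illustration and are not taken from the plugin. Only the five rating names come from the commit message.

```python
from collections import Counter
from dataclasses import dataclass

# Rating names in the order the commit message lists them.
RATINGS = ["VERIFIED", "PLAUSIBLE", "UNVERIFIED", "DISPUTED", "FABRICATION RISK"]


@dataclass
class Claim:
    text: str
    rating: str
    sources: tuple = ()  # hypothetical field: URLs for human review

    def __post_init__(self):
        # Reject ratings outside the five documented values.
        if self.rating not in RATINGS:
            raise ValueError(f"unknown rating: {self.rating!r}")


def summarize(claims):
    """Count claims per rating, keeping the documented rating order."""
    counts = Counter(claim.rating for claim in claims)
    return [(rating, counts[rating]) for rating in RATINGS if counts[rating]]
```

Keeping the ratings in one ordered list means the summary rows always appear in the same order, whatever order the claims were checked in.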

Includes:
- Skill (skills/doublecheck/) with bundled report template
- Agent (agents/doublecheck.agent.md) for interactive verification
- Plugin package (plugins/doublecheck/) bundling both

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

* Address review: fix tools YAML format, remove materialized artifacts

- Fix tools frontmatter in agents/doublecheck.agent.md to use standard
  YAML list format instead of a flow sequence with a trailing comma
- Remove plugins/doublecheck/agents/ and plugins/doublecheck/skills/
  from tracking; these paths are in .gitignore as CI-materialized
  artifacts that should not be committed

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-03-12 12:35:43 +11:00