Awesome/awesome-copilot

mirror of https://github.com/github/awesome-copilot.git synced 2026-04-11 18:55:55 +00:00

Files

github-actions[bot] a68b190031 chore: publish from staged

2026-04-09 06:26:21 +00:00

1.3 KiB

Raw Permalink Blame History

Error Analysis: Multi-Turn Conversations

Debugging complex multi-turn conversation traces.

The Approach

End-to-end first - Did the conversation achieve the goal?
Find first failure - Trace backwards to root cause
Simplify - Try single-turn before multi-turn debug
N-1 testing - Isolate turn-specific vs capability issues

Find First Upstream Failure

Turn 1: User asks about flights ✓
Turn 2: Assistant asks for dates ✓
Turn 3: User provides dates ✓
Turn 4: Assistant searches WRONG dates ← FIRST FAILURE
Turn 5: Shows wrong flights (consequence)
Turn 6: User frustrated (consequence)

Focus on Turn 4, not Turn 6.

Simplify First

Before debugging multi-turn, test single-turn:

# If single-turn also fails → problem is retrieval/knowledge
# If single-turn passes → problem is conversation context
response = chat("What's the return policy for electronics?")

N-1 Testing

Give turns 1 to N-1 as context, test turn N:

context = conversation[:n-1]
response = chat_with_context(context, user_message_n)
# Compare to actual turn N

This isolates whether error is from context or underlying capability.

Checklist

Did conversation achieve goal? (E2E)
Which turn first went wrong?
Can you reproduce with single-turn?
Is error from context or capability? (N-1 test)