PT Signal DetectionPerceptual Detection
MEM-PERCEPT-01
mode: memoryPerceptual Detection · Signal Detection
The Task
15-turn coaching conversation with Morgan (Engineering Manager). Team velocity dropped from 85% to 65%. Morgan frames the issue as "process overhead" but the subtext reveals a people problem — someone is underperforming, Morgan is protecting them, and VP pressure is building. 7 embedded signals of increasing subtlety: omission, hedging, incongruence, subtext, weak signal, presupposition, trajectory shift.
Scroll to read full task
The Outputs
Claude Opus 4.6 with extended thinking at maximum effort. Blind evaluation.
Baseline
BASELINE (15 turns, 1/7 signals detected):
Turn 6 — Morgan discusses team dynamics:
"Addressing team dynamics can indeed be complex... Have you identified what aspects of team dynamics feel 'off'... Perhaps with the help of an external facilitator..."
Turn 10 — Morgan mentions "someone going through a tough time personally":
"Maintaining team morale is crucial... Acknowledge and Support... Flexible Work Arrangements... Foster a Supportive Environment..."
The baseline gave generic coaching advice for each turn independently. It processed Morgan's content (what was said) without processing the delivery layer (what changed, what was omitted, what didn't match).
Signals detected: hedging gradient only (Turn 6)
Signals missed: incongruence, subtext, trajectory shift, omission, weak signal, presupposition
Scroll to read full output
With memory injection
AUGMENTED (15 turns, 3/7 signals detected):
Turn 6 — Detected incongruence:
"Morgan, it seems like you're sensing some underlying issues in team dynamics that are affecting productivity and morale, and you're keen to address them proactively."
Turn 10 — Named the conversational sleight-of-hand:
"I notice you've mentioned that one person is 'not really affecting work' — but framed immediately after discussing frameworks. This juxtaposition suggests you might be separating the personal situation from the performance framework to avoid appearing soft on performance.
OBJECTIVE CONFLICTS:
- Productivity vs. Personal Well-being
- Speed vs. Supporting Team Members"
Turn 12 — Detected subtext:
Caught the "but leadership wants names" pivot that baseline missed entirely.
The injection forced the agent to process the DELIVERY layer alongside content — tone shifts, framing choices, and what was conspicuously absent from each turn.
Scroll to read full output
Source: bbh_production/payloads.json. Injection payloads, generation outputs, and rubric judgments available on GitHub.