SIGNALAI·Jun 29, 2026, 4:00 AMSignal75Long term

Triadic Werewolf: A Jester Role for Multi-Hop Theory of Mind in LLMs

arXiv:2606.27909v1 Announce Type: cross Abstract: Theory-of-mind evaluations of large language models typically use dyadic social-deduction games, where every observable cue points to a single hidden side, so a model with strong language priors can score well without ever simulating opponents' incentives. We extend the Werewolf game with a Jester, a third faction whose utility on peer suspicion is inverted because it wins by being voted out, so optimal play requires reasoning across three opposing utility functions. Across 60 games on GPT-4.1, DeepSeek-V3.1, and Llama-3.3-70B with Jester self-

Why this matters

Why now

The paper was published on arXiv, indicating ongoing academic research and development in AI theory of mind, specifically addressing limitations of current evaluation methods.

Why it’s important

This research indicates a significant step towards developing AI models with more sophisticated understanding of complex social incentives, crucial for advanced agentic systems.

What changes

Current theory-of-mind evaluations are challenged, leading to new benchmarks and potentially more robust AI agents capable of navigating multi-hop social dilemmas.

Winners

· AI researchers
· Developers of AI agents
· Companies investing in advanced AI

Losers

· LLMs with superficial social reasoning
· Simpler AI evaluation methodologies

Second-order effects

Direct

More accurate and nuanced evaluations of LLM social reasoning capabilities will emerge.

Second

This could lead to the development of highly sophisticated AI agents capable of complex strategic interaction in diverse environments.

Third

Such agents might redefine human-AI collaboration and autonomous system behavior in fields like negotiation, gaming, and potentially even diplomacy.

Editorial confidence: 90 / 100 · Structural impact: 65 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.AI

#cs.CL #cs.AI #cs.GT #cs.MA

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.