SIGNALAI·May 25, 2026, 4:00 AMSignal75Medium term

Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

arXiv:2604.28048v2 Announce Type: replace Abstract: Large Language Models (LLMs) are increasingly used as proxies for human perception in urban analysis, yet it remains unclear whether persona prompting produces meaningful and reproducible behavioral diversity. We investigate whether distinct personas influence urban sentiment judgments generated by multimodal LLMs. Using a factorial set of personas spanning gender, economic status, political orientation, and personality, we instantiate multiple agents per persona to evaluate urban scene images from the PerceptSent dataset and assess both with

Why this matters

Why now

The proliferation of LLMs and their increasing application in diverse fields, particularly those requiring human-like judgment, necessitates a deeper understanding of their reliability and biases when persona prompting is used.

Why it’s important

This research directly addresses the validity and reproducibility of LLM outputs for nuanced tasks, which is critical for their safe and effective deployment as proxies for human perception in sensitive areas like urban analysis.

What changes

The findings suggest that simply assigning personas to LLMs may not consistently achieve the desired behavioral diversity, prompting a re-evaluation of current LLM agent design and prompting strategies for social perception tasks.

Winners

· AI researchers focusing on explainable AI
· Developers of robust LLM evaluation frameworks
· Ethical AI advocates

Losers

· Organizations relying on simple persona prompting for LLM agents
· Users expecting nuanced, diverse opinions from persona-prompted LLMs without val
· LLM applications in social science without rigorous testing

Second-order effects

Direct

It highlights potential limitations in current LLM persona-based prompting for generating diverse and reliable 'human-like' insights.

Second

This could lead to a demand for more sophisticated and validated methods for instilling specific perspectives into LLMs, moving beyond superficial persona assignments.

Third

Long-term, it may drive the development of 'personality' architectures within LLMs or specialized fine-tuning approaches grounded in psychological models, rather than prompt engineering alone.

Editorial confidence: 90 / 100 · Structural impact: 60 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.CL

#cs.CL #cs.SI

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.