SIGNALAI·May 28, 2026, 4:00 AMSignal75Short term

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

arXiv:2605.28721v1 Announce Type: new Abstract: Are LLM-based search agents genuinely searching, or using the web to verify what they already know? We study this question on BrowseComp with three diagnostics. Our analysis reveals Intrinsic Knowledge Dependence (IKD): even with tool access, agents often rely on intrinsic knowledge -- information encoded in the model before retrieval -- rather than on external evidence. Agents answer up to 44.5% of BrowseComp questions without tools, generate more than half of their search queries from internally produced hypotheses rather than retrieved leads,

Why this matters

Why now

The proliferation of LLM-based search agents necessitates understanding their true operational mechanisms to improve their effectiveness and reliability.

Why it’s important

This study challenges the assumption that AI agents primarily rely on external web data, highlighting a significant dependence on internal knowledge, which impacts their utility and development.

What changes

The understanding of how AI agents perform search tasks shifts, indicating a need to rethink agent design, training data, and evaluation metrics.

Winners

· AI model developers focused on knowledge grounding
· Companies specializing in AI agent evaluation tools
· Researchers developing advanced retrieval-augmented generation techniques

Losers

· AI agent developers relying solely on external search for intelligence
· Users expecting agents to always leverage the most current external data
· Platforms providing only basic web search APIs for agents

Second-order effects

Direct

AI agent architectures will be redesigned to more explicitly manage the interaction between internal knowledge and external retrieval.

Second

There will be increased investment in developing better mechanisms for agents to identify and mitigate their 'Intrinsic Knowledge Dependence' (IKD).

Third

The perceived value and trustworthiness of AI agents for critical information retrieval tasks may be temporarily lowered until these issues are addressed.

Editorial confidence: 90 / 100 · Structural impact: 60 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.AI

#cs.AI

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.