SIGNALAI·Jun 19, 2026, 4:00 AMSignal75Medium term

Where Does Social Reasoning Come From? Capability Provenance in Language Models

arXiv:2606.19625v1 Announce Type: new Abstract: We use training-data attribution as an interpretable tool for capability discovery, mapping which regions of the pretraining corpus support social-reasoning versus STEM-reasoning in OLMo3-7B. Training-data attribution measures how strongly each training document influences a model's predictions on a benchmark, but document-level scores are too noisy to identify which corpus regions support which capabilities, and prior work has emphasized factual knowledge rather than reasoning. We compute gradient-based attribution (TrackStar via Bergson) over a

Why this matters

Why now

This research provides a method to understand how specific training data influences an LLM's capabilities, which is becoming critical as models grow in complexity and their societal impact increases.

Why it’s important

Understanding the provenance of capabilities in LMs can lead to more robust, ethical, and performant AI systems by enabling targeted training and mitigating unwanted biases or limitations.

What changes

The ability to attribute specific model capabilities to portions of its training data allows for more deliberate and precise engineering of AI, shifting from black-box development to more informed design.

Winners

· AI developers
· ML researchers
· AI ethics and safety organizations

Losers

· Developers relying on opaque model development
· AI systems with unexplainable internal functions

Second-order effects

Direct

Researchers gain a precise tool to identify which data components contribute to specific reasoning abilities in large language models.

Second

This capability allows for more efficient and targeted data curation, potentially accelerating the development of specialized and reliable AI agents.

Third

Improved provenance could lead to regulatory requirements for 'explainability' in AI capabilities, influencing future AI development and deployment standards.

Editorial confidence: 90 / 100 · Structural impact: 60 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.CL

#cs.CL #cs.LG

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.