SIGNALAI·May 25, 2026, 4:00 AMSignal85Medium term

QED: An Open-Source Multi-Agent System for Generating Mathematical Proofs on Open Problems

arXiv:2604.24021v3 Announce Type: replace Abstract: We present \textbf{QED}, an open-source multi-agent system that turns human-provided research questions into complete mathematical proofs without further human guidance. Its pipeline is designed to overcome common failures of single-query proof generation by separating planning, proving, and verification: a decomposition agent structures the proof search, prover agents generate candidate arguments, and verifier agents check correctness. In collaboration with domain experts, we evaluated QED on 18 research-level projects of varying difficulty.

Why this matters

Why now

The accelerating trend in large language models and multi-agent system architectures has matured to a point where complex, abstract problem-solving is becoming viable.

Why it’s important

This marks a significant step towards fully autonomous AI systems capable of generating novel, verifiable intellectual output, potentially automating core intellectual labor.

What changes

The ability to generate complete mathematical proofs without human guidance moves AI beyond assistive tooling into independent knowledge creation across a foundational scientific domain.

Winners

· AI research and development (academia and industry)
· Mathematics and theoretical sciences
· Proof automation software developers
· Open-source AI communities

Losers

· Tasks requiring manual proof generation
· Specific segments of academic research that rely on human-only proof discovery

Second-order effects

Direct

QED immediately demonstrates a new capability boundary for AI in abstract reasoning and problem-solving.

Second

This could lead to accelerated discovery in mathematics and other formal sciences by automating the generation and verification of complex proofs.

Third

The success of multi-agent systems for knowledge creation may drive a broader re-evaluation of 'human-only' intellectual domains and accelerate the development of autonomous agents across various professional fields.

Editorial confidence: 95 / 100 · Structural impact: 70 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.AI

#cs.AI #math.AP

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.