TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation

arXiv:2606.06133v1 Announce Type: cross Abstract: TLA+ is a formal specification language for verifying distributed systems and safety-critical protocols. Large language models (LLMs) frequently produce TLA+ specifications that fail the TLC model checker for semantic reasons. Across 25 LLMs, the best public baseline is 26.6% syntactic parse and 8.6% semantic model-check. We present TLA-Prover, a 20-billion-parameter model for TLA+ specification synthesis. Training combines supervised fine-tuning (SFT) on verified examples with repair-based group-relative policy optimization (GRPO). In the GRPO
The proliferation of LLMs creates an urgent need for more reliable AI-driven code generation, especially for critical systems where formal verification is paramount.
This development addresses a critical weakness in current LLM capabilities for generating verifiable code, opening new possibilities for automated formal methods in software development.
The ability to synthesize formally verifiable specifications with high accuracy via LLMs could significantly accelerate the development and safety assurance of complex systems.
- · AI agents developers
- · Cybersecurity sector
- · Aerospace and defence software
- · Formal methods researchers
- · Software engineers specializing in manual formal verification
Improved reliability and safety of AI-generated software for critical applications.
Accelerated adoption of AI in areas requiring high assurance, such as autonomous systems and infrastructure.
Potential for fully autonomous, formally verified software development pipelines, reducing human error and development time.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at arXiv cs.LG