NOISEAI·Jun 25, 2026, 4:00 AMSignal5Long term

Three Buddhist Vocabularies: Computational Stylometry of the English Pali Canon across Sutta, Vinaya, and Abhidhamma

arXiv:2606.25372v1 Announce Type: new Abstract: We present a computational stylometric analysis of the Tipitaka across all three Pitakas in English translation, extending earlier work on the Sutta Pitaka alone. The corpus spans 134,831 segments from Bhikkhu Sujato's Sutta Pitaka (114,591 segments, CC0), Bhikkhu Brahmali's Vinaya Pitaka (7,923 segments, CC0 2026), I.B. Horner's 1938 Vinaya translation (2,826 segments), three English translations of the Abhidhammattha Sangaha compendium (2,077 segments), and cross-tradition Vinaya texts from the Dharmaguptaka and Mulasarvastivada schools. We com

Why this matters

Why now

The proliferation of computational linguistic tools enables detailed stylometric analysis of historical texts.

Why it’s important

While interesting for digital humanities and religious studies, this research has no material impact on markets, geopolitics, or the tech stack.

What changes

Little changes beyond a deeper computational understanding of Buddhist texts, which is a niche academic interest.

Winners

· Digital humanities researchers
· Pali Canon scholars

Losers

Second-order effects

Direct

The Sutta, Vinaya, and Abhidhamma Pitakas are computationally analyzed for stylistic differences.

Second

New insights into the authorship and compilation processes of these ancient texts could emerge.

Third

This could lead to new interdisciplinary academic programs combining computer science and religious studies.

Editorial confidence: 90 / 100 · Structural impact: 0 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.CL

#cs.CL #cs.IR

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.