Three Buddhist Vocabularies: Computational Stylometry of the English Pali Canon across Sutta, Vinaya, and Abhidhamma

arXiv:2606.25372v1 Announce Type: new Abstract: We present a computational stylometric analysis of the Tipitaka across all three Pitakas in English translation, extending earlier work on the Sutta Pitaka alone. The corpus spans 134,831 segments from Bhikkhu Sujato's Sutta Pitaka (114,591 segments, CC0), Bhikkhu Brahmali's Vinaya Pitaka (7,923 segments, CC0 2026), I.B. Horner's 1938 Vinaya translation (2,826 segments), three English translations of the Abhidhammattha Sangaha compendium (2,077 segments), and cross-tradition Vinaya texts from the Dharmaguptaka and Mulasarvastivada schools. We com
The proliferation of computational linguistic tools enables detailed stylometric analysis of historical texts.
While interesting for digital humanities and religious studies, this research has no material impact on markets, geopolitics, or the tech stack.
Little changes beyond a deeper computational understanding of Buddhist texts, which is a niche academic interest.
- · Digital humanities researchers
- · Pali Canon scholars
The Sutta, Vinaya, and Abhidhamma Pitakas are computationally analyzed for stylistic differences.
New insights into the authorship and compilation processes of these ancient texts could emerge.
This could lead to new interdisciplinary academic programs combining computer science and religious studies.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at arXiv cs.CL