Moral Semantics Survive Machine Translation: Cross-Lingual Evidence from Moral Foundations Corpora

arXiv:2605.22660v1 Announce Type: new Abstract: Moral language is subtle and culturally variable, making it difficult to translate faithfully across languages. Idiomatic expressions, slang, and cultural references introduce hard-to-avoid translation artifacts. Yet automated moral values classification depends on language-specific annotated corpora that exist almost exclusively in English. We investigate whether LLM-based translation can bridge this gap, taking Polish as a test case. Using $\sim$50k morally-annotated social media posts from a diverse range of topics, we apply a principled four-
The proliferation of LLMs and the increasing demand for culturally nuanced AI applications make this research timely and essential for cross-lingual understanding.
This research suggests that LLMs can overcome language barriers in understanding complex moral concepts, which is critical for developing globally applicable and ethically aligned AI systems.
The ability to accurately translate and classify moral semantics across languages using LLMs enables the development of AI that can operate effectively and ethically in diverse cultural contexts.
- · AI developers
- · Multinational corporations
- · Social media platforms
- · Ethical AI research
- · Companies with single-language AI models
- · Translators focused solely on literal translation
AI models will become more effective at understanding and responding to moral and ethical dilemmas in non-English languages.
This improved cross-cultural moral understanding could lead to more nuanced global content moderation and culturally sensitive AI assistants.
It might accelerate the development of AI ethics frameworks that are inherently multinational, reducing Western-centric bias in AI development.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at arXiv cs.CL