
arXiv:2606.27380v1 Announce Type: new Abstract: Automated coaching for oral presentations sits at the intersection of computer-assisted pronunciation training (CAPT), prosody modeling, and speech synthesis, yet no prior work has systematically surveyed and compared existing systems along these dimensions. This survey reviews and categorizes automated presentation coaching systems, spanning pronunciation tutors, fluency and prosody coaches, multimodal trainers, and conference Q&A practice tools. We introduce a five-dimensional task taxonomy - covering segmental pronunciation, lexical stress, su
Rapid advances in AI speech processing and synthesis make a systematic review of automated presentation coaching timely, as the technology matures enough to support sophisticated human-computer interaction.
This survey provides a foundational understanding of the current state and future directions of AI-driven communication training, which is critical for organizations and individuals seeking to leverage technology for skill development and efficiency.
The explicit categorization of automated presentation coaching systems establishes a clearer framework for development and evaluation in this nascent field, highlighting gaps and opportunities for innovation in AI agents focused on communication.
- AI Speech Technology Developers
- Corporate Training Platforms
- Educational Technology Providers
- Professional Coaches
- Inefficient Manual Coaching Services
Automated presentation coaching is likely to become more sophisticated, offering real-time feedback on several aspects of communication.
Improved public speaking and presentation skills driven by AI tools could enhance professional development and business pitches across industries.
Widespread adoption of such tools might lead to a more standardized and perhaps less authentic style of communication, as individuals optimize for AI-driven feedback.
Read at arXiv cs.CL