Browse Tag

speech technology

Conversational AI and Voice Assistants – Key Developments (June–July 2025)

Conversational AI and Voice Assistants – Key Developments (June–July 2025)

Amazon’s Alexa+ entered broad early access by mid-2025 and surpassed 1 million users by June 2025. Alexa+ is free for Amazon Prime members and costs $19.99/month for others, and it orchestrates “experts” across services like OpenTable, Spotify, and Uber Eats. Alexa+ can remember personal details and even has its own email for user-provided information, enabling more complex multi-step requests than the old Alexa. In Early Access, about 90% of Alexa+ features are live, with hands-free shopping and advanced video controls still under development. Apple’s Siri saw no major generative AI launch in June 2025, with delays to a likely iOS
Top 10 AI Voice and Speech Technologies Dominating 2025 (TTS, STT, Voice Cloning)

Top 10 AI Voice and Speech Technologies Dominating 2025 (TTS, STT, Voice Cloning)

Google Cloud Speech AI provides Text-to-Speech with 380+ voices across 50+ languages using WaveNet/Neural2, Speech-to-Text in 125+ languages, and Custom Voice generally available in 2024. Azure Speech Service offers Neural Text-to-Speech with 446 voices in 144 languages (as of mid-2024), Speech-to-Text in 75+ languages, and Custom Neural Voice with cloud or on-prem deployment. Amazon Polly delivers 100+ voices in 40+ languages, includes Neural Generative TTS with 13 ultra-expressive voices by late 2024, and Amazon Transcribe supports 100+ languages. IBM Watson Speech Services provide Text-to-Speech in 13+ languages and Speech-to-Text in 8–10 languages, with 2024 Large Speech Models and on-prem deployment
Go toTop