Introducing Amazon Polly Bidirectional Streaming: Real-time speech synthesis for conversational AI

Amazon AWS AI Blog / 3/27/2026


Key Points

  • Amazon Polly has introduced a Bidirectional Streaming API that allows sending text and receiving synthesized audio simultaneously for real-time TTS.
  • The API is designed for conversational AI workloads where text is produced incrementally (e.g., output from LLMs) and audio playback must begin before the full response is ready.
  • This streaming approach is intended to reduce latency and improve responsiveness in voice and conversational user experiences.
  • By supporting incremental input and output, the API simplifies integration for developers building interactive speech interfaces on top of conversational systems.

Today, we’re excited to announce the new Bidirectional Streaming API for Amazon Polly, which enables real-time text-to-speech (TTS) synthesis in which you send text and receive synthesized audio at the same time. The API is built for conversational AI applications that generate text or audio incrementally, such as responses from large language models (LLMs), where audio synthesis must begin before the full text is available.
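
To make the send-while-receiving pattern concrete, here is a minimal Python sketch of the data flow. It does not call Amazon Polly: `fake_llm_tokens`, `fake_synthesize`, and `synthesis_stub` are hypothetical stand-ins for an LLM and a synthesis service, and the "audio" is just the encoded text. The only point it illustrates is that audio chunks can arrive while text is still being sent.

```python
import asyncio
from typing import AsyncIterator, List, Tuple, Union

END = None  # sentinel marking the end of a stream

Event = Tuple[str, Union[str, bytes]]


async def fake_llm_tokens() -> AsyncIterator[str]:
    """Hypothetical stand-in for an LLM that yields text incrementally."""
    for token in ["Hello", " there.", " How", " can", " I", " help?"]:
        await asyncio.sleep(0)  # yield control, as a network wait would
        yield token


async def fake_synthesize(text: str) -> bytes:
    """Hypothetical synthesizer: returns placeholder bytes, not real audio."""
    await asyncio.sleep(0)
    return text.encode("utf-8")


async def send_text(text_q: asyncio.Queue, events: List[Event]) -> None:
    """Push text chunks as soon as they are produced."""
    async for token in fake_llm_tokens():
        await text_q.put(token)
        events.append(("text", token))
    await text_q.put(END)


async def synthesis_stub(text_q: asyncio.Queue, audio_q: asyncio.Queue) -> None:
    """Stand-in for the service side: turns each text chunk into an audio chunk."""
    while (token := await text_q.get()) is not END:
        await audio_q.put(await fake_synthesize(token))
    await audio_q.put(END)


async def receive_audio(audio_q: asyncio.Queue, events: List[Event]) -> bytes:
    """Consume audio chunks as they arrive; a real client would play them."""
    received = bytearray()
    while (chunk := await audio_q.get()) is not END:
        events.append(("audio", chunk))
        received.extend(chunk)
    return bytes(received)


async def run_demo() -> Tuple[bytes, List[Event]]:
    events: List[Event] = []
    # Bounded queues add backpressure so sending and receiving interleave.
    text_q: asyncio.Queue = asyncio.Queue(maxsize=1)
    audio_q: asyncio.Queue = asyncio.Queue(maxsize=1)
    _, _, audio = await asyncio.gather(
        send_text(text_q, events),
        synthesis_stub(text_q, audio_q),
        receive_audio(audio_q, events),
    )
    return audio, events


if __name__ == "__main__":
    audio, events = asyncio.run(run_demo())
    for kind, payload in events:
        print(kind, payload)
```

In the event log, the first `audio` entries appear before the last `text` entries, which is the behavior a bidirectional stream is meant to provide. With a request/response API, the client would instead have to wait for the full text before any audio could be produced.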