Streaming Text-to-Speech (Streaming TTS) generates spoken audio incrementally as text becomes available instead of waiting for the complete response to be produced.
Voice AI platforms use Streaming TTS to begin speaking as soon as the first portion of a response has been generated, rather than after the whole response is ready. This reduces perceived latency and makes conversations feel more natural and responsive.
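
As a rough illustration of the incremental idea, the following Python sketch buffers incoming text fragments until a sentence boundary and hands each sentence to a synthesizer right away. The names `chunk_sentences`, `stream_tts`, and `fake_synthesize` are hypothetical and not taken from any particular TTS product; the sentence-boundary rule is a simplifying assumption, and real systems may split text differently.

```python
from typing import Callable, Iterable, Iterator

# Assumption: a chunk is complete when it ends with one of these characters.
SENTENCE_END = (".", "!", "?")


def chunk_sentences(tokens: Iterable[str]) -> Iterator[str]:
    """Group incoming text fragments into sentence-sized chunks."""
    buffer = ""
    for token in tokens:
        buffer += token
        # Flush as soon as the buffered text ends a sentence.
        if buffer.rstrip().endswith(SENTENCE_END):
            yield buffer.strip()
            buffer = ""
    # Flush any trailing text that never reached a sentence boundary.
    if buffer.strip():
        yield buffer.strip()


def stream_tts(
    tokens: Iterable[str], synthesize: Callable[[str], bytes]
) -> Iterator[bytes]:
    """Yield one audio chunk per sentence while text is still arriving."""
    for sentence in chunk_sentences(tokens):
        yield synthesize(sentence)


def fake_synthesize(text: str) -> bytes:
    """Hypothetical stand-in for a TTS engine: returns the text as bytes."""
    return text.encode("utf-8")


if __name__ == "__main__":
    incoming = ["Hello", " there", ".", " How", " are", " you", "?"]
    for audio in stream_tts(incoming, fake_synthesize):
        print(audio)
```

Because both functions are generators, the first audio chunk is available once the first sentence is complete, even though later text has not yet been produced. Splitting on sentence boundaries is only one possible trade-off: smaller chunks would start playback sooner but give the synthesizer less context.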