Im experiencing a latency difference between TTS-1 and TTS-1-Max

harpreet5 · November 19, 2025, 3:27pm

Is there a difference between both models in this regard?

InworldAI · November 19, 2025, 3:29pm

Hey Harpreet,

There is a difference. For inworld-tts-1, expect ~200-400ms time-to-first-chunk in streaming mode. Overall latency depends on text length, but we’re optimized for real-time applications. inworld-tts-max is slower but more expressive, right now it’s not ideal for real-time use cases.

Pro tip: Always use streaming for interactive experiences. The perceived latency is much better.

Topic		Replies	Views
Difference between TTS 1 vs TTS Max TTS tts-api , tts-models	1	28	November 6, 2025
What latency do you have when running through a full conversational pipeline? Not just TTS Runtime latency , runtime	1	18	November 19, 2025
About the TTS category TTS	2	13	November 20, 2025
Do you guys offer volume discounts? TTS tts	1	17	November 18, 2025
What audio formats does your model support? TTS tts	2	17	November 18, 2025

Im experiencing a latency difference between TTS-1 and TTS-1-Max

Related topics