Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk

By Topline Newsroom

6 hours ago2 min readSource: www.marktechpost.com

Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk

The Inworld AI's new model conditions on full audio context, not just transcripts — a meaningful architectural shift for voice-first AI agents The post Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk appeared first on MarkTechPost .

From the source

News Hub News Hub Premium Content Read our exclusive articles Facebook Instagram X Home Open Source/Weights AI Agents Tutorials Voice AI Robotics Promote with us News Hub Home Open Source/Weights AI Agents Tutorials Voice AI Robotics Promote with us Home Technology Artificial Intelligence Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to... Technology Artificial Intelligence Language Model Audio Language Model Editors Pick New Releases Staff Voice AI Voice AI has a dirty secret: most of it was never designed for conversation. The dominant paradigm — feed text in, get audio out — traces its lineage to audiobook narration and voiceover production, where the model never hears the person on the other end. That s fine when you re generating a podcast intro. It s not f

Inworld AI is calling that out directly with the launch of Realtime TTS-2, a new voice model released as a research preview via its Inworld API and Inworld Realtime API. The model hears the full audio of the exchange, picks up the user s tone, pacing and emotional state, then takes voice direction in plain English the way developers prompt an LLM.

The meaningful architectural distinction with TTS-2 is that it operates as a closed-loop system. The model takes the actual audio of the prior turns of the exchange as input, not just a transcript — it hears how the user actually sounded. That s a non-trivial difference. A transcript of okay, fine gives you the words. The audio of okay, fine tells you whether the person is relieved, resigned, or sarcastic. TTS-2 is designed to use that signal.

Who and what

Key names and topics in this story: Inworld AI Launches Realtime, Closed, Loop Voice Model That, Adapts.

Where to follow next

Read the full piece at www.marktechpost.com
More from our AI & prompts coverage

#ai#inworld-ai-launches-realtime#closed#loop-voice-model-that#adapts

Apple plans to make iOS 27 a Choose Your Own Adventure of AI models

With Apple's latest operating system updates, users will reportedly have their pick of which third-party AI models they want to use for a host of tasks.

Build a Modular Skill-Based Agent System for LLMs with Dynamic Tool Routing in Python

In this tutorial, we build a complete skill-based agent system for large language models and explore how modular capabilities can be structured like an operating system for AI agents. We define reusable skills, attach metadata and schemas to them, register them in a central regis

Pennsylvania sues Character.AI after a chatbot allegedly posed as a doctor

According to Pennsylvania's filing, a Character.AI chatbot presented itself as a licensed psychiatrist during a state investigation, and also fabricated a serial number for its state medical license.

SAP bets $1.16B on 18-month-old German AI lab and says yes to NemoClaw

SAP plans to buy German AI startup Prior Labs and invest heavily in it. It is also prohibiting customers' agents use to a select few like Nvidia's NemoClaw.