🧵 This week in conversational AI:
More and more viral fails are popping up in drive-thru voice AI. And now Taco Bell is rethinking its pilot after the infamous 18,000-water order.
My hot take: the first wave of providers shipped without proper observability and skipped realistic stress testing. At Coval, one of the hardest problems we’ve solved is running ultra-realistic drive-thru simulations before you ever hit production. So if you want to avoid becoming the next viral headline… Taco Bell (and everyone else) 👉 let’s talk.
HappyRobot raises $44M Series B, bringing total funding to $62M. Already powering DHL, Ryder, and Werner, and pushing supply chain work from manual to autonomous. Congrats Pablo Palafox & team!!!
Congrats on the launch of Pipecat TV, Kwindla Hultman Kramer & team! They just released their very first podcast episode. Stay tuned for tutorials and community projects on YouTube! I'll link their first episode in the comments.
ElevenLabs introduces SFX v2 to create even better sound effects via prompt: higher fidelity, seamless looping, and longer-lasting effects! Big step for creative teams building with audio.
Cartesia named a 2025 IA40 winner, joining the ranks of top AI companies shaping the future of intelligent applications.
This week, Rime, ConverseNow.AI, Pipecat, and Coval hosted an event on TTS pronunciation challenges. Here are the main takeaways:
👉 In voice AI, expectations are sky-high: people want STT/TTS to be sharper than humans, especially with names, menus, and medical terms. Success depends not just on the model, but on being close to the audio source and making smart protocol choices to avoid unnecessary disfluencies. A lot of teams also stressed modular architectures and probabilistic evaluation --> the goal is to be able to adapt, measure, and improve continuously! --> We'll share a recording of the event soon!