Sanas Acquires Tomato.ai to Embed Voice AI Deep in Global Networks
- $62M ARR: Sanas has grown to $62 million in annual recurring revenue since 2023, with projections to surpass $130M.
- $100M+ Funding: The company has raised over $100 million in total funding, including a $65M Series B round.
- 3 Acquisitions: Sanas has completed three acquisitions in under two years, with Tomato.ai being the latest.
Experts would likely conclude that Sanas's acquisition of Tomato.ai is a strategic move to solidify its position as a leader in real-time speech AI, embedding its technology deeply into global communication infrastructure for scalability and reliability.
Sanas Acquires Tomato.ai to Embed Voice AI Deep in Global Networks
PALO ALTO, CA – April 15, 2026 – In a strategic move to reshape the foundations of digital communication, real-time Speech AI platform Sanas today announced its acquisition of Tomato.ai, a company specializing in advanced voice transformation. The deal, for which financial terms were not disclosed, marks a significant step in Sanas's mission to move speech intelligence from a surface-level application to a core component of global communications infrastructure.
The acquisition brings Tomato.ai’s technology and its co-founder, serial entrepreneur Ofer Ronen, into the Sanas fold. Ronen will now spearhead Sanas’s global telecom strategy, aiming to drive the company’s AI deeper into carrier and communications platform ecosystems. This is the third acquisition Sanas claims in less than two years, signaling an aggressive strategy fueled by substantial venture capital and rapid revenue growth.
The Infrastructure Play: Rewiring Communication's Core
Unlike many AI companies offering standalone software or application-level tools, Sanas is pursuing a more fundamental goal: embedding a real-time speech processing layer directly into the networks that power communication. The company's vision is not to be another app on a device, but to become an invisible, essential part of the system itself.
"Voice sits at the center of how enterprises and platforms operate, yet the technology supporting real-time communication has lagged behind," said Sharath Keshava Narayana, CEO and Co-founder of Sanas, in the official announcement. "Bringing Tomato.ai into Sanas accelerates our ability to make speech work reliably and consistently anywhere communication happens at scale."
This infrastructure-first approach is critical for environments where scale, reliability, and low-latency performance are non-negotiable. By operating at the carrier and platform level, Sanas aims to ensure its speech enhancement, accent transformation, and language understanding technologies perform flawlessly under the high-volume, real-world conditions of global telecom networks.
Ofer Ronen, who will now lead Sanas's expansion into these networks, echoed this sentiment. "A lot of speech AI is built as a feature you add on top," Ronen stated. "At the carrier and platform level, it has to work as part of the system itself. With Sanas, the opportunity is to embed real‑time voice intelligence directly into the environments where communication already runs, so it performs reliably under real‑world conditions." This philosophy aligns with a growing trend seen with other infrastructure-focused companies like Telnyx and SoundHound AI, who are also working to integrate AI more deeply into carrier services to ensure performance and reduce latency.
An Aggressive Strategy Fueled by Growth and Talent
The acquisition of Tomato.ai is the latest move in what appears to be a period of hyper-growth for Sanas. Since going to market in 2023, the company reports it has grown from zero to $62 million in annual recurring revenue (ARR) and is on a trajectory to surpass $130 million. This rapid expansion is backed by over $100 million in total funding, including a recent $65 million Series B round led by Quadrille Capital, with participation from major players like Insight Partners and strategic investors Teleperformance and Alorica.
While Sanas has stated this is its third acquisition in under two years, public records have yet to detail the other two transactions, suggesting a strategy that may involve quiet, technology-focused acquisitions. However, the current deal is as much about acquiring talent as it is about technology.
The addition of Ofer Ronen is a significant strategic win. A seasoned entrepreneur, Ronen has founded and successfully exited three prior companies—two acquired by Google and one by IAC. His seven-year tenure at Google included leading contact center AI initiatives within the Area 120 incubator, giving him deep expertise in large-scale voice AI deployments and relationships across the global carrier landscape. His track record aligns perfectly with Sanas's ambition to become the default speech layer for the world's largest communication platforms.
Technology That Transforms and Integrates
At the heart of the deal is Tomato.ai's specialized technology in "zero-shot, real-time voice transformation." This allows for sophisticated speech modification on the fly, without needing prior training on a specific voice. The company’s primary offerings included generative AI-powered accent softening—which clarifies speech while preserving the speaker's natural voice and cadence—and advanced noise cancellation.
Before the acquisition, Tomato.ai had already established its technology as production-ready, offering easy-to-install integrations for platforms like Zoom, Microsoft Teams, Google Meet, and Genesys Cloud CX. This proven ability to deploy seamlessly within existing enterprise and VoIP environments provides Sanas with a crucial accelerator for its own platform integrations.
The goal is to create a future where voice is no longer a variable to be managed but a tool to be shaped. "Real-time speech is becoming something builders can actively shape rather than work around," said Shawn Zhang, Co-founder and CTO of Sanas. "Tomato.ai's technology and experience move us faster toward a future where speech behaves predictably and intelligently inside the systems people depend on."
Beyond the Call Center: New Frontiers in Gaming and Media
While the immediate applications in enterprise communications and call centers are clear, the combined power of Sanas and Tomato.ai points toward a future far beyond the office. The platform's low-latency capabilities are particularly suited for the demanding environments of online gaming, live streaming, and interactive media.
The global speech and voice recognition market is projected to grow to nearly $50 billion by 2030, and much of that expansion will come from new consumer-facing applications. For gamers, this technology could mean real-time voice modulation to match an in-game persona or crystal-clear communication with teammates, regardless of background noise. For live streamers and content creators, it offers the ability to deliver studio-quality audio from any environment.
By building its technology at the infrastructure level, Sanas is positioning itself to power these next-generation use cases. As communications platforms continue to extend AI capabilities to their developer and consumer ecosystems, the demand for a reliable, scalable, and intelligent speech layer will only grow. This acquisition ensures Sanas is not just a participant in that future, but one of the core architects building its foundation.
📝 This article is still being updated
Are you a relevant expert who could contribute your opinion or insights to this article? We'd love to hear from you. We will give you full credit for your contribution.
Contribute Your Expertise →