Workato Cuts AI Inference Costs 67% with DigitalOcean’s O... — DigitalOcean Holdings, Inc.

Event summary

Workato’s AI Research Lab migrated workloads to DigitalOcean’s inference-optimized platform, reducing inference costs by 67% to $0.77 per million tokens.
Throughput increased by 67% to 13,561 tokens per second per GPU, while Time-to-First-Token improved by 77% to 1,455 ms under high load.
DigitalOcean’s managed Kubernetes environment and NVIDIA Hopper GPUs enabled Workato to accelerate Time-to-Value from weeks to days.
The collaboration optimized distributed inference architecture, reducing redundant processing and improving price-performance by 33%.
Workato, with over 1 trillion tasks deployed since 2013, aims to scale enterprise AI agents with DigitalOcean’s infrastructure.

The big picture

Workato’s migration to DigitalOcean highlights the critical role of optimized cloud infrastructure in scaling AI workloads. As enterprises increasingly adopt agentic AI, the ability to reduce inference costs and improve performance will be key to maintaining competitive advantage. DigitalOcean’s vertically integrated approach positions it as a strategic partner for AI-native companies looking to balance operational efficiency with rapid innovation.

What we're watching

Inference Economics: How DigitalOcean’s cost-efficiency gains will impact Workato’s margins and competitive positioning in enterprise AI.
Scalability Challenges: Whether Workato can sustain its performance improvements as it scales AI agent deployments across 14,000+ applications.
Market Differentiation: The pace at which DigitalOcean can differentiate itself in the AI inference cloud space amid growing competition.

🍪 We use cookies

Cookie Preferences

🔒 Necessary Cookies

📊 Analytics Cookies

🎯 Marketing Cookies