Every Photo Can Talk: Banuba's New API and the Future of Video

📊 Key Data

99% cost reduction and 97% cut in production time for businesses using similar AI video technologies compared to traditional video shoots.
$12 million in funding raised by Banuba from investors like VP Capital and Larnabel Ventures.
Over nine years of experience in computer vision and augmented reality since Banuba's founding in 2016.

🎯 Expert Consensus

Experts would likely conclude that Banuba's AI Talking Photo API represents a significant advancement in democratizing video production, offering substantial cost and time savings while raising important ethical questions about synthetic media that the industry must address responsibly.

Sam Lidman

Sam Lidman: Daily

3 months ago

Every Photo Can Talk: Banuba's New API and the Future of Video

DUBAI, United Arab Emirates – February 26, 2026 – Augmented reality firm Banuba has unveiled a new developer tool that enables any static photograph to be transformed into a realistic, talking video, a move that stands to significantly lower the barrier to video production while intensifying the debate around synthetic media.

The company announced the launch of its AI Talking Photo API, a tool designed to let developers integrate AI-powered video generation into their own applications. With a single portrait and an audio file, the technology generates a video featuring natural facial movements, accurate lip-syncing, and subtle body motion, effectively creating a lifelike digital presenter without a camera, studio, or complex editing software.

“Our goal was to make talking photo generation practical, not experimental,” said Anton Liskevich, Co-Founder and Chief Product Officer at Banuba, in the company’s announcement. “Developers can now add high-quality avatar video features without building custom AI pipelines or sacrificing visual reliability.”

The Democratization of Content Creation

At its core, Banuba’s API is engineered for accessibility and quality. The company emphasizes that its models are trained to avoid the visual artifacts, distorted faces, and unnatural “puppet-like” movements that often plague AI-generated video. This focus on consistency and realism is aimed squarely at production use, where quality and user trust are paramount. The technology promises to deliver stable, convincing footage, even for videos extending beyond five minutes.

This push for quality opens the door to a wide array of practical applications that could reshape content strategies across industries. In marketing, businesses can now generate personalized video messages at scale, addressing individual customers by name or creating countless variations of an ad campaign with minimal overhead. The reported savings are substantial, with some businesses leveraging similar technologies claiming up to a 99% cost reduction and a 97% cut in production time compared to traditional video shoots.

Beyond marketing, the potential use cases are vast. Educational institutions can animate historical figures to bring lessons to life, while corporations can use AI avatars for employee onboarding and training modules, providing a consistent, 24/7 resource. In customer support, these digital personas can field thousands of queries simultaneously in multiple languages, freeing up human agents to handle more complex issues. By making video creation faster and cheaper, the technology empowers small businesses, educators, and individual creators to produce engaging content that was once the exclusive domain of well-funded studios.

Navigating a Crowded and Evolving Market

Banuba enters a rapidly growing and competitive market for AI video synthesis. The company will vie for developer attention against established players like Synthesia, which has a strong foothold in corporate training content, and HeyGen, which focuses on business communication and localization. Other competitors, such as D-ID and Magic Hour API, are also carving out niches by focusing on scalability, safety, or hyper-realism.

However, Banuba is no newcomer to the underlying technology. Founded in 2016 as an AI lab, the company has over nine years of experience in computer vision and augmented reality, backed by $12 million in funding from investors including VP Capital and Larnabel Ventures. Its extensive portfolio includes a Face AR SDK, a video editing SDK, and TINT, a virtual try-on platform used in retail. This deep expertise in facial tracking and modification provides a strong foundation for its claims of superior realism.

The company’s differentiator appears to be its deep-seated R&D culture, which it claims has allowed it to solve common problems like facial distortion and the uncanny valley effect. By leveraging dedicated neural networks trained in-house, Banuba aims to position its API not just as another tool, but as a more reliable and polished solution for developers building customer-facing products where visual integrity is non-negotiable.

The Double-Edged Sword of Synthetic Media

While the creative and commercial benefits are clear, the launch arrives at a time of heightened scrutiny over AI-generated content. The ability to create convincing videos of people saying things they never said carries significant ethical baggage. The same technology that can animate a historical figure for a classroom can also be used to create deepfakes for spreading misinformation, committing fraud, or generating non-consensual content.

Regulators and industry bodies are scrambling to keep pace. The European Union’s AI Act and China’s “Deep Synthesis” provisions now mandate that most AI-generated media be clearly labeled. In the United States, proposed legislation like the TAKE IT DOWN Act aims to combat the proliferation of non-consensual intimate imagery, whether real or synthetic. These developments highlight a growing global consensus that transparency is essential for maintaining trust in a digital ecosystem flooded with artificial content.

The challenge for companies like Banuba is to balance innovation with responsibility. Frameworks developed by organizations like the Partnership on AI (PAI) advocate for responsible practices, including watermarking and metadata to help identify synthetic media. The onus is increasingly on technology creators to build safeguards into their products and provide clear guidelines to prevent misuse.

Banuba's stated focus on “consistency and realism rather than novelty” for professional use may signal an awareness of these risks. By targeting enterprise and developer clients who have a vested interest in maintaining their brand reputation, the company may be able to foster more responsible use cases. Nevertheless, the release of another powerful synthetic media tool places further pressure on the industry to address the potential for harm. As these technologies become more accessible and realistic, the line between authentic and artificial blurs, forcing society to confront difficult questions about truth, identity, and the future of digital communication.

Event: Product Launch

Theme: AI Governance Artificial Intelligence Generative AI ESG

Sector: Software & SaaS AI & Machine Learning Venture Capital Education & Research

Product: ChatGPT Claude Gemini Copilot

Metric: Revenue EBITDA

UAID: 18463

Every Photo Can Talk: Banuba's New API and the Future of Video

The Democratization of Content Creation

Navigating a Crowded and Evolving Market

The Double-Edged Sword of Synthetic Media

Never miss what matters in your industry

🍪 We use cookies

Cookie Preferences

🔒 Necessary Cookies

📊 Analytics Cookies

🎯 Marketing Cookies