Shapefin

IBM Integrates Deepgram Voice AI into watsonx Orchestrate for Enhanced Enterprise Solutions

Share It:

IBM and Deepgram have announced a collaboration to integrate Deepgram’s speech-to-text and text-to-speech capabilities into IBM’s watsonx Orchestrate generative AI solution. This partnership positions Deepgram as IBM’s first voice partner for its generative AI offering.

The integration aims to meet client demands for high-performance, enterprise-grade transcription and real-time captioning. IBM will embed Deepgram’s technology directly into watsonx Orchestrate, enhancing its capabilities to automate operations and support the growing need for conversational AI.

Deepgram’s contribution includes advanced speech-to-text voice recognition, enabling users to interact with digital agents using natural speech. The system is designed to handle real-world audio conditions, such as background noise and diverse accents. It offers support for a wide range of languages and dialects, including various Arabic and Indian variants, alongside regional accents. Additional features include custom tuning options, real-time captioning, and natural-sounding speech output.

These technological advancements are expected to open new possibilities for automated customer care, call analysis, and voice-driven data entry across sectors like healthcare and finance.

Scott Stephenson, CEO and Co-Founder of Deepgram, commented on the development: “Voice is rapidly becoming the default interface between humans and technology, and enterprise deployments require a real-time platform that is accurate, low latency, and reliable at scale. By embedding Deepgram inside watsonx Orchestrate Agent Builder, IBM clients can build voice agents and voice-enabled workflows on top of a real-time foundation that has been developed and refined over more than a decade.”

Nick Holda, Vice President of AI Technology Partnerships at IBM, added: “Our watsonx Orchestrate integration powered by Deepgram APIs introduces new speech recognition and transcription capabilities to IBM clients, refining and modernizing their operations. This collaboration aims to help enterprise organizations accelerate their AI initiatives and reinforces IBM’s open ecosystem, bringing choice and cutting-edge voice technology to partners and customers.”

This collaboration underscores the increasing importance of voice interfaces in enterprise AI solutions. For IBM, it strengthens its portfolio of modern, flexible offerings. For Deepgram, it provides expanded access to new customers through a recognized enterprise partner, further solidifying its standing as a reliable, real-time voice platform built for large-scale enterprise use.

Deepgram provides a real-time API platform that supports the Voice AI economy, offering speech-to-text (STT), text-to-speech (TTS), and full speech-to-speech (STS) capabilities through its enterprise-grade runtime. The company reports over 200,000 developers utilize its voice-native foundational models, accessible via cloud APIs or as self-hosted/on-premises APIs. Deepgram highlights its accuracy, low latency, and pricing, having processed over 50,000 years of audio and transcribed over 1 trillion words.

IBM is a global provider of hybrid cloud and AI solutions and consulting expertise, serving clients in over 175 countries. The company focuses on helping clients leverage data insights, streamline business processes, and reduce costs. Its hybrid cloud platform and Red Hat OpenShift are utilized by governments and corporations in critical infrastructure areas, including financial services, telecommunications, and healthcare, for digital transformations. IBM’s innovations in AI, quantum computing, and industry-specific cloud solutions offer open and flexible options to its clients, supported by a commitment to trust, transparency, responsibility, inclusivity, and service.

Latest Posts