Blog - Company
Sep 13, 2024 | Read time 2 min

Transforming live broadcasts: Speechmatics' real-time ASR now on NVIDIA Holoscan for media

Speechmatics Team Meet the Team

Speechmatics is thrilled to announce that it is the first speech-to-text provider integrated into the NVIDIA Holoscan for Media platform.

Holoscan for Media is a software-defined platform that enables live video pipelines to run on the same infrastructure as AI with unprecedented flexibility and efficiency, leveraging an IP-based, cloud-native architecture.

The integration of Speechmatics’ industry-leading automatic speech recognition (ASR) capabilities, known for delivering unparalleled accuracy at low latency, with Holoscan for Media will set the stage for transformational change in how live captions are generated and delivered.

Real-time transcription is critical for live media, particularly in high-stakes environments like live sports programming, streaming services and network broadcasts. The ability to deliver accurate, low-latency captions enhances the viewing experience, helping ensure that all audiences, including those with hearing impairments, can engage with the content as it happens. Speechmatics’ integration into Holoscan for Media elevates this experience by providing high levels of accuracy at the lowest latency, essential for maintaining the quality and immediacy that live media demands.

Early adopters of Holoscan for Media include companies like ASG, Beamr, Comprimato, Lawo, Monks, Pebble, RED Digital Cinema, Sony Corporation, and Telestream. By incorporating Speechmatics' advanced ASR capabilities, these companies can now offer their audiences lightning-fast, highly accurate captions, significantly enhancing accessibility for their live media.

David Agmen-Smith, Director of Product at Speechmatics, shared his enthusiasm about the integration: "Speechmatics is delighted to extend our collaboration with NVIDIA and become the first speech-to-text provider on the software-defined NVIDIA Holoscan for Media platform. Our years of foundational research into speech AI have allowed us to lead the automatic speech recognition field in terms of accuracy, even at very low latency. This in turn has led to our wide industry recognition as a pre-eminent choice for automated real-time captions in live broadcast. The combination of Speechmatics with Holoscan for Media allows lightning-quick and highly accurate captions to be broadcast, enhancing viewer experiences." 

“With Holoscan for Media, NVIDIA is transforming live media production by enabling live AI processing to run on the same platform as other traditional broadcast applications. Speechmatics’ integration is the first exciting example, providing customers with advanced speech to text with very low latency in their live production.” said Guillaume Polaillon, product line manager, live media solutions at NVIDIA.

Furthermore, Speechmatics' support for over 50 languages helps ensure that content on Holoscan for Media is accessible to a global audience. This commitment to inclusivity not only broadens the reach of live broadcasts but also enhances the viewer experience across diverse demographics, making content more engaging and comprehensible worldwide.

This integration exemplifies Speechmatics’ commitment to innovation and its mission to "Understand Every Voice", empowering industries to leverage state-of-the-art AI technology in the latest evolution of real-time media applications.

"Speechmatics’ integration is the first exciting example, providing customers with advanced speech to text with very low latency in their live production."

Guillaume PolaillonProduct Line Manager, NVIDIA

Unlock the value of speech

Everything you need to deliver incredible voice-powered products and features, globally.