AI TRACK: Running OpenAI's Whisper Automatic Speech Recognition on a Live Video Transcoding Server

Streaming Summit

Originally Aired - Monday, April 15 | 10:15 AM - 10:45 AM PT

W110

Pass Required: Streaming Summit Pass

Don't have this pass? Register Now!

Create or Log in to myNAB Show to see Videos and Resources.

Videos Resources

Videos

Resources

Click to Join the Zoom Meeting

Log in to your myNAB Show to join the zoom meeting!

{{resource.title}}

Resources

{{resource.title}}

This Session Has Not Started Yet

Be sure to come back after the session starts to have access to session resources.

As video streaming workflows increasingly move from file-based to live, functions that previously could be done offline can pose technical challenges or be exceedingly expensive to implement. One such function is the requirement for subtitle creation and transcription for live video streams. In this demonstration, you will see how ten NETINT T1U VPUs installed in a 1RU server with a 96-core Ampere Altra Max processor running OpenAI’s Whisper ASR model can produce dozens of simultaneous ABR ladder transcodes using AV1, HEVC, or H.264.

Keywords: Streaming Media, Streaming Summit, Streaming Video, Dan Rayburn, OTT

Speakers

Alex Liu

Co-founder and COO

NETINT Technologies

Sean Varley

Chief Evangelist, VP, Business Development

Ampere

2024 NAB Show

AI TRACK: Running OpenAI's Whisper Automatic Speech Recognition on a Live Video Transcoding Server

Videos

Resources

{{video.title}}

Resources

This Session Has Not Started Yet

Speakers