OpenAI Releases Whisper Transcription Action On Zapier

This is part of my live-learning series! I will be updating this post as I continue through my journey. I apologize for any grammatical errors or incoherent thoughts. This is a practice to help me share things that are valuable without falling apart from the pressure of perfection.

YouTube Video

Episode Summary

In this video, I walk you through the powerful new action added by OpenAI on Zapier: the Whisper Transcription Action.

It uses OpenAI’s Whisper API for speech-to-text conversion, bringing OpenAI’s transcription technology right to your fingertips.

Not only does it allow you to create transcriptions from audio and video files up to 25 MB, but it also supports multiple languages and formats.
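If you want to catch problem files before they reach the Zap, a small pre-check can help. This is only a rough sketch, not anything shipped by Zapier or OpenAI: the function name `preflight` is made up for illustration, the 25 MB limit is assumed to mean 25 * 1024 * 1024 bytes, the extension list is my reading of OpenAI's Whisper API documentation and may change, and the language check only confirms the code looks like a two-letter ISO 639-1 code (it does not check that the code is one Whisper supports).

```python
from pathlib import PurePath
from typing import List, Optional

# Assumption: the 25 MB limit is interpreted as 25 * 1024 * 1024 bytes.
MAX_BYTES = 25 * 1024 * 1024

# Assumption: extensions as listed in OpenAI's Whisper API docs; verify before relying on this.
SUPPORTED_EXTENSIONS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}


def preflight(filename: str, size_bytes: int, language: Optional[str] = None) -> List[str]:
    """Hypothetical helper: return a list of problems (empty means the file looks OK)."""
    problems = []

    # Compare the lowercased file extension against the supported set.
    ext = PurePath(filename).suffix.lower().lstrip(".")
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'none'}")

    # Reject files larger than the assumed upload limit.
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes, limit is {MAX_BYTES}")

    # Shape check only: two lowercase ASCII letters, e.g. "en" or "fr".
    if language is not None:
        looks_valid = (
            len(language) == 2
            and language.isascii()
            and language.isalpha()
            and language.islower()
        )
        if not looks_valid:
            problems.append(f"language should be a two-letter ISO 639-1 code, got {language!r}")

    return problems


if __name__ == "__main__":
    print(preflight("interview.mp3", 10 * 1024 * 1024, "en"))
    print(preflight("talk.wav", 26 * 1024 * 1024, "english"))
```

An empty list means the file passed all three checks. Anything else is a readable reason you could log or send back to whoever uploaded the file.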

Watch as I explore the potential of this action to revolutionize your workflow, manage data more effectively, and integrate seamlessly with other tools like Google Docs and Trello. Don’t forget to like, comment, subscribe, and share your thoughts on how you plan to use this innovative addition to Zapier!

Timestamps:

0:00 Introduction
1:32 Context on Whisper API and its capabilities
3:08 Different voice models and benchmarks
4:36 Setting up the Whisper Transcription Action on Zapier
7:13 Testing action and output formats
8:54 Creating a Google Doc from transcription output
10:19 Importance of workflow enablement and potential disruption
11:45 Conclusion and invitation for feedback

Resources:

Zapier on Twitter: “Zapier’s @OpenAI integration just shipped a new action: Create Transcription

List of ISO 639-1 codes – Wikipedia
https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes

Introducing ChatGPT and Whisper APIs
https://openai.com/blog/introducing-chatgpt-and-whisper-apis

Pricing
https://openai.com/pricing

GitHub – openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
https://github.com/openai/whisper

Get transcription, research, data analysis and NLP software from Speak Ai

Turn unstructured language data into competitive insights with transcription and NLP

Embeddable Audio & Video Recorder – Speak Ai

Introducing Nova: World’s Most Powerful Speech-to-Text API – Deepgram Blog ⚡️
https://blog.deepgram.com/nova-speech-to-text-whisper-api/

Meet the World’s Most Powerful Speech-to-Text API: Deepgram Nova. – YouTube

AK on Twitter: “Whisper JAX This repository contains optimized JAX code for OpenAI’s Whisper Model, largely built on the 🤗 Hugging Face Transformers Whisper implementation.

GitHub – sanchit-gandhi/whisper-jax
https://github.com/sanchit-gandhi/whisper-jax

Whisper JAX – a Hugging Face Space by sanchit-gandhi
https://huggingface.co/spaces/sanchit-gandhi/whisper-jax

Yohei on Twitter: “I’m still proud of the fact that I introduced LLMs to NoCode w this Zapier integration – which I built without permission (this eventually became the official integration). This was pre-ChatGPT!” / Twitter

OpenAI And Zapier Release No Code Integration For GPT and DALL-E! – YouTube

Speak Ai Integrations | Connect Your Apps with Zapier
https://zapier.com/apps/speak-ai/integrations

OpenAI (GPT-3 & DALL·E) Integrations | Connect Your Apps with Zapier
https://zapier.com/apps/openai/integrations

Whisper API FAQ | OpenAI Help Center
https://help.openai.com/en/articles/7031512-whisper-api-faq
