OpenAI Releases Whisper Transcription Action On Zapier

This is is part of my live-learning series! I will be updating this post as I continue through my journey. I apologize for any grammatical errors or incoherent thoughts. This is a practice to help me share things that are valuable without falling apart from the pressure of perfection. 

YouTube Video

Episode Summary

In this video, I walk you through the powerful new action added by OpenAI on Zapier: the Whisper Transcription Action.

Leveraging their Whisper API for highly accurate speech-to-text conversions, this action brings OpenAI’s cutting-edge technology for transcription right to your fingertips.

Not only does it allow you to create transcriptions from audio and video files up to 25 MB, but it also supports multiple languages and formats.

Watch as I explore the potential of this action to revolutionize your workflow, manage data more effectively, and integrate seamlessly with other tools like Google Docs and Trello. Don’t forget to like, comment, subscribe, and share your thoughts on how you plan to use this innovative addition to Zapier!

Timestamps:

0:00 Introduction
1:32 Context on Whisper API and its capabilities
3:08 Different voice models and benchmarks
4:36 Setting up the Whisper Transcription Action on Zapier
7:13 Testing action and output formats
8:54 Creating a Google Doc from transcription output
10:19 Importance of workflow enablement and potential disruption
11:45 Conclusion and invitation for feedback

Resources:

Zapier on Twitter: “Zapier’s @OpenAI integration just shipped a new action: Create Transcription
https://twitter.com/zapier/status/1649454241990180866?s=19

List of ISO 639-1 codes – Wikipedia
https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes

Introducing ChatGPT and Whisper APIs
https://openai.com/blog/introducing-chatgpt-and-whisper-apis

Pricing
https://openai.com/pricing

GitHub – openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
https://github.com/openai/whisper

Get transcription, research, data analysis and NLP software from Speak Ai
https://speakai.co/

Embeddable Audio & Video Recorder – Speak Ai
https://speakai.co/embeddable-audio-video-recorder/

Introducing Nova: World’s Most Powerful Speech-to-Text API – Deepgram Blog ⚡️
https://blog.deepgram.com/nova-speech-to-text-whisper-api/

Meet the World’s Most Powerful Speech-to-Text API: Deepgram Nova. – YouTube

AK on Twitter: “Whisper JAX This repository contains optimized JAX code for OpenAI’s Whisper Model, largely built on the 🤗 Hugging Face Transformers Whisper implementation.
https://twitter.com/_akhaliq/status/1649807089861050369

GitHub – sanchit-gandhi/whisper-jax
https://github.com/sanchit-gandhi/whisper-jax

Whisper JAX – a Hugging Face Space by sanchit-gandhi
https://huggingface.co/spaces/sanchit-gandhi/whisper-jax

Yohei on Twitter: “I’m still proud of the fact that I introduced LLMs to NoCode w this Zapier integration – which I built without permission (this eventually became the official integration). This was pre-ChatGPT!” / Twitter
https://twitter.com/yoheinakajima/status/1649568148084109313

OpenAI And Zapier Release No Code Integration For GPT and DALL-E! – YouTube

Speak Ai Integrations | Connect Your Apps with Zapier
https://zapier.com/apps/speak-ai/integrations

OpenAI (GPT-3 & DALL·E) Integrations | Connect Your Apps with Zapier
https://zapier.com/apps/openai/integrations

Whisper API FAQ | OpenAI Help Center
https://help.openai.com/en/articles/7031512-whisper-api-faq

More To Explore

How To Succeed In Capitalism

I share some thoughts on working within the capitalistic system successfully while maintaining your sanity and some meaning in life.

Share This Post

Join My Personal Newsletter ❤

Get insights and resources into awareness, well-being, productivity, technology, psychedelics and more.