YouTube Video
Episode Summary
In this video, I walk you through the powerful new action added by OpenAI on Zapier: the Whisper Transcription Action.
Leveraging their Whisper API for highly accurate speech-to-text conversions, this action brings OpenAI’s cutting-edge technology for transcription right to your fingertips.
Not only does it allow you to create transcriptions from audio and video files up to 25 MB, but it also supports multiple languages and formats.
Watch as I explore the potential of this action to revolutionize your workflow, manage data more effectively, and integrate seamlessly with other tools like Google Docs and Trello. Don’t forget to like, comment, subscribe, and share your thoughts on how you plan to use this innovative addition to Zapier!
Timestamps:
0:00 Introduction
1:32 Context on Whisper API and its capabilities
3:08 Different voice models and benchmarks
4:36 Setting up the Whisper Transcription Action on Zapier
7:13 Testing action and output formats
8:54 Creating a Google Doc from transcription output
10:19 Importance of workflow enablement and potential disruption
11:45 Conclusion and invitation for feedback
Resources:
Zapier on Twitter: “Zapier’s @OpenAI integration just shipped a new action: Create Transcription
https://twitter.com/zapier/status/1649454241990180866?s=19
List of ISO 639-1 codes – Wikipedia
https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes
Introducing ChatGPT and Whisper APIs
https://openai.com/blog/introducing-chatgpt-and-whisper-apis
Pricing
https://openai.com/pricing
GitHub – openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
https://github.com/openai/whisper
Get transcription, research, data analysis and NLP software from Speak Ai
https://speakai.co/
Embeddable Audio & Video Recorder – Speak Ai
https://speakai.co/embeddable-audio-video-recorder/
Introducing Nova: World’s Most Powerful Speech-to-Text API – Deepgram Blog ⚡️
https://blog.deepgram.com/nova-speech-to-text-whisper-api/
Meet the World’s Most Powerful Speech-to-Text API: Deepgram Nova. – YouTube
AK on Twitter: “Whisper JAX This repository contains optimized JAX code for OpenAI’s Whisper Model, largely built on the 🤗 Hugging Face Transformers Whisper implementation.
https://twitter.com/_akhaliq/status/1649807089861050369
GitHub – sanchit-gandhi/whisper-jax
https://github.com/sanchit-gandhi/whisper-jax
Whisper JAX – a Hugging Face Space by sanchit-gandhi
https://huggingface.co/spaces/sanchit-gandhi/whisper-jax
Yohei on Twitter: “I’m still proud of the fact that I introduced LLMs to NoCode w this Zapier integration – which I built without permission (this eventually became the official integration). This was pre-ChatGPT!” / Twitter
https://twitter.com/yoheinakajima/status/1649568148084109313
OpenAI And Zapier Release No Code Integration For GPT and DALL-E! – YouTube
Speak Ai Integrations | Connect Your Apps with Zapier
https://zapier.com/apps/speak-ai/integrations
OpenAI (GPT-3 & DALL·E) Integrations | Connect Your Apps with Zapier
https://zapier.com/apps/openai/integrations
Whisper API FAQ | OpenAI Help Center
https://help.openai.com/en/articles/7031512-whisper-api-faq



