yuvraj108c/ComfyUI-Whisper

ComfyUI Whisper

Transcribe audio and add subtitles to videos using Whisper in ComfyUI. Supports multiple languages, prompt guidance, and multiple Whisper models.

demo-image

⭐ Support

If you like my projects and wish to see updates and new features, please consider supporting me. It helps a lot!

ComfyUI-Depth-Anything-Tensorrt ComfyUI-Upscaler-Tensorrt ComfyUI-Dwpose-Tensorrt ComfyUI-Rife-Tensorrt

ComfyUI-Whisper ComfyUI_InvSR ComfyUI-Thera ComfyUI-Video-Depth-Anything ComfyUI-PiperTTS

buy-me-coffees paypal-donation

Installation

Install via ComfyUI Manager

Usage

Load this workflow into ComfyUI

Models are auto-downloaded to /ComfyUI/models/stt/whisper

Supported Models

'tiny.en', 'tiny', 'base.en', 'base', 'small.en', 'small', 'medium.en', 'medium', 'large-v1', 'large-v2', 'large-v3', 'large', 'large-v3-turbo', 'turbo'
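
As a rough illustration of how the list above can be used, the sketch below validates a model name before it is handed to a node. The helper `check_model_name` is hypothetical and not part of this repository; the only outside fact it relies on is that the `.en` suffix marks Whisper's English-only variants.

```python
# Model names copied from the "Supported Models" list above.
SUPPORTED_MODELS = [
    "tiny.en", "tiny", "base.en", "base", "small.en", "small",
    "medium.en", "medium", "large-v1", "large-v2", "large-v3",
    "large", "large-v3-turbo", "turbo",
]


def check_model_name(name: str) -> dict:
    """Hypothetical helper: validate a name and flag English-only models."""
    if name not in SUPPORTED_MODELS:
        raise ValueError(
            f"Unsupported model {name!r}; choose one of {SUPPORTED_MODELS}"
        )
    # Names ending in ".en" are the English-only Whisper variants.
    return {"name": name, "english_only": name.endswith(".en")}


print(check_model_name("base.en"))
```

For non-English audio, a name without the `.en` suffix is the one to pick.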

Nodes

Apply Whisper

Transcribe audio and get timestamps for each segment and word.
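
To give a feel for what segment timestamps make possible, here is a minimal sketch that turns a list of segments into SRT subtitle text. The segment shape (dicts with `start`, `end` in seconds and `text`) is an assumption modeled on common Whisper output, and `format_timestamp` and `segments_to_srt` are hypothetical helpers; the node's actual output structure is not documented in this README and may differ.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list) -> str:
    """Render segments (assumed keys: start, end, text) as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}")
    return "\n\n".join(blocks) + "\n"


# Made-up example data, not real transcription output.
example_segments = [
    {"start": 0.0, "end": 1.5, "text": " Hello there."},
    {"start": 1.5, "end": 3.25, "text": " Welcome to ComfyUI."},
]
print(segments_to_srt(example_segments))
```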

Add Subtitles To Frames

Add subtitles to video frames. You can specify the font family, font color, and x/y position.
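
Placing subtitles on frames comes down to mapping each frame index to a time and finding the segment that covers it. The sketch below shows that lookup only, under the assumption that segments are dicts with `start`, `end` (seconds) and `text`; `active_text` is a hypothetical helper, and the node's real implementation (including text rendering) may work differently.

```python
def active_text(segments: list, frame_index: int, fps: float) -> str:
    """Return the subtitle text visible on a frame, or "" if none is active."""
    t = frame_index / fps  # time of this frame in seconds
    for seg in segments:
        # A segment is active from its start (inclusive) to its end (exclusive).
        if seg["start"] <= t < seg["end"]:
            return seg["text"]
    return ""


# Made-up example data for a 24 fps clip.
demo_segments = [
    {"start": 0.0, "end": 1.0, "text": "Hi"},
    {"start": 1.0, "end": 2.0, "text": "there"},
]
for frame in (0, 24, 48):
    print(frame, repr(active_text(demo_segments, frame, fps=24)))
```

Using a half-open interval keeps two back-to-back segments from both matching the frame that falls exactly on their shared boundary.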

Add Subtitles To Background (Experimental)

Add subtitles in a word-cloud style on blank frames.

Updates

27 August 2025

  • Merged #22 by @francislabountyjr: model patcher, support for more Whisper models, and ComfyUI model directory support
  • Merged #18 by @qy8502: Prompt Guidance support
  • Added support for the YRDZST Semibold font

2 May 2025

Credits

License

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
