This ComfyUI workflow generates short videos (up to 15 seconds) with the Grok model, each with an automatically synchronized audio track. At its core is the GrokVideoNode, which accepts either a text prompt alone (text-to-video) or a starting frame from LoadImage (image-to-video). The node handles inference and returns a ready-to-save clip that pairs the visuals with model-generated audio. SaveVideo then writes the result to disk as a standard video file, preserving the embedded audio when present.

The graph itself is minimal: an optional LoadImage feeds a starting frame into GrokVideoNode, which synthesizes motion and soundtrack from your prompt, your reference image, or both, and SaveVideo writes the output to a file. The 15-second cap keeps generation responsive and stays within the model’s capabilities. The result is a practical pipeline for rapid concepting, animating stills, or creating short social-ready clips without leaving ComfyUI.
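For readers who drive ComfyUI from a script, the wiring described above can be sketched as an API-format graph, where each node is an entry with a class_type and inputs, and a link is written as [source_node_id, output_index]. This is only an illustration under stated assumptions: the input names on GrokVideoNode (prompt, duration, seed, image), the one-second lower bound, and the helper name build_grok_video_graph are hypothetical, so check the actual node in your install before relying on them.

```python
import json

MAX_DURATION_SECONDS = 15  # cap stated in the workflow description


def build_grok_video_graph(prompt, image_filename=None, duration=8, seed=0):
    """Return an API-format ComfyUI graph as a plain dict.

    Assumption: the GrokVideoNode input names used here ("prompt",
    "duration", "seed", "image") are hypothetical placeholders.
    """
    if not prompt and image_filename is None:
        raise ValueError("provide a prompt, a starting image, or both")

    # Clamp the requested duration to the 15-second cap.
    # The lower bound of 1 second is an assumption for this sketch.
    duration = max(1, min(int(duration), MAX_DURATION_SECONDS))

    graph = {
        "1": {
            "class_type": "GrokVideoNode",
            "inputs": {"prompt": prompt, "duration": duration, "seed": seed},
        },
        "2": {
            "class_type": "SaveVideo",
            # ["1", 0] links this input to output 0 of node "1".
            "inputs": {"video": ["1", 0], "filename_prefix": "grok_video"},
        },
    }

    # Image-to-video: add LoadImage and connect it as the starting frame.
    # Text-to-video: leave LoadImage out of the graph entirely.
    if image_filename is not None:
        graph["3"] = {
            "class_type": "LoadImage",
            "inputs": {"image": image_filename},
        }
        graph["1"]["inputs"]["image"] = ["3", 0]

    return graph


if __name__ == "__main__":
    # A 20-second request is clamped to 15 before it reaches the node.
    text_only = build_grok_video_graph("a paper boat in the rain", duration=20)
    print(json.dumps(text_only, indent=2))
```

Keeping LoadImage out of the graph for text-to-video mirrors leaving it disconnected in the editor, and passing the same seed, prompt, duration, and image across calls yields the same graph each time.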

Frequently Asked Questions

For text-to-video, enter your prompt in GrokVideoNode and leave LoadImage disconnected. For image-to-video, load a still in LoadImage and connect it to GrokVideoNode so the model uses the image as the starting frame.

This workflow is designed for clips up to 15 seconds, matching the Grok model’s intended range. Keep your duration at or under 15 seconds for reliable results.

GrokVideoNode produces a video with synchronized audio, and SaveVideo preserves it when saving. If you need custom audio, export the video and replace the soundtrack in your video editor. The provided workflow does not include an audio import/replace node.

If GrokVideoNode exposes a seed or randomness control, set and reuse the same value across runs. Also keep prompts, duration, and the starting image (for image-to-video) unchanged to maximize repeatability.
