Skip to content

Web UI Guide

Detailed introduction to the Pixelle-Video Web interface features.


Interface Layout

The Web interface uses a three-column layout:

  • Left Panel: Content input and audio settings
  • Middle Panel: Voice and visual settings
  • Right Panel: Video generation and preview

System Configuration

First-time use requires configuring LLM and image generation services. See Configuration Guide.


Content Input

Generation Mode

  • AI Generate Content: Enter a topic, AI creates script automatically
  • Fixed Script Content: Enter complete script directly

Background Music

  • Built-in music supported
  • Custom music files supported

Voice Settings

TTS Workflow

  • Select TTS workflow
  • Supports Edge-TTS, Index-TTS, etc.

Reference Audio

  • Upload reference audio for voice cloning
  • Supports MP3/WAV/FLAC formats

Visual Settings

Image Generation

  • Select image generation workflow
  • Set image dimensions
  • Adjust prompt prefix to control style

Video Template

  • Choose video template
  • Supports portrait/landscape/square
  • Preview template effects

Generate Video

After clicking "Generate Video", the system will:

  1. Generate video script
  2. Generate images for each scene
  3. Synthesize voice narration
  4. Compose final video

Automatically previews when complete.