viral-video-generator

CLI tool for generating unique photo, video, and speech prompts using LLM (Gemini/OpenAI). With --gen flag, also generates actual media files (images via Gemini, videos via KlingAI). With --tts flag, generates speech audio files via Gemini TTS. With --enhance or --enhanceRun, post-processes generated videos with FFmpeg.

Video prompts follow TikTok uniqueness rules to pass content fingerprinting.

Requirements

Install these tools before using the CLI:

bun.js — required to install dependencies, run the CLI, run tests, and build releases
ffmpeg — required for video enhancement features such as --enhance, --enhanceRun, and scripts/enhance-video.sh

Quick verification:

bun --version
ffmpeg -version

Install

bun install

Usage

Generate photo prompts

viral --photo --prompt "cozy coffee shop aesthetic" -n 5

Generate video prompts (TikTok unique)

viral --video --prompt "productivity tips for developers" -n 3

Use Ollama for local LLM inference

# Make sure Ollama is running locally
viral --photo --prompt "nature landscapes" -n 5 --provider ollama --model llama3.2

Generate prompts AND media files

# Generate photo prompts and images
viral --photo --prompt "sunset beach" -n 3 --gen

# Generate video prompts and videos
viral --video --prompt "cooking tutorial" -n 2 --gen

# Generate video prompts, videos, and enhanced MP4 outputs via FFmpeg
viral --video --prompt "cooking tutorial" -n 2 --gen --enhance --enhancePreset vertical

Enhance all generated videos for an existing run

viral --enhanceRun run-2026-05-09T12-30-00-000Z-video --enhancePreset social

Generate text-to-speech audio (Gemini only)

viral --tts --prompt "Welcome to our channel! Today we explore productivity." -n 1
viral --tts --prompt "Top 5 tips to stay focused." -n 3 --voice Puck

Options

Option	Description
`--photo`	Generate photo prompts
`--video`	Generate video prompts with TikTok uniqueness rules
`--tts`	Generate text-to-speech audio files (Gemini only)
`--gen`	Also generate actual media (images via Ollama/Gemini, videos via KlingAI or Gemini Veo)
`--enhance`	After `--video --gen`, enhance each generated video with FFmpeg
`--enhanceRun`	Enhance every `video-*.mp4` file inside `./output/<runId>`
`--enhancePreset`	FFmpeg enhancement preset: `social`, `vertical`, `clean`, `stabilize`
`--enhanceCrf`	FFmpeg CRF value for enhanced videos (default: `18`)
`--enhanceSpeed`	FFmpeg encoder speed preset (default: `medium`)
`--enhanceOverwrite`	Overwrite existing `*-enhanced.mp4` outputs
`--prompt`, `-p`	Base topic/idea for generation (or exact text for `--tts`)
`-n`	Number of prompts / audio files to generate (default: 5)
`--provider`	LLM provider: `gemini`, `openai`, or `ollama` (default: gemini)
`--model`	Model name for prompts (default: gemini-2.5-flash)
`--voice`	Voice for TTS: `Aoede`, `Charon`, `Fenrir`, `Kore`, `Leda`, `Orus`, `Puck`, `Zephyr` (default: Kore)
`--outDir`	Output directory for run logs (default: ./runs)
`--help`, `-h`	Show help
`--version`, `-v`	Show version

Environment Variables

Variable	Description
`GEMINI_API_KEY`	API key for Google Gemini (prompts, images, TTS, and Veo video)
`GEMINI_API_SECRET`	API secret for Google Gemini (if required)
`GEMINI_MODEL`	Model for LLM prompt generation (default: `gemini-3.1-pro-preview`)
`OPENAI_API_KEY`	API key for OpenAI (prompts only)
`OLLAMA_BASE_URL`	Base URL for Ollama server (default: http://localhost:11434)
`STABILITY_API_KEY`	API key for Stability AI (required for Ollama + image generation)
`STABILITY_API_URL`	Stability AI endpoint or local SD server (optional)
`NANO_BANANA_MODEL`	Model for image generation (default: `gemini-3.1-flash-image-preview`)
`VEO_MODEL`	Model for Gemini Veo video generation (default: `veo-3.1-generate-preview`)
`GEMINI_TTS_MODEL`	Model for TTS audio generation (default: `gemini-3.1-flash-tts-preview`)
`KLINGAI_API_KEY`	API key for KlingAI (video generation)
`KLINGAI_API_SECRET`	API secret for KlingAI
`KLINGAI_VIDEO_MODEL`	Model for KlingAI video (default: `kling-video-o1`)

Output

Prompts only: JSON array of prompts to stdout
With --gen: JSON with prompts and media arrays
With --enhance: JSON also includes enhancedMedia and optional enhancementFailures
With --tts: WAV files (PCM 16-bit, 24 kHz, mono) saved to ./output/<runId>/speech-N.wav
Run logs: Saved to ./runs/ directory with format run-{timestamp}-{photo|video|tts}.json
Media files: Saved to ./output/run-{timestamp}-{type}/ directory

Run (dev)

bun run cli.ts --photo --prompt "your topic" -n 5

Run tests

bun test

Build releases

bun run build:all

Documentation

Core docs live in docs/gemini/
FFmpeg post-processing guide: research/ffmpeg-video-enhancement-guide.md
UX/UI dashboard page: ui/index.html

Enhance generated videos with FFmpeg

bun run video:enhance -- --input ./output/run-example/video-1.mp4 --preset social

Enhance generated videos from the CLI

bun run cli.ts --video --prompt "cooking tutorial" -n 2 --gen --enhance --enhancePreset social
bun run cli.ts --enhanceRun run-2026-05-09T12-30-00-000Z-video --enhancePreset vertical

Available presets in scripts/enhance-video.sh:

social
vertical
clean
stabilize

Run the UI page locally

cd /home/daochild/dev/viral-content-generation-cli/ui
python3 -m http.server 8080

Open http://localhost:8080/index.html.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github		.github
docs/gemini		docs/gemini
research		research
scripts		scripts
src		src
tests		tests
ui		ui
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md
build.ts		build.ts
bun.lock		bun.lock
bunfig.toml		bunfig.toml
cli.ts		cli.ts
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

viral-video-generator

Requirements

Install

Usage

Generate photo prompts

Generate video prompts (TikTok unique)

Use Ollama for local LLM inference

Generate prompts AND media files

Enhance all generated videos for an existing run

Generate text-to-speech audio (Gemini only)

Options

Environment Variables

Output

Run (dev)

Run tests

Build releases

Documentation

Enhance generated videos with FFmpeg

Enhance generated videos from the CLI

Run the UI page locally

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

viral-video-generator

Requirements

Install

Usage

Generate photo prompts

Generate video prompts (TikTok unique)

Use Ollama for local LLM inference

Generate prompts AND media files

Enhance all generated videos for an existing run

Generate text-to-speech audio (Gemini only)

Options

Environment Variables

Output

Run (dev)

Run tests

Build releases

Documentation

Enhance generated videos with FFmpeg

Enhance generated videos from the CLI

Run the UI page locally

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages