- Existing frontend: complete Vite+React+TS frontend at `e:\VFXB\VFXB - Studio` with 50+ components, a full design system, and all UI screens — NO backend wired up yet
- User decisions: keep Vite/React frontend | new separate monorepo at `e:\vfxb\` | FastAPI heavy + simple BFF | MVP = NLP Edit Engine

Build the full backend infrastructure (FastAPI + worker + DB + storage + AI) in a new Turborepo monorepo at `e:\vfxb\`. Wire the existing Vite/React frontend to it. MVP priority is the NLP edit engine — the user types English, the AI cuts the video.
- Frontend: existing Vite+React stays; `e:\VFXB\VFXB - Studio` becomes `apps/web` in the new monorepo (symlink or copy)
- API layer: FastAPI (Python) handles all endpoints. "Both" interpreted as: simple endpoints in FastAPI routers (auth, CRUD) + heavy async processing in a Redis worker. No Next.js app needed.
- Auth: Supabase Auth (JWT) — frontend uses the `@supabase/supabase-js` SDK, backend validates tokens
- State: Zustand added to the existing Vite frontend (replaces the prop-drilling + custom-events pattern)
- Run `npx create-turbo@latest vfxb --package-manager pnpm` at `e:\`
- Move/copy existing `VFXB - Studio` into `vfxb/apps/web` (keep Vite config intact)
- Create `apps/api/` (FastAPI) and `apps/worker/` (RQ processor) directories
- Create `packages/types/` for shared TS types consumed by the frontend
- Create root `.env.example` with all required keys (Supabase, OpenAI, Anthropic, Google AI, AssemblyAI, Cloudflare R2, Redis, Stripe)
- Create `docker-compose.yml` for local dev: web:5173, api:8000, worker, redis:6379
- Create `turbo.json` with build/dev/test/lint pipelines
Files:
- `e:\vfxb\turbo.json`
- `e:\vfxb\pnpm-workspace.yaml`
- `e:\vfxb\.env.example`
- `e:\vfxb\docker-compose.yml`
- `e:\vfxb\apps/web/` (existing Studio codebase)
- `e:\vfxb\apps/api/main.py` (FastAPI entry)
- `e:\vfxb\apps/worker/worker.py` (RQ entry)
- Create Supabase project → enable Email + Google + GitHub OAuth
- Run SQL schema in the Supabase editor: tables `users`, `videos`, `edits`, `chat_messages`, `exports`, `subscriptions`
- Enable RLS on all tables with per-user policies
- Create `apps/api/db/schema.sql` with the full DDL
- Create `apps/api/db/client.py`: Supabase Python client with the service key
- Implement FastAPI auth middleware `apps/api/middleware/auth.py` — validates the Supabase JWT, extracts `user_id`, injects it into request state
- Create FastAPI routers: `apps/api/routers/users.py` (GET /me, PATCH /me)
- Wire the existing `AuthScreen.tsx` to real Supabase Auth: install `@supabase/supabase-js`, replace the mock `handleAuthenticate` with Supabase `signInWithPassword` + `signInWithOAuth`
- Create `apps/web/src/lib/supabase.ts`: singleton Supabase client
- Create `apps/web/src/store/auth.ts`: Zustand auth store with `user`, `session`, `signIn`, `signOut`
- Protect routes in `App.tsx` using real session state
- FastAPI dependency: `async def get_current_user(token: str = Depends(oauth2_scheme))` → decode the Supabase JWT
- Frontend: `supabase.auth.onAuthStateChange()` → update the Zustand store
- RLS policy pattern: `CREATE POLICY "owner" ON videos FOR ALL USING (auth.uid() = user_id)`
Files:
- `apps/api/db/schema.sql`
- `apps/api/db/client.py`
- `apps/api/middleware/auth.py`
- `apps/api/routers/users.py`
- `apps/web/src/lib/supabase.ts`
- `apps/web/src/store/auth.ts` (new Zustand store)
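The JWT check at the heart of `apps/api/middleware/auth.py` can be sketched with only the standard library. This assumes the project uses Supabase's HS256 JWT secret (newer projects may use asymmetric keys); in practice you would call a maintained library such as PyJWT instead:

```python
import base64
import hashlib
import hmac
import json
import time


def _b64url_decode(seg: str) -> bytes:
    # JWT segments are base64url-encoded without padding
    return base64.urlsafe_b64decode(seg + "=" * (-len(seg) % 4))


def verify_supabase_jwt(token: str, secret: str) -> dict:
    """Verify an HS256 JWT and return its claims dict.

    Raises ValueError on a bad signature or an expired token.
    The middleware would read claims["sub"] as the Supabase user_id.
    """
    header_b64, payload_b64, sig_b64 = token.split(".")
    signing_input = f"{header_b64}.{payload_b64}".encode()
    expected = hmac.new(secret.encode(), signing_input, hashlib.sha256).digest()
    if not hmac.compare_digest(expected, _b64url_decode(sig_b64)):
        raise ValueError("invalid signature")
    claims = json.loads(_b64url_decode(payload_b64))
    if claims.get("exp", 0) < time.time():
        raise ValueError("token expired")
    return claims
```

In the FastAPI dependency above, a ValueError would translate into `HTTPException(status_code=401)`.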
- Create Cloudflare R2 buckets: `vfxb-videos` + `vfxb-exports`
- Configure R2 CORS for browser uploads, set lifecycle rules
- Implement `apps/api/routers/upload.py`:
  - `POST /api/upload/presigned`: validates user quota, creates a video DB record (status=uploading), returns an S3-presigned URL + video_id
  - `POST /api/upload/complete`: sets status=processing, enqueues the `analyze_video` job
  - `POST /api/upload/from-url`: yt-dlp download → R2 upload → video record
- Implement quota check: free = 3 videos, pro = unlimited (read from `users.plan`)
- Wire `VFXBUploadScreen.tsx` to real upload:
  - Replace mock progress with a real TUS upload (`tus-js-client`)
  - On complete: call `/api/upload/complete`, navigate to the studio view with the real video_id
- Create `apps/web/src/store/video.ts`: Zustand video store (currentVideo, uploadProgress, isProcessing)
Files:
- `apps/api/routers/upload.py`
- `apps/api/services/storage.py` (R2 client wrapper)
- `apps/web/src/store/video.ts`
- `apps/web/src/hooks/useVideoUpload.ts`
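The quota rule above can be sketched framework-free; the `User` dataclass and `PermissionError` here are stand-ins for the real DB row and FastAPI's `HTTPException`:

```python
from dataclasses import dataclass

FREE_VIDEO_LIMIT = 3  # from the plan: free = 3 videos, pro = unlimited


@dataclass
class User:
    id: str
    plan: str  # mirrors the users.plan column: "free" or "pro"


def check_upload_quota(user: User, current_video_count: int) -> None:
    """Raise if a free-plan user has hit the video limit.

    In the real router this would be a FastAPI Depends() that counts
    the user's rows in the videos table before issuing a presigned URL.
    """
    if user.plan == "pro":
        return  # unlimited
    if current_video_count >= FREE_VIDEO_LIMIT:
        raise PermissionError("Free plan limit reached (3 videos): upgrade to Pro")
```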
- Set up Redis Queue: `apps/worker/queue.py` — job types: analyze_video, apply_edit, export_video
- Step A — Transcription (`steps/transcribe.py`):
  - Submit audio to AssemblyAI with word timestamps + speaker diarization + filler detection
  - Poll until complete, store in `videos.transcript` + `videos.analysis.words`
- Step B — Visual Analysis (`steps/visual_analysis.py`):
  - `ffmpeg -vf fps=1` to extract frames
  - Sample every 5th frame → GPT-4o Vision → `{has_face, motion_level, engagement_potential...}`
  - Store in `videos.analysis.visual_data`
- Step C — Audio Analysis (`steps/audio_analysis.py`):
  - librosa: detect silences >1.5s, dead zones, SNR, BPM
  - Store `{silences: [...], avg_energy, background_noise_db}` in `videos.analysis`
- Step D — Virality Scoring (`steps/virality_score.py`):
  - Send all analysis to Claude 3.5 Sonnet with a structured system prompt
  - Returns `{overall_score, grade, factors, top_issues, predicted_retention_curve}`
  - Store in `videos.score_breakdown` + `videos.virality_score`, set status=analyzed
- Wire Dashboard to real data: `GET /api/videos?user_id=me` → replace mock stats with a DB query
- Wire `VideoIntelligenceCard` + `AIDirectorPanel` to real analysis data from the video store
Files:
- `apps/worker/queue.py`
- `apps/worker/steps/transcribe.py`
- `apps/worker/steps/visual_analysis.py`
- `apps/worker/steps/audio_analysis.py`
- `apps/worker/steps/virality_score.py`
- `apps/api/routers/videos.py` (CRUD + analysis results)
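Step C's silence detection can be sketched over a precomputed per-frame RMS envelope (in the real `steps/audio_analysis.py` librosa would supply the envelope; the `threshold` value here is an illustrative assumption):

```python
def detect_silences(energy, frame_seconds, threshold=0.01, min_duration=1.5):
    """Return [(start_s, end_s), ...] runs where energy stays below
    `threshold` for at least `min_duration` seconds.

    `energy` is a per-frame RMS envelope (e.g. librosa.feature.rms output)
    and `frame_seconds` is the hop length in seconds.
    """
    silences, run_start = [], None
    for i, e in enumerate(energy):
        if e < threshold:
            if run_start is None:
                run_start = i  # a quiet run begins
        elif run_start is not None:
            start, end = run_start * frame_seconds, i * frame_seconds
            if end - start >= min_duration:
                silences.append((round(start, 2), round(end, 2)))
            run_start = None
    if run_start is not None:  # handle a silence running to the end
        start, end = run_start * frame_seconds, len(energy) * frame_seconds
        if end - start >= min_duration:
            silences.append((round(start, 2), round(end, 2)))
    return silences
```

The resulting list is what would land in `videos.analysis` as `silences: [...]`.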
- NLP Parser (`apps/api/services/nlp_parser.py`):
  - Takes the user message + full video context (transcript, silences, analysis, score)
  - Sends to Claude 3.5 Sonnet with the VFXB system prompt
  - Returns structured `{understood_intent, edits[], response_message, needs_confirmation}`
- FFmpeg Executor (`apps/worker/services/ffmpeg_executor.py`):
  - Implement: `cut_silence`, `trim_range`, `change_speed`, `add_captions`, `platform_export`, `color_grade`, `extract_best_clip`
  - Each function: download from R2 → run FFmpeg → upload output to R2 → return URL
  - Safety: validate all paths, sanitize filenames, enforce max duration
- Chat API (`apps/api/routers/chat.py`):
  - `POST /api/chat/message`: load video context → load last 10 messages → route to the correct AI agent → parse edit intent → stream SSE response
  - SSE events: `{type: "text", content}`, `{type: "edit_plan", plan}`, `{type: "done"}`
  - `POST /api/chat/confirm-edit`: if confirmed → queue apply_edit job
  - `GET /api/chat/history/:video_id`: last 50 messages
- Edit Status WebSocket (`apps/api/routers/websocket.py`):
  - `WS /ws/edit-status/:job_id` → polls Redis job status → pushes `{status, progress, output_url}`
- Wire frontend (existing `BeautifulAIChat.tsx` / `ChatInput.tsx`):
  - Add `apps/web/src/hooks/useChatStream.ts`: EventSource wrapper for SSE
  - Add `apps/web/src/store/chat.ts`: Zustand chat store (messages, isStreaming, pendingEdit)
  - Replace mock responses in `BeautifulAIChat` with the real SSE stream
  - Show edit_plan cards with Confirm/Cancel in `ChatThread`
  - Add a progress-bar message type during edit jobs
  - Connect the WebSocket for real-time edit progress
- Validate video ownership before any edit (user must own the video)
- Sanitize all FFmpeg path arguments (no shell injection)
- Rate limit the chat endpoint: 20 req/min per user (Redis sliding window)
Files:
- `apps/api/services/nlp_parser.py`
- `apps/api/routers/chat.py`
- `apps/api/routers/websocket.py`
- `apps/worker/services/ffmpeg_executor.py`
- `apps/web/src/store/chat.ts`
- `apps/web/src/hooks/useChatStream.ts`
- `apps/web/src/hooks/useEditStatus.ts`
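The executor's safety rules (sanitize filenames, no shell injection, enforce max duration) might look like the sketch below. Building an argv list and never passing `shell=True` is what actually prevents injection; the `silenceremove` filter values are illustrative defaults, not the plan's final settings:

```python
import re

_UNSAFE = re.compile(r"[^A-Za-z0-9._-]")


def sanitize_filename(name: str) -> str:
    """Drop directory components and replace shell-unsafe characters."""
    name = name.replace("\\", "/").rsplit("/", 1)[-1]  # strip any path part
    name = _UNSAFE.sub("_", name)
    return name.lstrip(".") or "file"  # no hidden or empty names


def build_cut_silence_cmd(src: str, dst: str, max_duration_s: int = 3600) -> list:
    """Build an ffmpeg argv list for the silenceremove audio filter.

    The worker would run this with subprocess.run(cmd, check=True);
    because it is a list, no shell ever interprets the arguments.
    """
    return [
        "ffmpeg", "-y",
        "-i", src,
        "-af", "silenceremove=stop_periods=-1:stop_duration=1.5:stop_threshold=-40dB",
        "-t", str(max_duration_s),  # enforce the max-duration safety rule
        dst,
    ]
```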
- Install Zustand in `apps/web`: replace prop-drilling and `window.dispatchEvent` patterns
- Create `apps/web/src/store/ui.ts` (activeFeaturePanel, comparisonMode, beforeUrl, afterUrl)
- Wire `VideoPreview.tsx` to real R2 URLs from the video store
- Wire `Dashboard.tsx` to real API data (stats, recent videos)
- Wire `ExportShareModal.tsx` to the real export API + platform OAuth
- Wire `EditHistoryPage.tsx` to `GET /api/edits?video_id=...`
- Add skeleton loading states (already uses shadcn/ui Skeleton)
- Add a VITE_API_URL env var to the Vite config, update all API calls
Files:
- `apps/web/src/store/ui.ts`
- `apps/web/vite.config.ts` (proxy to backend in dev)
- `apps/web/.env.example`
- Create Stripe products: Free ($0), Pro ($29/mo or $19/mo billed yearly)
- Implement `apps/api/routers/stripe.py`:
  - `POST /api/stripe/create-checkout`: creates a Stripe Checkout session
  - `POST /api/stripe/webhook`: handles `checkout.session.completed`, `customer.subscription.deleted`, `invoice.payment_failed`
  - `GET /api/stripe/portal`: opens the Customer Portal
- Enforce plan limits in a FastAPI dependency middleware
- Wire `UpgradePage.tsx` to real Stripe checkout
Files:
- `apps/api/routers/stripe.py`
- `apps/api/middleware/plan_check.py`
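For reference, the webhook route's signature check boils down to an HMAC over `"<timestamp>.<raw body>"`. In practice `apps/api/routers/stripe.py` would call the official SDK's `stripe.Webhook.construct_event`, which does the same thing; this stdlib sketch just shows the mechanism:

```python
import hashlib
import hmac
import time


def verify_stripe_signature(payload: bytes, sig_header: str, secret: str,
                            tolerance_s: int = 300) -> bool:
    """Check a Stripe-Signature header against the raw request body.

    The header has the form "t=<unix-ts>,v1=<hex hmac>"; the signed
    payload is "<t>.<body>" keyed with the webhook signing secret.
    """
    parts = dict(p.split("=", 1) for p in sig_header.split(","))
    timestamp, candidate = parts["t"], parts["v1"]
    if abs(time.time() - int(timestamp)) > tolerance_s:
        return False  # reject replays outside the tolerance window
    signed_payload = timestamp.encode() + b"." + payload
    computed = hmac.new(secret.encode(), signed_payload, hashlib.sha256).hexdigest()
    return hmac.compare_digest(computed, candidate)
```

A failed check should return HTTP 400 before any subscription state is touched.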
- Autonomous Publishing Agent (`apps/api/services/autonomous_agent.py`): multi-step job chain (analyze → fix → caption → export → YouTube API upload)
- Creator DNA (`apps/api/services/creator_dna.py`): extract style patterns after 3+ videos, inject into all future Claude prompts
- Audience Simulation (`apps/api/services/simulation.py`): Claude retention-curve prediction, SVG chart in the frontend
- Platform Optimizer (`apps/worker/services/platform_optimizer.py`): per-platform resize, pacing, captions
- Real-time Collaboration (`apps/api/routers/collab.py`): Supabase Realtime, invite system, presence indicators
- Enterprise API (`apps/api/routers/enterprise.py`): API-key auth, rate limits per tier
- `apps/api/Dockerfile` (python:3.12-slim + ffmpeg)
- `apps/worker/Dockerfile` (python:3.12-slim + ffmpeg + yt-dlp)
- Deploy frontend to Vercel (or keep Vite → deploy to Cloudflare Pages)
- Deploy API + worker to Railway
- GitHub Actions CI/CD: test → build → deploy
- Upload a real MP4 → verify R2 storage + DB record created
- Analysis pipeline completes → virality score appears in the UI with real data
- Type "remove silences" in chat → edit plan shown → confirm → FFmpeg job runs → new video appears with before/after comparison
- Auth flow: sign up → email verification → dashboard loads the user's videos
- Stripe checkout: upgrade to Pro → plan updates in the DB → video limit removed
- `pytest apps/api/tests/` (FastAPI unit tests)
- Rate-limit test: the 21st chat message within a minute returns 429
- No Next.js needed for the frontend (user keeps Vite). "Both" backend choice = simple FastAPI routers + async workers
- New monorepo at `e:\vfxb\` — existing VFXB Studio files move to `apps/web`
- MVP = NLP edit engine (Phase 4) — Phases 0-3 are blockers and must run sequentially first
- FFmpeg runs only in the worker container, never in the main API process
- All AI calls use streaming where possible (SSE for chat responses)