This project provides a PyTorch implementation of a video super-resolution model. A short sliding window of frames is processed to produce a high-resolution output (e.g. 4K). Training requires only high-quality videos; low-resolution inputs are created on the fly through random degradations, so no paired low/high-resolution dataset is needed. The architecture combines modern techniques for spatial detail and temporal consistency:
- Spatio‑Temporal Encoding with 3D convolutions over a sequence of frames.
- U‑Net Style Decoder with pixel shuffle upsampling.
- Transformer Bottleneck for non‑local feature interactions.
- Deformable Convolution blocks for frame alignment (a structural sketch of these components appears after this list).
- Stable Diffusion Refinement using a pretrained SDXL model.
- CLIP‑Based Semantic Loss in addition to perceptual and temporal losses (a minimal loss sketch follows below).
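A minimal sketch of how these components might fit together is shown below. Module names, channel widths and the exact wiring are illustrative assumptions rather than the actual video_sr.py architecture, and the U‑Net skip connections are omitted for brevity; the block only demonstrates the listed techniques: 3D convolutional encoding, deformable alignment to the center frame, a transformer bottleneck over spatial tokens, and pixel-shuffle upsampling.

```python
import torch
import torch.nn as nn
from torchvision.ops import DeformConv2d

class AlignBlock(nn.Module):
    """Aligns a neighbor frame's features to the reference (center) frame."""
    def __init__(self, ch):
        super().__init__()
        # Offsets for the 3x3 deformable kernel are predicted from the
        # concatenated neighbor/reference feature pair.
        self.offset = nn.Conv2d(ch * 2, 2 * 3 * 3, 3, padding=1)
        self.deform = DeformConv2d(ch, ch, 3, padding=1)

    def forward(self, neighbor, reference):
        off = self.offset(torch.cat([neighbor, reference], dim=1))
        return self.deform(neighbor, off)

class VideoSRSketch(nn.Module):
    def __init__(self, ch=64):
        super().__init__()
        # Spatio-temporal encoding: 3D convolutions over the frame window.
        self.encoder = nn.Sequential(
            nn.Conv3d(3, ch, 3, padding=1), nn.ReLU(),
            nn.Conv3d(ch, ch, 3, padding=1), nn.ReLU(),
        )
        self.align = AlignBlock(ch)
        # Transformer bottleneck for non-local interactions over
        # flattened spatial tokens.
        layer = nn.TransformerEncoderLayer(d_model=ch, nhead=4, batch_first=True)
        self.bottleneck = nn.TransformerEncoder(layer, num_layers=2)
        # Decoder tail: two pixel-shuffle x2 stages for x4 total upscaling.
        self.decoder = nn.Sequential(
            nn.Conv2d(ch, ch * 4, 3, padding=1), nn.PixelShuffle(2), nn.ReLU(),
            nn.Conv2d(ch, ch * 4, 3, padding=1), nn.PixelShuffle(2), nn.ReLU(),
            nn.Conv2d(ch, 3, 3, padding=1),
        )

    def forward(self, frames):  # frames: (B, 3, T, H, W)
        feats = self.encoder(frames)             # (B, C, T, H, W)
        ref = feats[:, :, feats.size(2) // 2]    # center-frame features
        # Deformable alignment of every frame to the center, then fuse.
        aligned = torch.stack(
            [self.align(feats[:, :, t], ref) for t in range(feats.size(2))]
        ).mean(dim=0)                            # (B, C, H, W)
        b, c, h, w = aligned.shape
        tokens = self.bottleneck(aligned.flatten(2).transpose(1, 2))
        fused = tokens.transpose(1, 2).reshape(b, c, h, w)
        return self.decoder(fused)               # (B, 3, 4H, 4W)

# Smoke test: a 5-frame 64x64 window upscaled x4 to 256x256.
out = VideoSRSketch()(torch.randn(1, 3, 5, 64, 64))
print(out.shape)  # torch.Size([1, 3, 256, 256])
```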
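The CLIP-based semantic term could, for example, penalize the distance between CLIP image embeddings of the network output and the ground truth. The sketch below uses OpenAI's clip package with the ViT‑B/32 checkpoint; the model variant and how this term is weighted against the perceptual and temporal losses in video_sr.py are assumptions here.

```python
import torch
import torch.nn.functional as F
import clip  # pip install git+https://github.com/openai/CLIP.git

device = "cuda" if torch.cuda.is_available() else "cpu"
clip_model, _ = clip.load("ViT-B/32", device=device)
clip_model.eval()
for p in clip_model.parameters():
    p.requires_grad_(False)  # CLIP stays frozen; gradients flow to the SR net

# CLIP's published input normalization constants.
_MEAN = torch.tensor([0.48145466, 0.4578275, 0.40821073], device=device).view(1, 3, 1, 1)
_STD = torch.tensor([0.26862954, 0.26130258, 0.27577711], device=device).view(1, 3, 1, 1)

def clip_semantic_loss(sr, hr):
    """1 - cosine similarity between CLIP embeddings of SR and HR frames.

    sr, hr: (B, 3, H, W) tensors with values in [0, 1].
    """
    def embed(x):
        # Resize to CLIP's 224x224 input and apply its normalization.
        x = F.interpolate(x, size=(224, 224), mode="bilinear", align_corners=False)
        return clip_model.encode_image((x - _MEAN) / _STD)

    return (1.0 - F.cosine_similarity(embed(sr), embed(hr), dim=-1)).mean()
```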
The main training script is video_sr.py. It expects a directory of high-resolution video files (e.g. MP4, AVI) rather than pre-extracted PNG frames. Low-resolution inputs are generated internally using random downscaling, blur and noise, so no separate low-quality dataset is needed.
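As an illustration, such a degradation pipeline might look like the sketch below; the specific blur kernel, noise range and interpolation choices are assumptions, not the exact recipe used by video_sr.py. Applying the same random parameters to all frames of a clip keeps the degradation temporally consistent.

```python
import random
import torch
import torch.nn.functional as F
import torchvision.transforms.functional as TF

def degrade(hr_frames, scale=4):
    """Create a low-resolution input clip from high-resolution frames.

    hr_frames: (T, 3, H, W) tensor with values in [0, 1].
    Returns a (T, 3, H // scale, W // scale) tensor.
    """
    x = hr_frames
    # Random Gaussian blur, shared across the whole clip.
    if random.random() < 0.7:
        x = TF.gaussian_blur(x, kernel_size=7, sigma=random.uniform(0.2, 2.0))
    # Downscale with a randomly chosen interpolation mode.
    mode = random.choice(["bilinear", "bicubic", "area"])
    h, w = x.shape[-2:]
    kwargs = {} if mode == "area" else {"align_corners": False}
    x = F.interpolate(x, size=(h // scale, w // scale), mode=mode, **kwargs)
    # Additive Gaussian noise.
    if random.random() < 0.7:
        x = x + torch.randn_like(x) * random.uniform(0.0, 0.05)
    return x.clamp(0.0, 1.0)
```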
Run training with:

```bash
python video_sr.py
```

Parameters such as window size, scaling factor and batch size can be adjusted at the bottom of the script.
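For example, that block might look like the following; the variable names here are hypothetical, so check the bottom of video_sr.py for the real ones.

```python
# Hypothetical names -- see the bottom of video_sr.py for the actual ones.
WINDOW_SIZE = 5   # consecutive frames fed to the network per sample
SCALE = 4         # upscaling factor, e.g. 960x540 -> 3840x2160
BATCH_SIZE = 2    # clips per batch; reduce if GPU memory is tight
DATA_DIR = "path/to/hr_videos"  # directory of high-resolution videos
```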
Dependencies:

- PyTorch
- diffusers
- OpenAI CLIP
- torchvision
- OpenCV
Install the required packages with:

```bash
pip install torch torchvision diffusers git+https://github.com/openai/CLIP.git opencv-python
```

This project is released under the MIT License.