Skip to content
View SAGNIKMJR's full-sized avatar
:shipit:
Procrastinating
:shipit:
Procrastinating

Block or report SAGNIKMJR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025

Python 12 Updated Mar 16, 2025

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,364 52 Updated Jan 12, 2025

The best OSS video generation models

Python 3,054 324 Updated Jan 8, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,465 793 Updated Mar 12, 2025

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,167 445 Updated Mar 19, 2025

High-resolution models for human tasks.

Python 4,911 291 Updated Nov 18, 2024

[ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning".

Python 37 Updated Feb 24, 2025

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,590 147 Updated May 9, 2023

Inference code for Llama models

Python 57,962 9,719 Updated Jan 26, 2025

Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024

Python 51 4 Updated Mar 3, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,072 453 Updated Jan 22, 2025

The Most Faithful Implementation of Segment Anything (SAM) in 3D

Python 310 15 Updated Sep 11, 2024

Official inference repo for FLUX.1 models

Python 21,100 1,491 Updated Feb 6, 2025

A feature-rich command-line audio/video downloader

Python 106,045 8,320 Updated Mar 28, 2025

Command-line program to download videos from YouTube.com and other video sites

Python 134,893 10,265 Updated Mar 26, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,764 1,598 Updated Dec 25, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,716 426 Updated Mar 28, 2025

Learn LeetCode and prepare for coding interviews with free resources.

3,641 394 Updated Feb 14, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,739 780 Updated Aug 12, 2024

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 922 34 Updated Jan 21, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 6,077 645 Updated Sep 20, 2024

[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'

8 Updated Jun 16, 2024

A curated collections of papers related to speech, audio and music in CVPR 2024.

6 Updated Jun 15, 2024

Contrastive Language-Audio Pretraining

Python 1,576 157 Updated Nov 21, 2024

cuML - RAPIDS Machine Learning Library

C++ 4,573 564 Updated Mar 26, 2025

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Python 670 38 Updated May 14, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 950 39 Updated Mar 27, 2025

Official repository for the paper PLLaVA

Python 644 47 Updated Jul 28, 2024

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 265 25 Updated Sep 20, 2024

Texas DPS/DMV Automatic Scheduler

TypeScript 353 164 Updated Mar 25, 2025
Next
Showing results