Add video models + functions #814

dreadatour · 2025-01-13T16:58:56Z

Video models added

class VideoFile(File):
    """`DataModel` for reading video files."""

    def get_info(self) -> "Video":
        """Returns video file information."""

    def get_frame_np(self, frame: int) -> "ndarray":
        """Reads video frame from a file."""

    def get_frame(self, frame: int, format: str = "jpg") -> bytes:
        """Reads video frame from a file and returns as image bytes."""

    def save_frame(self, frame: int, output_file: str, format: Optional[str] = None) -> "VideoFrame":
        """Saves video frame as an image file."""

    def get_frames_np(self, start_frame: int = 0, end_frame: Optional[int] = None, step: int = 1) -> "Iterator[ndarray]":
        """Reads video frames from a file."""

    def get_frames(self, start_frame: int = 0, end_frame: Optional[int] = None, step: int = 1, format: str = "jpg") -> "Iterator[bytes]":
        """Reads video frames from a file and returns as bytes."""

    def save_frames(self, output_dir: str, start_frame: int = 0, end_frame: Optional[int] = None, step: int = 1, format: str = "jpg") -> "Iterator[VideoFrame]":
        """Saves video frames as image files."""

    def save_fragment(self, start_time: float, end_time: float, output_file: str) -> "VideoFragment":
        """Saves video interval as a new video file."""

    def save_fragments(self, intervals: list[tuple[float, float]], output_dir: str) -> "Iterator[VideoFragment]":
        """Saves video intervals as new video files."""


class VideoFragment(VideoFile):
    """`DataModel` for reading video fragments."""

    start: float = Field(default=-1.0)
    end: float = Field(default=-1.0)


class VideoFrame(VideoFile):
    """`DataModel` for reading video frames."""

    frame: int = Field(default=-1)
    timestamp: float = Field(default=-1.0)
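
As a rough illustration of how the `start_frame`, `end_frame`, and `step` parameters of `get_frames` / `save_frames` could select frames, here is a minimal sketch. The helper `frame_indices` and its `total_frames` argument are hypothetical and not part of this PR. Treating `end_frame` as exclusive is an assumption, since the description above does not state the bound convention.

```python
from typing import Iterator, Optional


def frame_indices(
    total_frames: int,
    start_frame: int = 0,
    end_frame: Optional[int] = None,
    step: int = 1,
) -> Iterator[int]:
    """Yield frame numbers to read (hypothetical helper, not part of the PR).

    Assumption: `end_frame` is exclusive and defaults to the end of the video.
    """
    if start_frame < 0:
        raise ValueError("start_frame must be non-negative")
    if step < 1:
        raise ValueError("step must be a positive integer")
    # Clamp the upper bound to the number of frames in the video.
    if end_frame is None or end_frame > total_frames:
        end_frame = total_frames
    yield from range(start_frame, end_frame, step)
```

Under these assumptions, `list(frame_indices(10, start_frame=2, end_frame=8, step=3))` selects frames 2 and 5.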

Meta models added

class Image(DataModel):
    """`DataModel` for image file meta information."""

    width: int = Field(default=-1)
    height: int = Field(default=-1)
    format: str = Field(default="")


class Video(DataModel):
    """`DataModel` for video file meta information."""

    width: int = Field(default=-1)
    height: int = Field(default=-1)
    fps: float = Field(default=-1.0)
    duration: float = Field(default=-1.0)
    frames: int = Field(default=-1)
    format: str = Field(default="")
    codec: str = Field(default="")


class Frame(DataModel):
    """`DataModel` for video frame image meta information."""

    frame: int = Field(default=-1)
    timestamp: float = Field(default=-1.0)
    width: int = Field(default=-1)
    height: int = Field(default=-1)
    format: str = Field(default="")
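
The `frame` and `timestamp` fields are related through the video's `fps`. The sketch below shows one way to derive a timestamp, assuming a constant frame rate. `FrameInfo` is a plain dataclass stand-in for the `Frame` model above (not a `DataModel`), and `frame_info` is a hypothetical helper. Reading the `-1` defaults as "unknown" is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class FrameInfo:
    """Stand-in for the `Frame` meta model (plain dataclass, not a `DataModel`)."""

    frame: int = -1  # assumed to mean "unknown", mirroring the defaults above
    timestamp: float = -1.0


def frame_info(frame: int, fps: float) -> FrameInfo:
    """Derive a frame's timestamp in seconds, assuming a constant frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return FrameInfo(frame=frame, timestamp=frame / fps)
```

Videos with a variable frame rate would need timestamps read from the container instead of computed this way.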

Usage examples

Can be found here: iterative/datachain-examples#28

codecov · 2025-01-13T17:11:06Z

Codecov Report

Attention: Patch coverage is 90.80460% with 16 lines in your changes missing coverage. Please review.

Project coverage is 87.81%. Comparing base (7f757b3) to head (4098e8b).
Report is 8 commits behind head on main.

Files with missing lines	Patch %	Lines
src/datachain/lib/video.py	88.42%	8 Missing and 6 partials ⚠️
src/datachain/lib/file.py	96.22%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #814      +/-   ##
==========================================
+ Coverage   87.74%   87.81%   +0.07%     
==========================================
  Files         129      130       +1     
  Lines       11462    11633     +171     
  Branches     1545     1567      +22     
==========================================
+ Hits        10057    10216     +159     
- Misses       1017     1023       +6     
- Partials      388      394       +6

Flag	Coverage Δ
datachain	`87.74% <90.80%> (+0.07%)`	⬆️

dmpetrov

Amazing PR!

It would be great to use concise and minimalistic naming and API because we are going to have many file types for multiple domains.

Naming

Keywords like Meta will make it hard for users to remember and use the classes - users have their own meta 🙂

How about this renaming:
VideoFile -> BaseVideo (I assume people won't use this often)
VideoMeta -> Video (the most used class)
VideoClip -> Clip (also, shouldn't it be based on Video with meta?)
VideoFrame -> FrameBase
VideoFrameMeta -> Frame

start_time --> start
end_time --> end
frames_count --> count

Image -> BaseImage
ImageMeta -> Image

FileTypes can also be extended: image (read meta), base_image (do not read meta), video (read meta), base_video (do not read meta), video_clip, base_video_clip, ...

Do we need dummy classes?

I assume that people prefer working with meta information while dealing with images and videos. A follow-up question: do we really need BaseImage and BaseVideo without any logic? Why don't we clean up the API and keep only the meta-enriched versions? Users can still work with videos as File if meta is not needed.

Do we need singular methods?

save_video_clips() and save_video_clip(): how much extra code does a user need if we drop the singular form? If a single method is enough, let's avoid the singular version.

The same question for video_frames() and video_frames_np()

I assume we can add methods and classes later if there is a need. But I'd not start with such a rich API for now, and I'd try my best to keep it minimalistic.

WDYT?
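
To make the "singular methods" question concrete, here is a minimal sketch of how a plural, iterator-returning method can cover the single-item case. The `save_fragments` function below is a hypothetical stand-in that only builds output paths. It does not read or write video, and the file naming scheme is an assumption made for illustration.

```python
from typing import Iterator


def save_fragments(
    intervals: list[tuple[float, float]], output_dir: str
) -> Iterator[str]:
    """Hypothetical stand-in: yields the output path for each interval.

    No video is read or written here; only the call shape is illustrated.
    """
    for index, (start, end) in enumerate(intervals):
        if start < 0 or end <= start:
            raise ValueError(f"invalid interval: ({start}, {end})")
        yield f"{output_dir}/fragment_{index:04d}.mp4"


# A single fragment is the one-element case of the plural method.
(single,) = save_fragments([(2.5, 5.0)], "clips")
```

The extra cost for the caller is wrapping the interval in a list and unpacking the one result.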

cloudflare-workers-and-pages · 2025-01-14T15:24:52Z

Deploying datachain-documentation with Cloudflare Pages

Latest commit:	`4098e8b`
Status:	✅ Deploy successful!
Preview URL:	https://4d43370e.datachain-documentation.pages.dev
Branch Preview URL:	https://video-models.datachain-documentation.pages.dev

dreadatour · 2025-01-14T16:01:28Z

Naming

Keywords like Meta will make it hard for users to remember and use the classes - users have their own meta 🙂

👍

How about this renaming:
VideoFile -> BaseVideo (I assume people won't use this often)
VideoMeta -> Video (the most used class)
VideoClip -> Clip (also, shouldn't it be based on Video with meta?)
VideoFrame -> FrameBase
VideoFrameMeta -> Frame

For now we have naming with File: TextFile, ImageFile and File itself. I left VideoFile as is for now, but renamed the others:

ImageMeta -> Image
VideoClipFile -> VideoClip (I can rename it to Clip as you suggested, I'm just not sure yet, see the next line)
VideoFrameFile -> VideoFrame (I could rename it to Frame to be consistent with Clip, but Frame is already taken, see below)
VideoMeta -> Video
VideoFrameMeta -> Frame

start_time --> start
end_time --> end
frames_count --> count

Done. Only frames_count became frames, because I am not sure about count, too general, IMO.

Image -> BaseImage
ImageMeta -> Image

We don't have an Image model, we have an ImageFile model, so I left it as is for now. ImageMeta -> Image is done.

FileTypes can also be extended: image (read meta), base_image (do not read meta), video (read meta), base_video (do not read meta), video_clip, base_video_clip, ...

That's a good suggestion, but for now we use FileTypes only in the from_storage method. I am not sure we want to change it to download files and read meta 🤔 Even with an additional param.

Do we need dummy classes?

I assume that people prefer working with meta information while dealing with images and videos. A follow-up question: do we really need BaseImage and BaseVideo without any logic? Why don't we clean up the API and keep only the meta-enriched versions? Users can still work with videos as File if meta is not needed.

Good question. I've added VideoFile only because we already have ImageFile, just to be consistent. Also it is useful when we use from_storage with type=video, and then we can use VideoFile type in mappers, like this:

def video_meta(file: "VideoFile") -> Video:
    """
    Returns video file meta information.

    Args:
        file (VideoFile): VideoFile object.

    Returns:
        Video: Video file meta information.
    """

Do we need singular methods?

save_video_clips() and save_video_clip(): how much extra code does a user need if we drop the singular form? If a single method is enough, let's avoid the singular version.
The same question for video_frames() and video_frames_np()

Sounds reasonable to me 👍 Will update the code (not done yet).

Default values

Done.

WDYT?

Those are great comments! Love the discussion ❤️

shcheklein · 2025-01-14T20:51:35Z

src/datachain/lib/file.py


    def save(self, destination: str):
        """Writes its content to destination"""
        self.read().save(destination)


+class Image(DataModel):


why do we need this separate model?

Same as for video info (Video model). I can remove it from this PR 🤔

it's just a bit weird that we have ImageFile and Image (that contains only some basic metadata) 🤔

It was VideoMeta (and ImageMeta) before, but Dmitry asked to rename these models here. I agree that having a Video (Image) model with just meta looks odd. I think you're right and we should inherit this model from VideoFile (ImageFile) to extend files with meta; then it will make sense. If not, what do you think about VideoInfo (and ImageInfo)?

shcheklein · 2025-01-28T03:58:09Z

src/datachain/lib/file.py

+
+        return video_frame(self, frame, format)
+
+    def save_frame(self, frame: int, output_file: str) -> "VideoFrame":


Does VideoFrame have information that requires it to be a File?

Good question. VideoFrame class looks like this:

class VideoFrame(VideoFile):
    """`DataModel` for reading video frames."""

    frame: int = Field(default=-1)
    timestamp: float = Field(default=-1.0)
    orig: File

It is inherited from VideoFile, which is inherited from File. I am thinking now this VideoFrame model should be inherited from ImageFile instead, because it is an image.

Also I am not sure TBH if we need orig: File here (also should it be orig: VideoFile?)

Also, what if we want a "virtual video frame" model? It would inherit from VideoFile, so the model holds the video file plus additional frame and timestamp fields. This way we don't need to store the frame in storage; we can use the original video file and extract the frame from it when needed.

There are many questions regarding the API and many possible answers, but at the same time it is way too much for this PR, and we need real-world use cases and examples to make it all work the best way.
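
The "virtual video frame" idea can be sketched as a record that references the original video and extracts the image only on demand. Everything below is hypothetical: `VirtualFrame`, its `read` method, and the `fake_decode` placeholder are illustrative names, and a plain dataclass stands in for a `DataModel`. A real implementation would pass a decoder backed by a video library.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class VirtualFrame:
    """Hypothetical "virtual frame": a reference into a video, not a stored image."""

    video_path: str  # the original video file; nothing extra is uploaded
    frame: int
    timestamp: float

    def read(self, decode: Callable[[str, int], bytes]) -> bytes:
        """Extract the frame on demand using a caller-supplied decoder."""
        return decode(self.video_path, self.frame)


def fake_decode(path: str, frame: int) -> bytes:
    """Placeholder decoder so the sketch runs without a video library."""
    return f"{path}#{frame}".encode()


ref = VirtualFrame(video_path="video.mp4", frame=30, timestamp=1.0)
image_bytes = ref.read(fake_decode)
```

The trade-off is storage versus compute: nothing is written per frame, but every read has to decode from the original video.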

dreadatour · 2025-01-28T14:29:22Z

@dreadatour is the PR description up to date?

It is now 👍

shcheklein · 2025-01-28T23:36:32Z

src/datachain/lib/file.py

+    codec: str = Field(default="")
+
+
+class Frame(DataModel):


I would suggest renaming those back to SomethingMeta if we keep this approach

it is super confusing - VideoFrame and Frame - which one is the main class?

cc @dmpetrov

dreadatour · 2025-01-31T05:54:34Z

In this Video Models PR, we now have models for:

VideoFile – based on the File model with additional methods added to work with video: get info, get frames, get fragments.
VideoFragment – based on the VideoFile model with additional fields: start time and end time.
VideoFrame – based on ImageFile with additional fields: frame number and timestamp.

There are some questions about this implementation:

When we are splitting VideoFile into fragments, we are uploading fragment video clips into storage and creating a new VideoFragment model with this uploaded video file. In this model, we have start time and end time signals, but there is no link by default to the original video file. Do we need this link (for example, an orig field with VideoFile type)? Do we always need this link? Do we need these signals (start and end time) without a link to the original video file?
Same for frames: when we are splitting VideoFile into frames, we are uploading frame images into storage and creating a new VideoFrame model with this uploaded image file. We do have frame and timestamp signals, but no link to the original video. Do we need these signals? Do we need the link to the original video?
What about virtual video fragments and lightweight frame models? It is an original video file model with additional signals (start and end time for fragments and timestamp for frames), but the original file is still the same—no physical file split and no upload happens. It looks like these virtual fragment and virtual frame models are required and will be used often, but we haven’t implemented them yet. Do we need them? How should we organize an API to work with these models? Do we need a set of additional methods in the VideoFile model class?
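
One way to picture the options raised in these questions is a single record with an optional link to the original video. This is only a sketch under stated assumptions: `FragmentRecord`, `path`, `orig`, and `is_virtual` are hypothetical names, and a plain dataclass stands in for a `DataModel`. It does not reflect how the PR or its follow-up resolved the design.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FragmentRecord:
    """Hypothetical fragment record; `path` and `orig` are illustrative names."""

    path: str  # file that holds the fragment's video data
    start: float = -1.0
    end: float = -1.0
    orig: Optional[str] = None  # link back to the source video, if kept

    @property
    def is_virtual(self) -> bool:
        """A virtual fragment points at the original file itself."""
        return self.orig is not None and self.path == self.orig


# Physical fragment: a new file was written, with a link to its source.
physical = FragmentRecord("clips/0001.mp4", start=2.0, end=4.5, orig="video.mp4")
# Virtual fragment: same file as the original, only the interval differs.
virtual = FragmentRecord("video.mp4", start=2.0, end=4.5, orig="video.mp4")
```

Without the `orig` link, `start` and `end` on a physical fragment describe an interval in a file the record can no longer identify, which is the concern behind the first question.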

dreadatour · 2025-02-06T12:59:15Z

Continue work in #890

Add video models + functions

75877d1

dreadatour requested a review from a team January 13, 2025 16:58

dreadatour self-assigned this Jan 13, 2025

dreadatour temporarily deployed to internal January 13, 2025 16:59 — with GitHub Actions Inactive

dreadatour linked an issue Jan 13, 2025 that may be closed by this pull request

Support Video file and Video clip, Video frame models and operations with them #797

Closed

dreadatour mentioned this pull request Jan 13, 2025

Support Video file and Video clip, Video frame models and operations with them #797

Closed

dmpetrov requested changes Jan 13, 2025

src/datachain/lib/file.py (outdated, resolved)

shcheklein reviewed Jan 14, 2025

pyproject.toml (resolved)

shcheklein reviewed Jan 14, 2025

src/datachain/lib/file.py (outdated, resolved)

shcheklein reviewed Jan 14, 2025

src/datachain/lib/file.py (outdated, resolved)

shcheklein reviewed Jan 14, 2025

src/datachain/lib/file.py (resolved)

shcheklein reviewed Jan 14, 2025

src/datachain/lib/video.py (resolved)

dreadatour commented Jan 14, 2025

src/datachain/lib/video.py (outdated, resolved)

Code review update

031b9df

dreadatour temporarily deployed to internal January 14, 2025 15:24 — with GitHub Actions Inactive

[pre-commit.ci] auto fixes from pre-commit.com hooks

548bbd5

for more information, see https://pre-commit.ci

pre-commit-ci bot temporarily deployed to internal January 14, 2025 15:25 Inactive

Code review update

b55149a

dreadatour temporarily deployed to internal January 14, 2025 15:28 — with GitHub Actions Inactive

shcheklein reviewed Jan 14, 2025

src/datachain/lib/video.py (outdated, resolved)

shcheklein reviewed Jan 14, 2025

src/datachain/lib/file.py (outdated, resolved)

shcheklein reviewed Jan 14, 2025

src/datachain/lib/file.py (outdated, resolved)

Code review update

2cd6d62

dreadatour temporarily deployed to internal January 15, 2025 01:48 — with GitHub Actions Inactive

Small fixes due to work on usage examples

5892ab9

dreadatour temporarily deployed to internal January 15, 2025 16:24 — with GitHub Actions Inactive

shcheklein reviewed Jan 28, 2025

src/datachain/lib/vfile.py (outdated, resolved)

shcheklein reviewed Jan 28, 2025

src/datachain/lib/file.py (outdated, resolved)

dreadatour added 3 commits January 28, 2025 19:38

Update video requirements

23514f7

Code review updates

8a8dd64

Merge branch 'main' into video-models

1a04dd0

dreadatour requested review from dmpetrov, shcheklein and mattseddon January 28, 2025 13:53

shcheklein reviewed Jan 28, 2025

src/datachain/lib/file.py (outdated, resolved)

dreadatour and others added 11 commits January 29, 2025 09:41

Merge branch 'main' into video-models

0c95c3d

Code review updates + tests

e55405d

Set up ffmpeg in tests

8e2a673

Set up ffmpeg in tests

9c910ec

Set up ffmpeg in tests

a2b8c9a

Update 'ensure_cached' test

63448d9

Revert 'ensure_cached' test

abe39f5

[pre-commit.ci] auto fixes from pre-commit.com hooks

3b7b829

for more information, see https://pre-commit.ci

Fix tests

55f0478

Fix tests

99b9490

Update video models

c28cd66

Merge branch 'main' into video-models

4098e8b

dreadatour mentioned this pull request Feb 3, 2025

Video models #890

Merged

dreadatour closed this Feb 6, 2025

dreadatour deleted the video-models branch February 6, 2025 16:33

dreadatour mentioned this pull request Mar 14, 2025

Improve models and file methods #966

Merged

