nemo_curator/stages/video/clipping/clip_extraction_stages.py
14 additions & 4 deletions
@@ -38,7 +38,7 @@ class ClipTranscodingStage(ProcessingStage[VideoTask, VideoTask]):
     software (libx264, libopenh264) and hardware (NVENC) encoding with configurable parameters.
 
     Args:
-        num_cpus_per_worker: Number of CPUs per worker.
+        num_cpus_per_worker: Number of CPUs per worker for Xenna scheduling. Does not affect Ray Data CPU scheduling; use ray_data_num_cpus for that.
         encoder: Video encoder to use.
         encoder_threads: Number of threads per encoder.
         encode_batch_size: Number of clips to encode in parallel.
@@ -48,6 +48,7 @@ class ClipTranscodingStage(ProcessingStage[VideoTask, VideoTask]):
         num_clips_per_chunk: Number of clips per chunk. If the number of clips is larger than this, the clips will be split into chunks, and created VideoTasks for each chunk.
         verbose: Whether to print verbose logs.
         ffmpeg_verbose: Whether to print FFmpeg verbose logs.
+        ray_data_num_cpus: CPU cores reserved per Ray Data actor for this stage. Defaults to 1.0 on the CPU encoder path to enable stage fusion with upstream stages. Set to None to fall back to resources.cpus. Does not affect Xenna scheduling.
     """
 
     num_cpus_per_worker: float = 6.0
@@ -61,6 +62,9 @@ class ClipTranscodingStage(ProcessingStage[VideoTask, VideoTask]):
     ffmpeg_verbose: bool = False
     verbose: bool = False
     name: str = "clip_transcoding"
+    ray_data_num_cpus: float | None = (
+        None  # CPU reservation for Ray Data scheduler; set to 1.0 on CPU path to enable stage fusion
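
Reviewer note: the docstring added above describes a fallback rule for `ray_data_num_cpus` (an explicit value wins, `None` falls back to `resources.cpus`). The visible hunks do not show where that rule is applied, so the following is only a minimal sketch of the documented behavior. `StageCpuConfig` and `resolve_ray_data_cpus` are hypothetical names invented for illustration, and `resources_cpus` stands in for `resources.cpus`; none of this is taken from the actual implementation.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class StageCpuConfig:
    """Hypothetical stand-in for the two CPU settings named in the docstring."""

    # Xenna-side reservation; 6.0 is the default shown in the diff.
    num_cpus_per_worker: float = 6.0
    # Ray Data-side reservation; 1.0 mirrors the documented CPU-path default.
    # None means "fall back to resources.cpus".
    ray_data_num_cpus: float | None = 1.0


def resolve_ray_data_cpus(config: StageCpuConfig, resources_cpus: float) -> float:
    """Return the CPU reservation a Ray Data actor would get under this sketch."""
    if config.ray_data_num_cpus is None:
        # Documented fallback: use the stage's general resource request.
        return resources_cpus
    return config.ray_data_num_cpus


# Explicit value wins; None falls back to the general resource request.
default_reservation = resolve_ray_data_cpus(StageCpuConfig(), resources_cpus=6.0)
fallback_reservation = resolve_ray_data_cpus(
    StageCpuConfig(ray_data_num_cpus=None), resources_cpus=6.0
)
```

Under these assumptions, the Xenna setting (`num_cpus_per_worker`) never enters the Ray Data calculation, which matches the docstring's statement that the two schedulers are configured independently.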