
Implement Pause and Resume for RowSource #191


Merged: 9 commits, May 7, 2025


kaidaguerre
Contributor

No description provided.

Update StatusFromProto to accept proto.Event
Add ChunkWrittenFromProto, CompleteFromProto, StartedFromProto
Update EventFromProto to handle all events, so it can also be used from the CLI
Instead of reopening connection, forceMemoryRelease - flush all but 64 MB
Simplify worker memory calc - do not leave an allocation for the CLI, just divide memory by the number of workers
Add PausableObservable interface, which stops event publishing when paused and adds a BlockWhilePaused function
RowSource and CollectorImpl are now PausableObservables
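
The pause behaviour described in the commits above could look roughly like the following sketch. It is not the code from this PR: the `Pausable` type, its `Publish` method, and the choice to drop (rather than queue) events while paused are all assumptions made for illustration. Only the `BlockWhilePaused` name comes from the commit messages.

```go
package main

import "sync"

// Pausable is a hypothetical stand-in for a PausableObservable.
// The real interface in the PR may differ.
type Pausable struct {
	mu     sync.Mutex
	cond   *sync.Cond
	paused bool
}

// NewPausable returns an unpaused Pausable.
func NewPausable() *Pausable {
	p := &Pausable{}
	p.cond = sync.NewCond(&p.mu)
	return p
}

// Pause marks the observable as paused.
func (p *Pausable) Pause() {
	p.mu.Lock()
	p.paused = true
	p.mu.Unlock()
}

// Resume clears the paused flag and wakes every blocked caller.
func (p *Pausable) Resume() {
	p.mu.Lock()
	p.paused = false
	p.mu.Unlock()
	p.cond.Broadcast()
}

// BlockWhilePaused returns immediately when not paused,
// otherwise it blocks until Resume is called.
func (p *Pausable) BlockWhilePaused() {
	p.mu.Lock()
	for p.paused {
		p.cond.Wait()
	}
	p.mu.Unlock()
}

// Publish passes the event to handler unless paused and reports
// whether it was delivered. Dropping events while paused is an
// assumption of this sketch.
func (p *Pausable) Publish(event string, handler func(string)) bool {
	p.mu.Lock()
	paused := p.paused
	p.mu.Unlock()
	if paused {
		return false
	}
	handler(event)
	return true
}
```

A producer would call `BlockWhilePaused` before doing more work, so that pausing applies back-pressure without the producer needing to poll.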

RowEnrichmentCollector.writeChunk and ArtifactSourceImpl.OnArtifactDiscovered call BlockWhilePaused
CollectorImpl.onChunk calls checkJsonlSize, which checks the size of the JSON destination folder and pauses the source if it is too large
Remove logging
Add hysteresis to JSON size polling - do not unpause until size is < 75% of max JSON size
Add EnsureArtifactPath
Rename req.MaxJsonlSizeMb to MaxTempCacheSizeMb
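
The hysteresis rule in the commits above can be sketched as a small state machine. This is an illustration under assumptions, not the PR's `checkJsonlSize`: the `sizeGate` type, its fields, and the choice to pause only when the size strictly exceeds the maximum are hypothetical. The 75% resume threshold is the one figure taken from the commit message.

```go
package main

// resumeFraction is the fraction of the maximum size that the
// folder must drop below before the source is unpaused.
const resumeFraction = 0.75

// sizeGate is a hypothetical helper that tracks whether the
// source should currently be paused.
type sizeGate struct {
	maxBytes int64 // assumed limit, e.g. derived from MaxTempCacheSizeMb
	paused   bool
}

// update takes the current folder size and returns the new paused
// state. It pauses above maxBytes and resumes only below 75% of
// maxBytes, so sizes between the two thresholds leave the state
// unchanged.
func (g *sizeGate) update(currentBytes int64) bool {
	resumeBelow := int64(float64(g.maxBytes) * resumeFraction)
	switch {
	case !g.paused && currentBytes > g.maxBytes:
		g.paused = true
	case g.paused && currentBytes < resumeBelow:
		g.paused = false
	}
	return g.paused
}
```

The gap between the two thresholds avoids rapid pause/resume flapping when the folder size hovers near the limit.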
@kaidaguerre kaidaguerre merged commit aa14fdb into develop May 7, 2025
2 checks passed
@kaidaguerre kaidaguerre deleted the memory_testing branch May 7, 2025 17:02