@dmarx dmarx commented Aug 6, 2025

This PR publishes an sbatch script that launches a "self-contained" training demo. The demo is mainly intended for debugging and smoke testing. As such, it's an odd fit for the reference-architecture repo because it illustrates an anti-pattern: shipping training data packaged into the container. That shortcut is expedient for how this container is generally used, but we might want to hold off on merging to main until we've added a reference training example that doesn't demonstrate the anti-pattern.

@dmarx dmarx requested a review from tmadhyastha-cw August 6, 2025 20:48
@bradbeam
Member

I love the concept here. Would it take much of a lift to make this work on GH200?

@bradbeam
Member

> shipping training data packaged into the container.

Could we make use of a public bucket instead?
