Skip to content

Bert training update #785

Bert training update

Bert training update #785

Triggered via pull request February 12, 2025 13:16
Status Failure
Total duration 6h 3m 16s
Artifacts
Run distributed tests on Trainium 1
6h 0m
Run distributed tests on Trainium 1
Fit to window
Zoom out
Zoom in

Annotations

2 errors
Run distributed tests on Trainium 1
The job running on runner aws-trn1-32xlarge-use1-public-80-hsr8z-runner-fbmvz has exceeded the maximum execution time of 360 minutes.
Run distributed tests on Trainium 1
The operation was canceled.