Not an issue, a question: Why not benchmark batch sizes 16, 32, 64? #8

Open
Mattk70 opened this issue Jan 28, 2025 · 1 comment
Mattk70 commented Jan 28, 2025

Your benchmarks use batch sizes (BS) of 1, 100, 300, 1000, etc. BS 1 is clearly slower than BS 100 on the GPU backend, but batch sizes above 100 show performance degradation.

Two issues with this:

  • For consumer GPUs, even BS 100 will be hard to fit into memory
  • As the results show deterioration from 100 upward, what is the performance for BS 16, 32, 64?
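For illustration, a sweep over those smaller batch sizes might look like the minimal sketch below. It is not the project's benchmark code: `benchmark_batch_sizes`, `run_batch`, and `dummy_run_batch` are hypothetical names, and the dummy workload only stands in for a real inference call. On a GPU backend, the call would also need to block until results are ready for the timings to mean anything.

```python
import time
from typing import Callable, Dict, List, Sequence


def benchmark_batch_sizes(
    run_batch: Callable[[List[int]], object],
    n_items: int,
    batch_sizes: Sequence[int],
) -> Dict[int, float]:
    """Return throughput (items per second) for each batch size.

    `run_batch` is a hypothetical callable that processes one batch.
    """
    items = list(range(n_items))
    results: Dict[int, float] = {}
    for bs in batch_sizes:
        start = time.perf_counter()
        # Process the whole workload in chunks of at most `bs` items.
        for i in range(0, n_items, bs):
            run_batch(items[i:i + bs])
        elapsed = time.perf_counter() - start
        # Guard against a zero reading on very small workloads.
        results[bs] = n_items / max(elapsed, 1e-12)
    return results


def dummy_run_batch(batch: List[int]) -> int:
    # Hypothetical stand-in for the model's inference call.
    return sum(x * x for x in batch)


if __name__ == "__main__":
    sweep = benchmark_batch_sizes(dummy_run_batch, 6400, [1, 16, 32, 64, 100])
    for bs, items_per_second in sweep.items():
        print(f"BS {bs:>4}: {items_per_second:,.0f} items/s")
```

Swapping `dummy_run_batch` for the actual inference function, and repeating each measurement a few times, would give numbers comparable to the existing BS 1 and BS 100 results.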

Enquiring minds need to know!

Thanks!

@stefantaubert
Member

That’s a great point, and it would indeed be very interesting to test! Thanks for bringing it up. I’d love to look into this when I have a bit more time, but unfortunately, I don’t have the capacity to do so at the moment.
