@wweic wweic commented Sep 17, 2025

@kthui kthui added the PR: perf A code change that improves performance label Sep 18, 2025
@kthui kthui (Contributor) commented Sep 18, 2025

Pipeline #35130400

@whoisj whoisj added the enhancement New feature or request label Sep 18, 2025
@wweic wweic force-pushed the wweic/optimize-string-tensor-pr branch from 9fbfc13 to cf1a489 Compare September 18, 2025 20:45
@wweic wweic marked this pull request as ready for review September 18, 2025 20:46
@wweic wweic (Author) commented Sep 18, 2025

@kthui I addressed your comments from the PR and from the issue. I also ran a local benchmark (spun up a Triton container and sent it 5000 requests): the p99, p80, and p50 latencies are the same as before the changes.

3 participants