-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Speed up test execution for non-cached tests #181
Comments
That is a good observation and I here is my plan to speed it up. Here's an outline of what I have in mind:
I think implementing these simple changes should speed up the AI execution by 2x at least. @slavingia, @gladyshcodes Wdyt? |
Makes sense. I pinged Anthropic to see if they'd support multiple actions in one step. |
Have you received a callback from Anthropic yet? |
Not yet, will bump |
The executions are already pretty expensive. Do we have ways to cut down on cost as well as speed up executions? |
We should probably tackle #187 first, to see that, and then evaluate. Anything that caches computer use should help. |
Recently we introduced caching #179 that made test execution about 6 times faster. I have several more ideas in mind:
Hoping quota of LLM providers will decrease over time (similar to how the price of GFLOPS or disk space has dropped, making this tool more affordable for everyone |
This will be huge and eventually happen.
This seems like relatively low-hanging fruit to explore. In theory a server could run one browser for every test (just keeping in mind chaining/caching) that needs be run, and the entire test suite should just take as long as the slowest chain of tests. |
Wouldn't running tests in parallel run into rate limit issues, and in turn, make null the speed gains? I suppose it depends partly on the API key tier, but when I ran the original claude computer use demo I would constantly get rate limited |
Things may have changed, but overall you're right it'll be a bottleneck. I'll bring it up with them! |
What
Speed up test execution by finding ways to addressing issues outlined below.
Why
While working on #179, I have found that screenshooting perhaps takes the most time when test runs in
--no-cache
mode. Sometimes, screenshots are taken several times when there's no need for it. Also, delay before making a screenshot is about a second or so.The text was updated successfully, but these errors were encountered: