-
Notifications
You must be signed in to change notification settings - Fork 595
Qualcomm AI Engine Direct - GA Model Enablement (Roberta) #11354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11354
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit fe5089a with merge base e02ca41 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
@pytorchbot label "release notes: qualcomm" |
Hi @cccclai, we have 4 GA BERT variants depend on this PR, will submit them after this got merged. It's just FYI, please take your time, no rush. |
And looks like it's not triggering CI, could you help on it? |
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
There is a lint error, can you fix it? |
- Add end to end script for Roberta - Handle extreme values in replace_inf function for masked_fill
32dd145
to
fe5089a
Compare
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
) ### Summary - support e2e script / test case for GA RoBERTa model - perf: 16a8w 79.2ms/inf (SM8750) (seq_len=107) - acc: ~= 50% ### Test plan ``` python backends/qualcomm/tests/test_qnn_delegate.py -k TestExampleOssScript.test_roberta --device <device_id> --host <host> --model <soc_model> --build_folder build-android --executorch_root . --sentence_dataset wikisent2.txt --artifact roberta_artifact ```
Summary
Test plan