-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whisper Redesigned Solution #23549
Open
kunal-vaishnavi
wants to merge
38
commits into
microsoft:main
Choose a base branch
from
kunal-vaishnavi:kvaishnavi/whisper-separate-export
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Whisper Redesigned Solution #23549
Changes from all commits
Commits
Show all changes
38 commits
Select commit
Hold shift + click to select a range
f314287
Add support for creating optimized whisper ONNX models without beam s…
kunal-vaishnavi 6a44f72
Fix incorrect dynamic axes labels
kunal-vaishnavi 58ec5eb
Fix fusion breaks for OpenAI implementation of Whisper
kunal-vaishnavi 4c228ea
Merge branch 'main' into kvaishnavi/whisper-separate-export
kunal-vaishnavi dd20876
Merge branch 'main' into kvaishnavi/whisper-separate-export
kunal-vaishnavi b13cb22
Comment out DMMHA case temporarily
kunal-vaishnavi 31db1a0
Replace MHA with DMMHA
kunal-vaishnavi 3b92432
Merge branch 'main' into kvaishnavi/whisper-separate-export
kunal-vaishnavi 7bb79f3
Debugging beam search output
kunal-vaishnavi 14b7e77
Initial commit for new export
kunal-vaishnavi fa345fe
Add parity check after export and optimization
kunal-vaishnavi e050dea
Fix multiple attention kernel invocations
kunal-vaishnavi bf87062
Make output Q*K values optional
kunal-vaishnavi 17fa0ab
Fix batch size check for cache indirection
kunal-vaishnavi 52aeb58
Save checkpoint for working solution
kunal-vaishnavi 240fe3b
Clean up code
kunal-vaishnavi ae98085
Fix string dumping
kunal-vaishnavi 3d2c8fe
Fix out_qk dtype issue for half input case.
mindest 287151f
Remove type cast for output QK
kunal-vaishnavi 0805d1d
Enable release mode build
kunal-vaishnavi b629903
Make QK output dtype independent of attention dtype
kunal-vaishnavi 648b389
Add batched jump times export
kunal-vaishnavi a6c6ee8
Get batched jump times ONNX model with parity check
kunal-vaishnavi c0a6ce4
Save checkpoint for working solution
kunal-vaishnavi 008eeb9
Merge branch 'main' into kvaishnavi/whisper-separate-export
kunal-vaishnavi 158d0a8
Fix build after merge
kunal-vaishnavi 02cb5be
Fix model with beam search op
kunal-vaishnavi 2acd593
Get model impl and beam search op export combinations working
kunal-vaishnavi 612eb0c
Enable separate export of encoder and decoder init
kunal-vaishnavi f2d78fd
Add tests for multiple export types to CIs
kunal-vaishnavi cb93517
Update folder and file names in Whisper README
kunal-vaishnavi 6da11ec
Add FP32 CPU DMMHA support
kunal-vaishnavi 9640736
Add unit tests
kunal-vaishnavi 75a342a
Merge branch 'main' into kvaishnavi/whisper-separate-export
kunal-vaishnavi 7fe6b05
Change debug message for PrepareQkv
kunal-vaishnavi 8620168
Fix seqlens_k after merge
kunal-vaishnavi b0a732b
Merge branch 'main' into kvaishnavi/whisper-separate-export
kunal-vaishnavi 23808f7
Add changes suggested by linter
kunal-vaishnavi File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It might not need. I did not see code in this file that dumps tensor.