How to support re-compute kv-cache after certain decoded token #6886

jiazhan-msft · 2024-07-29T04:44:39Z

jiazhan-msft
Jul 29, 2024

I have a feature in my model which switches model setup after certain decoded token, e.g., when decoded to the n-th token, the model requires re-compute previous kv-cache, what's the possible path to enable this support? Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

How to support re-compute kv-cache after certain decoded token #6886

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

How to support re-compute kv-cache after certain decoded token #6886

Uh oh!

jiazhan-msft Jul 29, 2024

Replies: 0 comments

jiazhan-msft
Jul 29, 2024