Skip to content

Conversation

gongchensu
Copy link

@gongchensu gongchensu commented Sep 19, 2025

  1. 接入使用InfiniCore分支中的logsoftmax算子
  2. 增加completion端口,支持launch_server后通过http端口计算得到max_tokens=0的logprobs
  3. 更改test_ppl和jiuge_ppl中用到的torch库的log_softmax算子
  4. 对齐test_ppl的token分块方式,使得和jiuge_ppl对perlexity的计算结果保持一致

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant