FIX: Top logprob bug in Offline._parse_logprobs #47
The current implementation sometimes picks up the incorrect decoded token because it implicitly assumes that the model always samples the token with the highest logprob (which is what `logprob.rank == 1` selects). This leads to outputs like the ones in the screenshot: notice how, when you concatenate the `Logprobs.token`s, you don't get the same string as the model's actual output (often because the model was "indecisive" between a number and a space). This then confuses `Classifier._parse_logprobs`, which only looks at `Logprobs.token` to determine which sequence positions contain the model's labels. In the logs I screenshotted above, this leads to `Classifier._parse_logprobs` returning a list of the wrong length, which ultimately causes the assertion at `scorers/classifier/classifier.py:133` to fail.

My PR fixes this by always setting `Logprobs.token` to the token the model actually chose, rather than the token it was maximally likely to choose (but may or may not have actually chosen).
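To illustrate the failure mode, here is a minimal, self-contained sketch (the `Logprob` dataclass and both helpers are hypothetical stand-ins, loosely modeled on vLLM-style per-token logprob dicts, not the actual code in this repo). When the sampled token is not the rank-1 candidate, a rank-based lookup returns the wrong token, while looking up by the sampled token id returns the right one:

```python
from dataclasses import dataclass

@dataclass
class Logprob:
    """Per-candidate logprob info (illustrative, vLLM-style)."""
    logprob: float
    rank: int
    decoded_token: str

def token_from_rank1(candidates: dict[int, Logprob]) -> str:
    """Old behavior (buggy): assume the model sampled the rank-1 token."""
    return next(lp.decoded_token for lp in candidates.values() if lp.rank == 1)

def token_from_sampled_id(candidates: dict[int, Logprob], sampled_id: int) -> str:
    """Fixed behavior: use the token id the model actually sampled."""
    return candidates[sampled_id].decoded_token

# The model was "indecisive" between a digit and a space and happened to
# sample the rank-2 candidate (token ids here are made up for the example).
candidates = {
    16: Logprob(logprob=-0.69, rank=1, decoded_token="1"),
    220: Logprob(logprob=-0.70, rank=2, decoded_token=" "),
}
sampled_id = 220  # the id the model actually emitted

assert token_from_rank1(candidates) == "1"              # wrong: rank-1 lookup
assert token_from_sampled_id(candidates, sampled_id) == " "  # correct token
```

Once the parser records the actually-sampled token at every position, concatenating the tokens reproduces the model's output exactly, so the downstream position matching in `Classifier._parse_logprobs` can no longer drift.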