Trying to retrieve token-level uncertainty values #414

@alexander-hamme

Hello,

I am working on a personal research study exploring live hallucination detection (with possibilities for short-circuiting, etc.). I would like to use the various metrics offered by lm-polygraph, but it seems that only the TokenEntropy class supports returning the full, non-aggregated list of token-level values.

Is there any way to retrieve the original token-level value list for other metrics, such as PMI/CPMI, Renyi, etc.? Even if they make more sense at the claim or full-response level, I am still interested in getting the token-level list for further exploration and analysis.
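
To make the request concrete, here is a minimal sketch of the kind of non-aggregated output in question: one value per generated token. It is computed directly from per-token probability distributions with NumPy, not through lm-polygraph. The function names `token_entropy` and `token_renyi_entropy` are hypothetical, and the Rényi formula is the textbook definition, which may differ from what the library's Renyi estimator computes.

```python
import numpy as np


def token_entropy(probs):
    """Shannon entropy for each token position.

    probs: array-like of shape (num_tokens, vocab_size), each row summing to 1.
    Returns an array of shape (num_tokens,), one value per token (no aggregation).
    """
    probs = np.asarray(probs, dtype=float)
    # Replace zeros by 1 inside the log so zero-probability entries contribute 0.
    safe = np.where(probs > 0, probs, 1.0)
    return -np.sum(probs * np.log(safe), axis=-1)


def token_renyi_entropy(probs, alpha=2.0):
    """Textbook Renyi entropy of order alpha for each token position.

    H_alpha(p) = log(sum_i p_i ** alpha) / (1 - alpha), defined for alpha != 1.
    """
    if alpha == 1.0:
        raise ValueError("alpha must differ from 1 (the limit is Shannon entropy)")
    probs = np.asarray(probs, dtype=float)
    return np.log(np.sum(probs ** alpha, axis=-1)) / (1.0 - alpha)


# Toy input (made up for illustration): two token positions over a 4-word vocabulary.
example_probs = [
    [0.25, 0.25, 0.25, 0.25],  # maximally uncertain token
    [1.0, 0.0, 0.0, 0.0],      # fully confident token
]
per_token_entropy = token_entropy(example_probs)
per_token_renyi = token_renyi_entropy(example_probs, alpha=2.0)
```

Both results keep one entry per token, which is the shape of output that would be useful for the other estimators as well.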

As far as I can see, the source code would have to be significantly modified (even judging by the low_level Jupyter notebook). Please let me know if there are functions available to do this already, with estimate_uncertainty or a similar function.

Thank you!

Labels: question
