Integrate existing tools: Captum, LIT, AllenNLP Interpret, NeuroX into the HF pipeline #3

Open
oserikov opened this issue Feb 24, 2022 · 0 comments


oserikov commented Feb 24, 2022

duration: scalable, can be either 175 or 350 hours
mentor: @oserikov
difficulty: medium
requirements:

  1. PyTorch
  2. scikit-learn (sklearn)
  3. Python software engineering, including OOP and design patterns
  4. experience with Transformer language models

useful links:

Idea Description:

Many interpretability tools exist, for both industry and academic users.
Some of them are general-purpose and others are very field-specific, but all of them have several things in common.
They are typically applied to HuggingFace models, and all of them try to explain the black boxes we have.

In short, we propose to bring the existing popular model-interpretation stack together. We have surveyed interpretability for LLMs and now have both a scientific and an engineering vision of what we should implement in order to maximize the interpretability of existing LLMs.

You need to implement an HF-compatible interpretability aggregation API. The exact tasks to accomplish are:

  1. choose the most important methods provided by Captum, AllenNLP Interpret and NeuroX (which ones? To better understand the task, try to figure it out yourself. Once you have done this, reach out to us ASAP and we will discuss your vision)
  2. implement an all-in-one `interpret` method that runs all of the chosen methods
  3. perform an initial analysis of the BigScience model checkpoints
  4. ensure the codebase is easy to extend with new methods
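
To make the shape of the all-in-one interpret method more concrete, here is a minimal sketch of one possible design: a registry that runs every registered method on the same input. Everything in it is an assumption made for illustration, not a decided API: the `InterpretabilityAggregator` class, the `register` decorator, and the `(score_fn, tokens)` signature are hypothetical, and the two toy perturbation methods only stand in for wrappers around the real tools. It uses the standard library only, so no Captum, AllenNLP Interpret, or NeuroX calls appear here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Sequence

# Hypothetical signatures: a score function maps tokens to a scalar
# (e.g. a class logit), and an attribution method returns one score per token.
ScoreFn = Callable[[Sequence[str]], float]
AttributionFn = Callable[[ScoreFn, Sequence[str]], List[float]]


@dataclass
class InterpretabilityAggregator:
    """Hypothetical registry that runs every registered method on one input."""

    methods: Dict[str, AttributionFn] = field(default_factory=dict)

    def register(self, name: str) -> Callable[[AttributionFn], AttributionFn]:
        """Decorator that adds a method under `name`; new methods need no other changes."""

        def decorator(fn: AttributionFn) -> AttributionFn:
            if name in self.methods:
                raise ValueError(f"method {name!r} is already registered")
            self.methods[name] = fn
            return fn

        return decorator

    def interpret(self, score_fn: ScoreFn, tokens: Sequence[str]) -> Dict[str, List[float]]:
        """Run all registered methods and return their scores keyed by method name."""
        return {name: fn(score_fn, tokens) for name, fn in self.methods.items()}


aggregator = InterpretabilityAggregator()


@aggregator.register("occlusion")
def occlusion(score_fn: ScoreFn, tokens: Sequence[str]) -> List[float]:
    # Replace each token with a mask token and measure the drop in the score.
    base = score_fn(tokens)
    return [
        base - score_fn([*tokens[:i], "[MASK]", *tokens[i + 1:]])
        for i in range(len(tokens))
    ]


@aggregator.register("leave_one_out")
def leave_one_out(score_fn: ScoreFn, tokens: Sequence[str]) -> List[float]:
    # Delete each token in turn and measure the drop in the score.
    base = score_fn(tokens)
    return [
        base - score_fn([*tokens[:i], *tokens[i + 1:]])
        for i in range(len(tokens))
    ]


def toy_score(tokens: Sequence[str]) -> float:
    # Stand-in for a real model output: counts occurrences of the word "good".
    return float(sum(tok == "good" for tok in tokens))


if __name__ == "__main__":
    for name, scores in aggregator.interpret(toy_score, ["a", "good", "movie"]).items():
        print(name, scores)
```

In a real implementation, `score_fn` would presumably wrap a HuggingFace model's forward pass, and each registered function would delegate to one of the chosen tools. The registry is one way to address task 4, since adding a method is a single decorated function.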

Coding Challenge

See task 1 in the list above.
