Implement tests for abstract structures such as those in the Circuits thread #1

Open
oserikov opened this issue Feb 24, 2022 · 0 comments

oserikov commented Feb 24, 2022

duration: scalable, can be either 175 or 350 hours
mentor: @oserikov, Ryan Teehan
difficulty: medium
requirements:

  1. PyTorch
  2. sklearn
  3. experience reusing academic code
  4. experience with Transformer language models

useful links:

Idea Description:

The Circuits thread summarizes several abstract structures found in CV models. The Branch Specialization tendency of CV neural networks, as well as the Weight Banding property of the last layers of such networks, has not been directly studied in LLMs, though the findings of several papers (1, 2) could be related.

The task is to study how these abstract structures are represented in CV and NLP models by applying the same inspection techniques to both groups of models. This would help answer the following questions: Are the abstract structures identified in CV models also found in NLP models? How do these structures differ among NLP models?

Coding Challenge

  1. Reproduce the core insight of Branch Specialization: demonstrate the presence of branch specialization in a CV model.
  2. Reproduce the Individual Neurons analysis on a BERT model.
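
As a starting point for the individual-neuron part of the challenge, here is a minimal sketch of one simple way to look for neurons associated with a property: rank each neuron by the absolute Pearson correlation between its activation and a binary per-token label. This is an illustrative assumption, not necessarily the method used in the Individual Neurons analysis. The helper names (`pearson`, `rank_neurons`, `make_toy_data`) are hypothetical, and the synthetic activations stand in for hidden states that would in practice be extracted from BERT.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def rank_neurons(activations, labels):
    """Return neuron indices sorted by |correlation| with the label, strongest first.

    activations: list of per-token activation vectors (tokens x neurons)
    labels: list of 0/1 values, one per token (hypothetical property labels)
    """
    n_neurons = len(activations[0])
    scores = []
    for j in range(n_neurons):
        column = [row[j] for row in activations]
        scores.append((abs(pearson(column, labels)), j))
    return [j for _, j in sorted(scores, reverse=True)]


def make_toy_data(n_tokens=200, n_neurons=8, signal_neuron=3, seed=0):
    """Synthetic stand-in for model activations: one neuron tracks the label."""
    rng = random.Random(seed)
    labels = [rng.randint(0, 1) for _ in range(n_tokens)]
    activations = []
    for y in labels:
        row = [rng.gauss(0.0, 1.0) for _ in range(n_neurons)]
        row[signal_neuron] += 3.0 * y  # planted signal (assumption of the toy setup)
        activations.append(row)
    return activations, labels


activations, labels = make_toy_data()
ranking = rank_neurons(activations, labels)
print("neurons ranked by |correlation| with the label:", ranking)
```

In the toy data the planted neuron should come out on top. With real models, the same ranking could be computed per layer and compared across CV and NLP models, though a correlation ranking alone says nothing about causal importance.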

@oserikov oserikov changed the title Imlement tests for abstract structures such as in Curcuits thread 1. Imlement tests for abstract structures such as in Curcuits thread Feb 26, 2022
@oserikov oserikov changed the title 1. Imlement tests for abstract structures such as in Curcuits thread Imlement tests for abstract structures such as in Curcuits thread Feb 26, 2022