diff --git a/images/Editing using TA_Task Analogy.png b/images/Editing using TA_Task Analogy.png new file mode 100644 index 0000000..b8eab2f Binary files /dev/null and b/images/Editing using TA_Task Analogy.png differ diff --git a/images/Editing using TA_Forgetting.png b/images/Editing using TA_Forgetting.png new file mode 100644 index 0000000..555b860 Binary files /dev/null and b/images/Editing using TA_Forgetting.png differ diff --git a/images/Editing using TA_addition.png b/images/Editing using TA_addition.png new file mode 100644 index 0000000..d84a7fd Binary files /dev/null and b/images/Editing using TA_addition.png differ diff --git a/images/Editing using Task Arithmetic.png b/images/Editing using Task Arithmetic.png new file mode 100644 index 0000000..b8d1cfb Binary files /dev/null and b/images/Editing using Task Arithmetic.png differ diff --git a/summaries/Editing Using Task Arithmetic.md b/summaries/Editing Using Task Arithmetic.md new file mode 100644 index 0000000..10c54f7 --- /dev/null +++ b/summaries/Editing Using Task Arithmetic.md @@ -0,0 +1,61 @@ +## Title: Editing Models with Task Arithmetic + + +**Authors:** Gabriel Ilharco, Marco Tulio Ribeiro, Mitchell Wortsman, Suchin Gururangan, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi, March 2023 + + +**Abstract:** +In "Editing Models with Task Arithmetic," the paper introduces a novel method for modifying the behavior of pre-trained neural networks using task vectors. Task vectors are directions in the weight space of a pre-trained model, created by subtracting the model's original weights from its weights after fine-tuning on a specific task. This approach allows for performance improvements on target tasks by moving in the direction of the task vector without expensive pre-training, reducing costs significantly. + +The paper outlines three operations on task vectors from various fine-tuned LLMs: + +- **Negation:** Reducing performance on a task by negating the task vector, with minimal impact on other tasks. +- **Addition:** Enhancing performance across multiple tasks by adding task vectors together. +- **Analogy Relationships:** Improving performance on a fourth task by combining task vectors from three related tasks, even without training data for the fourth task. + +Experiments across various models, modalities, and tasks demonstrate that task arithmetic is an efficient and effective technique for fine-tuning model behavior. + +# Task Vectors and Task Arithmetic + + +
+
+
+
+
+
+
+ images/Editing using Task Arithmetic-Task Analogy.png
+
+