
attention mechanism class addressing issue #5 (#30)

Open

rubinovitz wants to merge 21 commits into main

Conversation

rubinovitz (Contributor)

Created a subclass of the ToxModel that includes an attention mechanism.
Analysis forthcoming.

@nthain nthain self-requested a review November 6, 2017 17:15
@iislucas (Contributor) commented Nov 7, 2017

This is super cool :D Do you have any results on its AUC/quality compared to the attention-less models that you could add to this description?

@rubinovitz (Contributor, Author) commented Nov 7, 2017 via email

@nthain (Contributor) left a comment


Thanks for submitting this PR! We're excited to have you thinking about this.

I think the build_dense_attention_layer function looks good.

Now I'm trying to wrap my head around the build_model function. Because this model was built as a CNN, attention is a bit of a weird concept. In this model, if I'm reading this correctly, the CNN component reduces the sentence to a vector (default 128), we then add a dense layer (of size max_num_tokens), use another dense layer (of size max_num_tokens) to compute "attention weights", and take a weighted sum of these last two vectors? Is there some significance to the attention weights in this context?

My only familiarity is with attention in the RNN context, and there, I can see your build_dense_attention_layer being quite useful.
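
For concreteness, here is a minimal sketch of the dense attention layer being discussed, assuming the Keras functional API; the function and key names (build_dense_attention_layer, attention_probs, attention_preds) mirror the snippets quoted below, and the 128-dim input is just a hypothetical stand-in for the CNN output.

from keras.layers import Input, Dense, Multiply

def build_dense_attention_layer(input_tensor, dim):
    # Learn a softmax distribution over the dim positions of the input vector.
    attention_probs = Dense(dim, activation='softmax', name='attention_probs')(input_tensor)
    # Re-weight the input element-wise by that distribution.
    attention_mul = Multiply()([input_tensor, attention_probs])
    return {'attention_probs': attention_probs, 'attention_preds': attention_mul}

# Hypothetical usage on a 128-dim CNN output:
x = Input(shape=(128,))
attention_dict = build_dense_attention_layer(x, 128)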


x = Flatten()(x)
x = Dropout(self.hparams['dropout_rate'], name="Dropout")(x)
x = Dense(250, activation='relu', name="Dense_RELU")(x)
Contributor

The hardcoded 250 is not quite right. I think this should be self.hparams['max_sequence_length'] for the attention layer to work for a general set of hyperparameters.
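
A small sketch of what that suggested change might look like in isolation; the hparams value and the 128-dim input here are hypothetical, just to make the snippet self-contained.

from keras.layers import Input, Dense

hparams = {'max_sequence_length': 250}  # hypothetical example value
x = Input(shape=(128,))                 # stand-in for the flattened CNN output
# Use the hyperparameter instead of the hardcoded 250:
x = Dense(hparams['max_sequence_length'], activation='relu', name='Dense_RELU')(x)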

Contributor Author

Now fixed at line 444

attention_mul = Multiply()([input_tensor, attention_probs])
return {'attention_probs': attention_probs, 'attention_preds': attention_mul}

def build_probs(self):
Contributor

I'm not sure what the purpose of the build_probs function is.

Contributor Author

That's obsolete, you're right. This was removed in the latest push.

preds = attention_dict['attention_preds']
preds = Dense(2, name="preds_dense", activation='softmax')(preds)
rmsprop = RMSprop(lr=self.hparams['learning_rate'])
self.model = Model(sequence_input, preds)
Contributor

It would be nice to expose the attention weights as well as the predictions so we can visualize what the model is paying attention to.
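
One way that could look, as a hedged sketch in the Keras functional API; the layer names mirror the snippets above, and the 250-dim shapes are hypothetical.

from keras.layers import Input, Dense, Multiply
from keras.models import Model

sequence_input = Input(shape=(250,), name='sequence_input')
attention_probs = Dense(250, activation='softmax', name='attention_probs')(sequence_input)
attention_preds = Multiply()([sequence_input, attention_probs])
preds = Dense(2, activation='softmax', name='preds_dense')(attention_preds)

# Expose both the predictions and the attention weights, so the weights can
# be fetched at inference time for visualization.
model = Model(inputs=sequence_input, outputs=[preds, attention_probs])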

Contributor Author

I now save this here.

@nthain (Contributor) commented Dec 4, 2017

Thanks for continuing to push changes! Let us know when you're ready for us to have another look.

@rubinovitz (Contributor, Author) commented Dec 4, 2017 via email

@iislucas (Contributor)

@rubinovitz : You might be interested in this: https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge :)

@iislucas (Contributor)

Also, could you remove the compiled Python files from the pull request? Thanks!

@rubinovitz (Contributor, Author)

Just pushed a bunch of updates for the 1DConv, but I still need to push up the LSTM, so stay tuned; I'll let you know when I'm done.

@nthain (Contributor) commented Mar 7, 2018

Thanks for the update! We'll stay tuned.

Base automatically changed from master to main March 25, 2021 19:54