Adds temperature, top_k decoding, and top_p decoding to decode_seq2seq.py #44
base: master
Conversation
This is ready for review. Arguments were set so that the paper's results will not change by default when running eval.
@@ -1449,10 +1453,15 @@ def forward(self, input_ids, token_type_ids, position_ids, attention_mask, task_
last_hidden = new_encoded_layers[-1][:, -1:, :]
prediction_scores, _ = self.cls(
    last_hidden, None, task_idx=task_idx)
prediction_scores = prediction_scores[:, -1, :] / (self.temperature if self.temperature > 0 else 1.)
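For context, temperature scaling simply divides the pre-softmax logits by the temperature, which sharpens (T < 1) or flattens (T > 1) the resulting distribution. A minimal numpy sketch of the guard used in the diff; the helper names here are illustrative and not part of the PR:

```python
import numpy as np

def apply_temperature(logits, temperature):
    # Divide logits by temperature; a non-positive temperature is
    # treated as 1.0 (no change), mirroring the guard in the diff.
    return logits / (temperature if temperature > 0 else 1.0)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.0])
base = softmax(logits)
sharp = softmax(apply_temperature(logits, 0.5))  # more peaked
flat = softmax(apply_temperature(logits, 2.0))   # closer to uniform
```

Lower temperatures concentrate probability mass on the argmax token; higher temperatures spread it out, making sampling more diverse.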
prediction_scores[:, -1, :] reduces the dimension from 3 to 2, while other places assume dim=3 (such as prediction_scores[:, :, token_id].fill_(-10000.0)).
Good catch. I checked, and prediction_scores only needed to be 3-dimensional for the not_predict_set conditional, and is not used after that, so I modified that access to work properly with 2 dimensions.
Unfortunately, due to the way torch.multinomial works, I cannot leave it as 3 dimensions, so I had to make the modification that way.
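For readers unfamiliar with the constraint being discussed: torch.multinomial only accepts 1-D or 2-D probability inputs, so the last-step scores have to be sliced down to (batch, vocab) before sampling. A numpy sketch of the equivalent per-row sampling over a 2-D matrix; sample_per_row is a hypothetical helper, not code from the PR:

```python
import numpy as np

def sample_per_row(probs, rng):
    # Draw one token index per batch row from a (batch, vocab)
    # probability matrix -- the 2-D shape torch.multinomial expects.
    return np.array([rng.choice(probs.shape[1], p=row) for row in probs])

rng = np.random.default_rng(0)
# Degenerate distributions make the draw deterministic for illustration.
probs = np.array([[0.0, 1.0, 0.0],
                  [1.0, 0.0, 0.0]])
ids = sample_per_row(probs, rng)
```

A 3-D (batch, seq, vocab) tensor would have to be flattened or sliced first, which is why reducing to 2-D at the final decoding step is the simplest fix.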
This PR adds the capability to provide different token decoding strategies.
Pretty much a transplant of some code from https://github.com/huggingface/transformers/blob/master/examples/run_generation.py
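The borrowed filtering logic works roughly as follows: top-k keeps only the k highest-scoring tokens, and top-p (nucleus) keeps the smallest set of tokens whose cumulative probability exceeds p, masking everything else with a large negative value before sampling. This is a numpy re-sketch of that idea, not the exact huggingface code:

```python
import numpy as np

def top_k_top_p_filter(logits, top_k=0, top_p=0.0, filter_value=-1e4):
    # Sketch of top-k / nucleus filtering over a 1-D logits vector.
    logits = logits.copy()
    if top_k > 0:
        # Mask every logit below the k-th largest value.
        kth = np.sort(logits)[-top_k]
        logits[logits < kth] = filter_value
    if top_p > 0.0:
        order = np.argsort(logits)[::-1]          # descending
        shifted = logits[order] - logits[order][0]
        probs = np.exp(shifted) / np.exp(shifted).sum()
        cum = np.cumsum(probs)
        remove = cum > top_p
        # Shift right so the first token crossing the threshold is kept,
        # and always keep at least the most likely token.
        remove[1:] = remove[:-1].copy()
        remove[0] = False
        logits[order[remove]] = filter_value
    return logits
```

After filtering, a softmax over the surviving logits followed by multinomial sampling gives the decoded token; the masked entries receive negligible probability mass.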