
Learning Rate Setting #16

Open
mr-Mojo opened this issue Nov 5, 2019 · 1 comment

Comments

mr-Mojo commented Nov 5, 2019

Hi, I am experimenting with the NIMA implementation for a scientific project! In your README, you say: "The learning rate setting differs from the original paper. I can't seem to get the model to converge with momentum SGD using an lr of 3e-7 for the conv base and 3e-6 for the dense block." Which settings did you use in the end? The argparse defaults are still set to 3e-7 and 3e-6, which is why I was wondering.

Thank you :)
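For reference, the usual way to give the conv base and the dense block different learning rates in PyTorch is to pass two parameter groups to the optimizer. A minimal sketch, using toy stand-in modules (`conv_base` and `dense_block` are illustrative names, not necessarily the repo's actual attribute names), with the README's values of 3e-7 and 3e-6:

```python
import torch
import torch.nn as nn

# Toy stand-ins for NIMA's two parts (illustrative, not the repo's modules).
conv_base = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU())
dense_block = nn.Sequential(nn.Linear(8, 10), nn.Softmax(dim=1))

# Two parameter groups with different learning rates, trained with momentum
# SGD: 3e-7 for the conv base and 3e-6 for the dense block (the README values).
optimizer = torch.optim.SGD(
    [
        {"params": conv_base.parameters(), "lr": 3e-7},
        {"params": dense_block.parameters(), "lr": 3e-6},
    ],
    momentum=0.9,
)

print([group["lr"] for group in optimizer.param_groups])  # → [3e-07, 3e-06]
```

Each group keeps its own `lr` in `optimizer.param_groups`, so swapping in other values (e.g. the ones suggested below in this thread) is a one-line change per group.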

@yunxiaoshi
Owner

0.005 for the dense block and 0.0005 for the conv base give a reasonable result. But I suspect it is not optimal; maybe do a grid/random search.
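A grid search over the two learning rates can be sketched in a few lines. This is a self-contained toy: `evaluate` is a hypothetical placeholder standing in for a real training run that returns validation loss (here a dummy quadratic centered on the values suggested above, purely so the example runs):

```python
import itertools

# Candidate learning rates bracketing the suggested 0.0005 / 0.005.
conv_lrs = [5e-5, 5e-4, 5e-3]
dense_lrs = [5e-4, 5e-3, 5e-2]

def evaluate(conv_lr, dense_lr):
    """Placeholder for a real training run returning validation loss.
    A dummy quadratic keeps the example self-contained."""
    return (conv_lr - 5e-4) ** 2 + (dense_lr - 5e-3) ** 2

# Grid search: try every (conv_lr, dense_lr) pair and keep the best one.
best = min(itertools.product(conv_lrs, dense_lrs), key=lambda p: evaluate(*p))
print(best)  # → (0.0005, 0.005)
```

In practice `evaluate` would train for a few epochs per pair and return a held-out metric; random search (sampling log-uniformly over a range instead of enumerating a grid) scales better when more hyperparameters are tuned at once.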
