
Gradient Noise #46

@ClashLuke

Description

Some works have suggested that adding gradient noise helps deep models converge and generalise. Others, such as DDPG, showed that this also holds for shallow networks in a different domain. That's why it could be interesting for us to explore gradient noise as a way to improve generalisation, and with it convergence, by avoiding overfitting and escaping poor local minima during training.
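
As a starting point, here is a minimal PyTorch sketch of the idea, assuming the usual annealed Gaussian schedule sigma_t^2 = eta / (1 + t)^gamma from the gradient-noise literature; the helper name and the eta/gamma defaults are placeholders rather than anything already in our codebase:

```python
import torch


def add_gradient_noise(model, step, eta=0.3, gamma=0.55):
    """Add annealed Gaussian noise to every gradient in-place.

    Uses the schedule sigma_t^2 = eta / (1 + step)^gamma; eta and gamma
    here are illustrative defaults, not tuned values.
    """
    std = (eta / (1 + step) ** gamma) ** 0.5
    for param in model.parameters():
        if param.grad is not None:
            param.grad.add_(torch.randn_like(param.grad) * std)


# Hypothetical placement in a training loop:
#   loss.backward()
#   add_gradient_noise(model, step=global_step)
#   optimizer.step()
```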
One option to further improve gradient noise would be to combine it with #35, by adding different noise to each optimiser. This change would allow us to create combinations like Adam#Adam, where each optimiser sees slightly different noise at each step.
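
A rough sketch of the per-optimiser variant could look like the following; it assumes, purely for illustration, that the #35 combination steps each sub-optimiser in sequence over the same parameters, which may not match the actual design there:

```python
import torch


class NoisyOptimizerCombo:
    """Step several optimisers over the same parameters, giving each one
    an independently noised copy of the gradients.

    Stepping the sub-optimisers sequentially is an assumption made for
    this sketch; the combination scheme from #35 may work differently.
    """

    def __init__(self, params, optimizers, eta=0.3, gamma=0.55):
        self.params = list(params)
        self.optimizers = optimizers
        self.eta, self.gamma = eta, gamma

    def step(self, step_idx):
        std = (self.eta / (1 + step_idx) ** self.gamma) ** 0.5
        # Keep the clean gradients so every optimiser starts from the same
        # point and only the injected noise differs between them.
        clean = [None if p.grad is None else p.grad.clone() for p in self.params]
        for opt in self.optimizers:
            for p, g in zip(self.params, clean):
                if g is not None:
                    p.grad = g + torch.randn_like(g) * std
            opt.step()


# Hypothetical usage for an Adam#Adam-style combination:
#   params = list(model.parameters())
#   combo = NoisyOptimizerCombo(params, [torch.optim.Adam(params, lr=1e-3),
#                                        torch.optim.Adam(params, lr=1e-3)])
#   loss.backward(); combo.step(step_idx=global_step)
```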
This issue tracks the progress of such a scheme.

    Labels

        ML: Requires machine-learning knowledge (can be built up on the fly)
        core: Improves core model while keeping core idea intact
        research: Creative project that might fail but could give high returns
