flip order of weight+bias application in LayerNormANE by carsonswope · Pull Request #2 · apple/ml-ane-transformers

carsonswope · 2022-08-31T00:12:29Z

Hi, I'm attempting to duplicate the pytorch LayerNorm functionality, and the formula that pytorch uses is clearly (out * weight) + bias, which does not match the code in LayerNormANE.

So I changed it for my use case, and thought I'd open a PR in case this is in fact a bug.

However.. looking at 4b37184, it looks like there is some history and/or legacy reasons for the order being this way, so feel free to reject if I'm missing something :)

flip order of weight+bias application in LayerNormANE

c3907a9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

flip order of weight+bias application in LayerNormANE#2

flip order of weight+bias application in LayerNormANE#2
carsonswope wants to merge 1 commit intoapple:mainfrom
carsonswope:main

carsonswope commented Aug 31, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

carsonswope commented Aug 31, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant