Skip to content

multihead attention#2375

Merged
lozhnikov merged 21 commits into
mlpack:masterfrom
mrityunjay-tripathi:transformer
Aug 24, 2020
Merged

multihead attention#2375
lozhnikov merged 21 commits into
mlpack:masterfrom
mrityunjay-tripathi:transformer

Conversation

@mrityunjay-tripathi
Copy link
Copy Markdown
Member

@mrityunjay-tripathi mrityunjay-tripathi commented Apr 17, 2020

Hi everyone,
I've worked on the implementation of multihead attention. The multihead attention layer would be required for the Transformer model. Debugging and refactoring of the code will come subsequently but this is the initial structure on which I will be working on. The implementation is mostly motivated from PyTorch and Tensorflow.

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants