Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
http://nlp.seas.harvard.edu/2018/04/03/attention.html
https://blog.floydhub.com/the-transformer-in-pytorch/
http://jalammar.github.io/illustrated-transformer/
https://towardsdatascience.com/transformers-141e32e69591