https://jalammar.github.io/illustrated-transformer/ http://nlp.seas.harvard.edu/2018/04/03/attention.html https://www.youtube.com/watch?v=kCc8FmEb1nY