attention_sentence https://jalammar.github.io/visualizing-neural-machine-translation-mechanics-of-seq2seq-models-with-attention/ transformer https://lilianweng.github.io/lil-log/2018/06/24/attention-attention.html