Week 14: Training and Using Large Language Models Attention and Transformer Networks

Learning Objectives

Concepts

Without any programming, you should be able to: * Relate each compnent of the English to Portugese translator to attention and transformer networks, especially which parts are used for the attention encoding, the attention decoding, and the transformer.

Implementation

Using the Python programming language, you should be able to: * Use the Keras and Tensorflow libraries and the given code to implement an English to Portugese translator including explaining the purpose of the given code. * Relate each compnent of the English to Portugese translator to attention and transformer networks