Weeks 14 and 15: Training and Using Large Language Models Attention and Transformer Networks
Learning Objectives
Concepts
Without any programming, you should be able to:
- Relate each component of the English to Portuguese translator to attention and transformer networks, especially which parts are used for the attention encoding, the attention decoding, and the transformer.
Implementation
Using the Python programming language, you should be able to:
- Use the Keras and Tensorflow libraries and the given code to implement an English to Portuguese translator including explaining the purpose of the given code and altering the given code to improve its function or add additional functionality.
- Relate each component of the English to Portuguese translator to attention and transformer networks