16. Implementation

This chapter delves into the implementation of the Transformer model using TensorFlow.

Complete Python code is available at: Transformer-tf.py

While the previous chapter covered multi-head attention, position-wise feed-forward networks, and positional encoding, this chapter focuses on the Transformer’s encoder and decoder, along with the training and translation code.
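As a quick orientation before the full code, the sketch below shows how one encoder layer can be assembled from those building blocks in TensorFlow. It is a minimal illustration under assumed names, not the code from Transformer-tf.py: it uses the built-in tf.keras.layers.MultiHeadAttention instead of the multi-head attention built in the previous chapter, and the hyperparameters (d_model, num_heads, dff, rate) are chosen purely for the example.

    # Minimal sketch of one Transformer encoder layer (not the book's Transformer-tf.py code).
    import tensorflow as tf

    class EncoderLayer(tf.keras.layers.Layer):
        def __init__(self, d_model, num_heads, dff, rate=0.1):
            super().__init__()
            # Built-in multi-head attention stands in for the version from the previous chapter.
            self.mha = tf.keras.layers.MultiHeadAttention(num_heads=num_heads, key_dim=d_model)
            # Position-wise feed-forward network.
            self.ffn = tf.keras.Sequential([
                tf.keras.layers.Dense(dff, activation="relu"),
                tf.keras.layers.Dense(d_model),
            ])
            self.norm1 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
            self.norm2 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
            self.drop1 = tf.keras.layers.Dropout(rate)
            self.drop2 = tf.keras.layers.Dropout(rate)

        def call(self, x, training=False, mask=None):
            # Self-attention sub-layer with residual connection and layer normalization.
            attn = self.mha(query=x, value=x, key=x, attention_mask=mask)
            x = self.norm1(x + self.drop1(attn, training=training))
            # Feed-forward sub-layer with residual connection and layer normalization.
            ffn_out = self.ffn(x)
            return self.norm2(x + self.drop2(ffn_out, training=training))

    # Example usage: a batch of 64 sequences of length 40 with d_model = 128.
    x = tf.random.uniform((64, 40, 128))
    layer = EncoderLayer(d_model=128, num_heads=8, dff=512)
    out = layer(x)  # output shape: (64, 40, 128)

The decoder layer follows the same pattern with an additional cross-attention sub-layer over the encoder output, as covered later in this chapter.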

Chapter Contents

16.1. Create Model
16.2. Training
16.3. Translation

Info

A PyTorch implementation of the Transformer model is available on the following website: