17. Dig into the components

In this chapter, we will dig into the core components of the Transformer model. In particular, we will analyze the limitations of the original Transformer and explore the key improvements introduced by subsequent research.

Reference
  • A Survey of Transformers (20 Oct. 2022)
Part Contents

17.1. Multi-Head Attention
17.2. Positional Encoding
17.3. Position-wise Feed-Forward Networks (FFN)
