The Transformer – Attention is all you need. - Michał Chromiak's blog | Data science learning, Learn computer coding, Learning methods
Skip to content
When autocomplete results are available use up and down arrows to review and enter to select. Touch device users, explore by touch or with swipe gestures.
a block diagram with multiple blocks labeled in the bottom row and below it is an image of
mchromiak.github.io

The Transformer – Attention is all you need.

Transformer - more than meets the eye! Are we there yet? Well... not really, but... How about eliminating recurrence and convolution from transduction? Sequence modeling and transduction (e.g. language modeling, machine translation) problems solutions has been dominated by RNN (especially gated RNN) or LSTM, additionally employing the attention mechanism. Main sequence transduction models are based on RNN or CNN including encoder and decoder. The new transformer architecture is claimed...

Comments

More about this Pin

Board containing this Pin

Selected board cover image
Learning methods
640 Pins
3mo

Related interests

Understanding Transformer Functions
Understanding Transformer Models
Traffic Light State Machine Diagram
Current Transformer Connection Diagram
Transformer Model Explanation
Understanding Transformer Ratings
Understanding Transformer Components
Current Transformer Diagram
Transformer Graph Neural Network