Transformer is kind of designed for the GPU…we want an architecture that is fundamentally extremely parallelizable.
Andrej Karpathy on AI infra of the future…
Transformer is kind of designed for the GPU…we want an architecture that is fundamentally extremely parallelizable.