Transformer特辑
https://github.com/LongxingTan/Machine-learning-interview 模型结构 基本单元:token_embedding positional encoding, encoder, token_embedding positional encoding, decoderencoder: (self-attention, skip-connect, ln), (ffn, skip-connect, ln)decoder:…
2026-05-26