Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
natural-language-processing deep-learning neural-network transformers jacobi-iteration decoding-algorithm parallel-decoding jacobi-decoding
-
Updated
Mar 15, 2024 - Python