Open
Description
This is a highly cited study that apparently pioneers recurrent transformers: https://arxiv.org/abs/1807.03819
I am not fully convinced of the study's quality, though: it contains a few insufficiently substantiated claims, weird (buggy?) code excerpts, comparisons to alternatives that are not obviously fair, etc.