Universal Transformers Need Memory: Depth-State Trade-Offs in Adaptive Recursive

(arxiv.org)

1 point | by che_shr_cat 7 hours ago

No comments yet.