1 points | by fblgit 5 hours ago ago
1 comments
one of a kind single-transformer block layer, high throughput. The new generation of transformer-based lightweight models for common NLP tasks?
one of a kind single-transformer block layer, high throughput. The new generation of transformer-based lightweight models for common NLP tasks?