Low-Rank KV Attention: 50% Less Memory, Better Models

(fin.ai)

2 points | by destraynor 10 hours ago

1 comment