Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training

(github.com)

88 points | by xlayn 6 hours ago ago

28 comments