Alignment whack-a-mole: Finetuning activates recall of copyrighted books in LLMs

(github.com)

124 points | by reconnecting 6 hours ago

80 comments