Language Model Contains Personality Subnetworks

(arxiv.org)

42 points | by PaulHoule 8 hours ago ago

26 comments