Steering interpretable language models with concept algebra

(guidelabs.ai)

37 points | by luulinh90s a day ago

3 comments