How does misalignment scale with model intelligence and task complexity?

(alignment.anthropic.com)

241 points | by salkahfi 4 days ago ago

83 comments