Why Current AI Guardrails Train Models to Fake Alignment
(kellyasay.substack.com)
3 points | by kellya | 8 hours ago | 1 comment
8 hours ago
[deleted]