Model Spec Midtraining: Improving How Alignment Training Generalizes

(alignment.anthropic.com)

2 points | by bearseascape 12 hours ago ago

No comments yet.