HN
New
Show
Ask
Jobs
Built with Solid
Confidence estimation is a better metric than agreement for LLM judges
(arxiv.org)
3 points | by
rapiddev
7 hours ago ago
No comments yet.
No comments yet.