HN
New
Show
Ask
Jobs
Built with Solid
How good a detective is an AI? A Sherlock Holmes board game as an LLM-agent eval
(alexweil.github.io)
4 points | by
ajonat
5 hours ago ago
1 comments
ajonat
4 hours ago ago
[flagged]
[flagged]