EsoLang-Bench: Evaluating Genuine Reasoning in LLMs via Esoteric Languages

(esolang-bench.vercel.app)

53 points | by matt_d 3 hours ago ago

18 comments