Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

60 points | by khurdula 6 days ago ago

29 comments