I built an LLM router that doesn't use an LLM

(github.com)

4 points | by tcballard 7 hours ago

6 comments

tcballard 7 hours ago
Interestingly, a tuned length-threshold baseline performs very competitively and even beats Wayfinder on one of the benchmark sets. My intuition is that structural signals become more useful as prompt formats get more heterogeneous (code reviews, logs, tables, long instructions, etc.), but I’d love to see more real-world data.
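For illustration only, a tuned length-threshold baseline of the kind mentioned above might look roughly like the sketch below. This is not Wayfinder's code or the benchmark's actual baseline. The model names, the `make_length_router` and `tune_threshold` helpers, the use of character count as the length measure, and the accuracy-based tuning are all assumptions made for the example.

```python
from typing import Callable, Sequence

# Hypothetical tier names; a real setup would use its own model identifiers.
SMALL = "small-model"
LARGE = "large-model"


def make_length_router(threshold: int) -> Callable[[str], str]:
    """Return a router that sends prompts longer than `threshold` characters to LARGE."""
    def route(prompt: str) -> str:
        return LARGE if len(prompt) > threshold else SMALL
    return route


def tune_threshold(
    prompts: Sequence[str],
    labels: Sequence[str],
    candidates: Sequence[int],
) -> int:
    """Pick the candidate threshold with the highest routing accuracy on labeled prompts.

    Ties go to the earliest candidate in `candidates`.
    """
    best_threshold, best_accuracy = candidates[0], -1.0
    for threshold in candidates:
        route = make_length_router(threshold)
        correct = sum(route(p) == y for p, y in zip(prompts, labels))
        accuracy = correct / len(prompts)
        if accuracy > best_accuracy:
            best_threshold, best_accuracy = threshold, accuracy
    return best_threshold


# Toy labeled data (made up for the example, not benchmark data).
train_prompts = ["hi", "x" * 500]
train_labels = [SMALL, LARGE]

threshold = tune_threshold(train_prompts, train_labels, candidates=[1, 100, 1000])
router = make_length_router(threshold)
```

A baseline like this uses a single signal, so it should do well when prompt length tracks difficulty and less well when short prompts are hard or long prompts are mostly boilerplate.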
tcballard 7 hours ago
Most routing systems I found either used another model as a classifier or relied on provider-specific routing. I was interested in the narrower question: how far can deterministic heuristics get before you actually need another model in the loop?