4 points | by zxy-action 6 hours ago ago
1 comments
I’m the founder of Auriko. We ran this study to measure how much cache-aware llm routing can reduce inference costs.
Critique welcome.
I’m the founder of Auriko. We ran this study to measure how much cache-aware llm routing can reduce inference costs.
Critique welcome.