Quantifying LLM Cost Savings from Cache-Aware Inference Routing

(auriko.ai)

4 points | by zxy-action 6 hours ago ago

1 comments