Towards Compute-Aware In-Switch Computing for LLMs on Multi-GPU Systems

(arxiv.org)

1 points | by rbanffy 15 hours ago ago

No comments yet.