Show HN: Autonomous recovery for distributed training jobs

(docs.tensorpool.dev)

12 points | by tsvoboda 4 days ago ago

3 comments