LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

(gilesthomas.com)

534 points | by gpjt 9 days ago

116 comments