47 points | by yu3zhou4 3 hours ago ago
6 comments
README is in my opinion (author here) the most interesting - I wrote it to help others build useful mental model to be able to recreate the project yourself, without need to even read my code
Very nice job on read me.
>>Physically, LLM is a file which contains a lot of float numbers.
aka atoms of the LLM.
the universe is just atomic if statments
Looks interesting, it reminds me of the first llama.cpp, but better documented.
It seems the author believes checking the return values of CUDA API calls is not "tiny" enough :-(
I love the documentation formatted in lessons. I can't wait to read through it.
README is in my opinion (author here) the most interesting - I wrote it to help others build useful mental model to be able to recreate the project yourself, without need to even read my code
Very nice job on read me.
>>Physically, LLM is a file which contains a lot of float numbers.
aka atoms of the LLM.
the universe is just atomic if statments
Looks interesting, it reminds me of the first llama.cpp, but better documented.
It seems the author believes checking the return values of CUDA API calls is not "tiny" enough :-(
I love the documentation formatted in lessons. I can't wait to read through it.