(study this code to speedrun your understanding of LLMs - by alpha nerd karpathy himself):
https://github.com/karpathy/nanoGPT
https://github.com/karpathy/minGPT
https://www.youtube.com/watch?v=kCc8FmEb1nY
(intro to NNs and gradient descent, again karpathy is the best)
https://www.youtube.com/watch?v=VMj-3S1tku0&list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ
(amazing writeup on how transformers work)
(models and torrents)
/lmg/ Model Links and Torrents
(llava - training visual llms like gpt4)
https://github.com/haotian-liu/LLaVA
(prompting hacks : theory of mind)
Boosting Theory-of-Mind Performance in Large Language Models via Prompting