
The Illustrated Transformer is an amazing way to understand the original transformer architecture step by step, but if you want to truly visualize how information flows through a decoder-only architecture, from nanoGPT all the way up to a fully represented GPT-3, nothing beats this:

https://bbycroft.net/llm



whoa, that's awesome.



