Ben Chuanlong Du's Blog

Scaling Law for LLM

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Beyond neural scaling laws – Paper Explained

Scaling laws refer to the observed tendency of some machine learning architectures (notably transformers) to improve their performance along a predictable power law when given more …
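To make "predictable power law" concrete, here is a minimal sketch of fitting one. It assumes the simplest form, loss ≈ a * N^(-b), where N is some scale quantity such as parameter count. That form, the helper name `fit_power_law`, and the numbers below are illustrative assumptions, not taken from the paper or the video. A power law is a straight line in log-log space, so an ordinary linear fit on the logs recovers the exponent.

```python
import numpy as np


def fit_power_law(x, y):
    """Fit y ~ a * x**(-b) by least squares in log-log space; return (a, b)."""
    log_x = np.log(np.asarray(x, dtype=float))
    log_y = np.log(np.asarray(y, dtype=float))
    # log y = log a - b * log x, so the slope is -b and the intercept is log a.
    slope, intercept = np.polyfit(log_x, log_y, 1)
    return float(np.exp(intercept)), float(-slope)


# Synthetic data (an assumption for illustration, not measured results):
# losses generated from an exact power law with a = 50 and b = 0.1.
sizes = np.array([1e6, 1e7, 1e8, 1e9])
losses = 50.0 * sizes ** -0.1

a, b = fit_power_law(sizes, losses)
print(f"a = {a:.3f}, b = {b:.3f}")
```

Real loss curves are noisy and often flatten toward an irreducible floor, so a pure power law like this is only a first approximation.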