Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Beyond neural scaling laws – Paper Explained
Scaling Laws refer to the observed trend of some machine learning architectures (notably transformers) to scale their performance on predictable power law when given more …