Skip to content
X Β· @cwolferesearch Β· X / Twitter

The 🐐 is back!

The 🐐 is back!Lilian Weng: A super long overdue (3+ years?) post on scaling laws.Compute is expensive. Scaling laws are a way to help us reason about the optimal compute allocation between data and model size before committing to a large run. The post covers what scaling laws predict, how