X · @cwolferesearch
The π is back!
Lilian Weng: A super long overdue (3+ years?) post on scaling laws. Compute is expensive. Scaling laws are a way to help us reason about the optimal compute allocation between data and model size before committing to a large run. The post covers what scaling laws predict, how