Skip to content
X · @lilianweng · X / Twitter

Probably the first product Thinky will build is a full panel of dials that researchers can use to physically adjust all the hparams during training. W…

Probably the first product Thinky will build is a full panel of dials that researchers can use to physically adjust all the hparams during training. We gonna do hardware one day and it is the time 😂Stephen Roller: Some teams use sweeps, heuristics, or scaling laws to determine their training LR. At Character, we just h