X · @lilianweng
· X / Twitter
Probably the first product Thinky will build is a full panel of dials that researchers can use to physically adjust all the hparams during training. W…
Probably the first product Thinky will build is a full panel of dials that researchers can use to physically adjust all the hparams during training. We gonna do hardware one day and it is the time 😂Stephen Roller: Some teams use sweeps, heuristics, or scaling laws to determine their training LR. At Character, we just h