@rasbt · X / Twitter
It's been *almost* a bit quiet around LLM architecture releases in the past two weeks 😅

An interesting tidbit is the parallel block design. Via the Cmd-A tech report: "equivalent performance but significant improvement in throughput compared to the vanilla transformer block."

Cohere: Introducing: Cohere Command A+We’ve c
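
For readers who haven't seen a parallel block before, here is a rough sketch of how it differs from the vanilla (sequential) block. It assumes the commonly described formulation where attention and the feed-forward layer both read the same normalized input and their outputs are summed into the residual. The post and quote don't spell out the exact layout, so treat this as an illustration, not Cohere's implementation. `toy_attn`, `toy_mlp`, and the RMS-style `norm` are hypothetical stand-ins so the example runs with only the standard library.

```python
import math


def norm(x, eps=1e-6):
    """RMS-style normalization without a learned scale (an assumption for brevity)."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [v / rms for v in x]


def toy_attn(x):
    # Hypothetical stand-in for self-attention: NOT real attention,
    # just a fixed map so the block structure can be exercised.
    mean = sum(x) / len(x)
    return [0.5 * mean for _ in x]


def toy_mlp(x):
    # Hypothetical stand-in for the feed-forward layer: elementwise ReLU.
    return [max(0.0, v) for v in x]


def add(*vecs):
    """Elementwise sum of equally sized vectors (the residual addition)."""
    return [sum(vals) for vals in zip(*vecs)]


def sequential_block(x, attn=toy_attn, mlp=toy_mlp):
    # Vanilla pre-norm block: the MLP has to wait for the attention output.
    h = add(x, attn(norm(x)))
    return add(h, mlp(norm(h)))


def parallel_block(x, attn=toy_attn, mlp=toy_mlp):
    # Parallel block: one normalization, both sublayers read the same input,
    # and both outputs are added to the residual stream in a single step.
    n = norm(x)
    return add(x, attn(n), mlp(n))


if __name__ == "__main__":
    x = [1.0, -2.0, 3.0]
    print("sequential:", sequential_block(x))
    print("parallel:  ", parallel_block(x))
```

Because the two sublayers in `parallel_block` no longer depend on each other, a real implementation can compute them concurrently or fuse their input projections, which is the usual explanation given for the throughput gain.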