o1 tops aider’s new polyglot leaderboard
o1 scores the top result on aider's new multi-language, more challenging coding benchmark.
Every story across every category, newest first. Each card links to the original publisher; daily-brief posts open as editorial pages.
o1 scores the top result on aider's new multi-language, more challenging coding benchmark.
We've worked with dozens of teams building LLM agents across industries. Consistently, the most successful implementations use simple, composable patterns rather than complex frameworks.
Making sense of recent technology trends and claims
Technology Isn’t the Problem—or the Solution.
nbsanity - Share Notebooks as Polished Web Pages in Seconds
In this post, we show that when two TopK SAEs are trained on the same data, with the same batch order but with different random initializations,…
QwQ is reasoning model like o1, and needs to be used as an architect with another model as editor.
With regard to writing, there are many rules and also no rules at all.
Building an Audience Through Technical Writing: Strategies and Mistakes
Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing…
GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Note: This is the pronunciation of QwQ: /kwju:/ , similar to the word “quill”. What does it mean to think,…
Yi and Yi 1.5 are evolving🌳Omar Sanseviero: The (non-exhaustive) evolution of base modelsIf you want to learn more about it and how to use these models,…
Merge pull request #620 from 01-ai/Mia-xia-patch-3 Update README.md
Adds controlnet images, updates README (#21) * Adds blur and depth images * Cosmetic changes to REEADME --------- Co-authored-by: Vikram Voleti
Merge pull request #20 from Stability-AI/bf/controlnet ControlNet support
fixed latent encoder behavior based on control type
Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers.
Open source LLMs are becoming very powerful, but pay attention to how you (or your provider) are serving the model. It can affect code editing skill.
Fixes for VAE logic and 2B ControlNets, and speed up model loading by loading ControlNets to CUDA if available
minor changes to image saving and controlnet loading
Setting up my new MacBook Pro from scratch
What is the Role of Mathematics in Modern Machine Learning?The past decade has witnessed a shift in how progress is made in machine learning. Research involving…
RT Kai-Fu Leehttp://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tomshardware.com/tech-industry/artificial-intelligence/chinese-company-trained-gpt-4-rival-with-just-2-000-gpus-01-ai-spent-usd3m-compared-to-openais-usd80m-to-usd100m
🎉Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you'll build!#YiLightning #LLM…