Feed · AI Feed

Aider Infrastructure December 21, 2024

o1 tops aider’s new polyglot leaderboard

o1 scores the top result on aider's new multi-language, more challenging coding benchmark.

Anthropic Engineering Frontier Labs December 19, 2024

Building effective agents

We've worked with dozens of teams building LLM agents across industries. Consistently, the most successful implementations use simple, composable patterns rather than complex frameworks.

AI Snake Oil (Narayanan) Newsletters December 18, 2024

Is AI progress slowing down?

Making sense of recent technology trends and claims

AI Snake Oil (Narayanan) Newsletters December 13, 2024

We Looked at 78 Election Deepfakes. Political Misinformation is not an AI Problem.

Technology Isn’t the Problem—or the Solution.

Hamel Husain Tech Media December 13, 2024

nbsanity – Share Notebooks as Polished Web Pages in Seconds

EleutherAI Open Source December 12, 2024

SAEs trained on the same data don’t learn the same features

In this post, we show that when two TopK SAEs are trained on the same data, with the same batch order but with different random initializations,…

Aider Infrastructure December 3, 2024

QwQ is a code architect, not an editor

QwQ is a reasoning model like o1, and needs to be used as an architect with another model as editor.

Eugene Yan Tech Media December 1, 2024

Seemingly Paradoxical Rules of Writing

With regard to writing, there are many rules and also no rules at all.

Hamel Husain Tech Media November 30, 2024

Building an Audience Through Technical Writing: Strategies and Mistakes

Lilian Weng Tech Media November 28, 2024

Reward Hacking in Reinforcement Learning

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing…

Alibaba Qwen News November 27, 2024

QwQ: Reflect Deeply on the Boundaries of the Unknown

Note: This is the pronunciation of QwQ: /kwju:/, similar to the word “quill”. What does it mean to think,…

X · @01AI_Yi China Labs November 27, 2024

Yi and Yi 1.5 are evolving🌳

Yi and Yi 1.5 are evolving🌳 Omar Sanseviero: The (non-exhaustive) evolution of base models. If you want to learn more about it and how to use these models,…

01.AI Yi (GitHub) China Labs November 27, 2024

Merge pull request #620 from 01-ai/Mia-xia-patch-3

Update README.md

Stable Diffusion 3.5 Generative Media November 26, 2024

Adds controlnet images, updates README (#21)

Adds blur and depth images. Cosmetic changes to README. Co-authored-by: Vikram Voleti

Stable Diffusion 3.5 Generative Media November 26, 2024

Merge pull request #20 from Stability-AI/bf/controlnet

ControlNet support

Stable Diffusion 3.5 Generative Media November 26, 2024

fixed latent encoder behavior based on control type

Eugene Yan Tech Media November 24, 2024

How to Run a Weekly Paper Club (and Build a Learning Community)

Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers.

Aider Infrastructure November 21, 2024

Details matter with open source models

Open source LLMs are becoming very powerful, but pay attention to how you (or your provider) are serving the model. It can affect code editing skill.

Stable Diffusion 3.5 Generative Media November 20, 2024

Fixes for VAE logic and 2B ControlNets, and speed up model loading by…

Fixes for VAE logic and 2B ControlNets, and speed up model loading by loading ControlNets to CUDA if available

Stable Diffusion 3.5 Generative Media November 20, 2024

minor changes to image saving and controlnet loading

Eugene Yan Tech Media November 17, 2024

My Minimal MacBook Pro Setup Guide

Setting up my new MacBook Pro from scratch

The Gradient Newsletters November 16, 2024

Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research

What is the Role of Mathematics in Modern Machine Learning? The past decade has witnessed a shift in how progress is made in machine learning. Research involving…

X · @01AI_Yi China Labs November 16, 2024

RT Kai-Fu Lee: http://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tom…

RT Kai-Fu Lee: http://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tomshardware.com/tech-industry/artificial-intelligence/chinese-company-trained-gpt-4-rival-with-just-2-000-gpus-01-ai-spent-usd3m-compared-to-openais-usd80m-to-usd100m

X · @01AI_Yi China Labs November 15, 2024

🎉Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you…

🎉 Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you'll build! #YiLightning #LLM…

Feed 4,355 posts
