AI Feed

Hamel Husain Tech Media November 30, 2024

Building an Audience Through Technical Writing: Strategies and Mistakes

Lilian Weng Tech Media November 28, 2024

Reward Hacking in Reinforcement Learning

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing…

Alibaba Qwen News November 27, 2024

QwQ: Reflect Deeply on the Boundaries of the Unknown

GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Note: This is the pronunciation of QwQ: /kwju:/ , similar to the word “quill”. What does it mean to think,…

X · @01AI_Yi China Labs November 27, 2024

Yi and Yi 1.5 are evolving🌳

Yi and Yi 1.5 are evolving🌳Omar Sanseviero: The (non-exhaustive) evolution of base modelsIf you want to learn more about it and how to use these models,…

01.AI Yi (GitHub) China Labs November 27, 2024

Merge pull request #620 from 01-ai/Mia-xia-patch-3

Merge pull request #620 from 01-ai/Mia-xia-patch-3 Update README.md

Stable Diffusion 3.5 Generative Media November 26, 2024

Adds controlnet images, updates README (#21)

Adds controlnet images, updates README (#21) * Adds blur and depth images * Cosmetic changes to REEADME --------- Co-authored-by: Vikram Voleti

Stable Diffusion 3.5 Generative Media November 26, 2024

Merge pull request #20 from Stability-AI/bf/controlnet

Merge pull request #20 from Stability-AI/bf/controlnet ControlNet support

Stable Diffusion 3.5 Generative Media November 26, 2024

fixed latent encoder behavior based on control type

Eugene Yan Tech Media November 24, 2024

How to Run a Weekly Paper Club (and Build a Learning Community)

Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers.

Aider Infrastructure November 21, 2024

Details matter with open source models

Open source LLMs are becoming very powerful, but pay attention to how you (or your provider) are serving the model. It can affect code editing skill.

Stable Diffusion 3.5 Generative Media November 20, 2024

Fixes for VAE logic and 2B ControlNets, and speed up model loading by…

Fixes for VAE logic and 2B ControlNets, and speed up model loading by loading ControlNets to CUDA if available

Stable Diffusion 3.5 Generative Media November 20, 2024

minor changes to image saving and controlnet loading

Eugene Yan Tech Media November 17, 2024

My Minimal MacBook Pro Setup Guide

Setting up my new MacBook Pro from scratch

The Gradient Newsletters November 16, 2024

Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research

What is the Role of Mathematics in Modern Machine Learning?The past decade has witnessed a shift in how progress is made in machine learning. Research involving…

X · @01AI_Yi China Labs November 16, 2024

RT Kai-Fu Lee: http://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tom…

RT Kai-Fu Leehttp://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tomshardware.com/tech-industry/artificial-intelligence/chinese-company-trained-gpt-4-rival-with-just-2-000-gpus-01-ai-spent-usd3m-compared-to-openais-usd80m-to-usd100m

X · @01AI_Yi China Labs November 15, 2024

🎉Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you…

🎉Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you'll build!#YiLightning #LLM…

X · @01AI_Yi China Labs November 15, 2024

RT Rohan Paul: Chinese startup 01 .ai trains competitive LLM using 95% fewer resources through innovative engineering optimization. 01 .ai trained a G…

RT Rohan PaulChinese startup 01 .ai trains competitive LLM using 95% fewer resources through innovative engineering optimization.01 .ai trained a GPT-4 competitor using just 2,000 GPUs…

FLUX (Black Forest Labs) Generative Media November 14, 2024

Ruff ci (#194)

Ruff ci (#194) * apply ruff * rename * specify ruff version for CI * also check imports * check formatting

Alibaba Qwen News November 14, 2024

Extending the Context Length to 1M Tokens!

API Documentation (Chinese) HuggingFace Demo ModelScope Demo Introduction After the release of Qwen2.5, we heard the community’s demand for processing longer contexts. In recent months, we…

AI Snake Oil (Narayanan) Newsletters November 11, 2024

Does the UK’s liver transplant matching algorithm systematically exclude younger patients?

Seemingly minor technical decisions can have life-or-death effects

Alibaba Qwen News November 11, 2024

Qwen2.5-Coder Series: Powerful, Diverse, Practical.

GITHUB HUGGING FACE MODELSCOPE KAGGLE DEMO DISCORD Introduction Today, we are excited to open source the “Powerful”, “Diverse”, and “Practical” Qwen2.5-Coder series, dedicated to continuously promoting…

01.AI Yi (GitHub) China Labs November 11, 2024

Merge pull request #619 from 01-ai/Anonymitaet-patch-2

Merge pull request #619 from 01-ai/Anonymitaet-patch-2 Update README.md

EleutherAI Open Source November 10, 2024

Partially rewriting an LLM in natural language

Using interpretations of SAE latents to simulate activations.

Allen Ai2 (Medium) Open Source November 7, 2024

We’re moving our blog!

We’re excited to announce that our blog is moving to its new home! From now on, all our new blog posts will be published directly on…

Latest

Building an Audience Through Technical Writing: Strategies and Mistakes

Reward Hacking in Reinforcement Learning

QwQ: Reflect Deeply on the Boundaries of the Unknown

Yi and Yi 1.5 are evolving🌳

Merge pull request #620 from 01-ai/Mia-xia-patch-3

Adds controlnet images, updates README (#21)

Merge pull request #20 from Stability-AI/bf/controlnet

fixed latent encoder behavior based on control type

How to Run a Weekly Paper Club (and Build a Learning Community)

Details matter with open source models

Fixes for VAE logic and 2B ControlNets, and speed up model loading by…

minor changes to image saving and controlnet loading

My Minimal MacBook Pro Setup Guide

Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research

RT Kai-Fu Lee: http://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tom…

🎉Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you…

RT Rohan Paul: Chinese startup 01 .ai trains competitive LLM using 95% fewer resources through innovative engineering optimization. 01 .ai trained a G…

Ruff ci (#194)

Extending the Context Length to 1M Tokens!

Does the UK’s liver transplant matching algorithm systematically exclude younger patients?

Qwen2.5-Coder Series: Powerful, Diverse, Practical.

Merge pull request #619 from 01-ai/Anonymitaet-patch-2

Partially rewriting an LLM in natural language

We’re moving our blog!

Browse by category