Building an Audience Through Technical Writing: Strategies and Mistakes
Building an Audience Through Technical Writing: Strategies and Mistakes
Building an Audience Through Technical Writing: Strategies and Mistakes
Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards, without genuinely learning or completing…
GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Note: This is the pronunciation of QwQ: /kwju:/ , similar to the word “quill”. What does it mean to think,…
Yi and Yi 1.5 are evolving🌳Omar Sanseviero: The (non-exhaustive) evolution of base modelsIf you want to learn more about it and how to use these models,…
Merge pull request #620 from 01-ai/Mia-xia-patch-3 Update README.md
Adds controlnet images, updates README (#21) * Adds blur and depth images * Cosmetic changes to REEADME --------- Co-authored-by: Vikram Voleti
Merge pull request #20 from Stability-AI/bf/controlnet ControlNet support
fixed latent encoder behavior based on control type
Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers.
Open source LLMs are becoming very powerful, but pay attention to how you (or your provider) are serving the model. It can affect code editing skill.
Fixes for VAE logic and 2B ControlNets, and speed up model loading by loading ControlNets to CUDA if available
minor changes to image saving and controlnet loading
Setting up my new MacBook Pro from scratch
What is the Role of Mathematics in Modern Machine Learning?The past decade has witnessed a shift in how progress is made in machine learning. Research involving…
RT Kai-Fu Leehttp://01.ai trained the #6 model in the world for $3M pre-train cost. And the inference price is $0.14/million tokens! https://www.tomshardware.com/tech-industry/artificial-intelligence/chinese-company-trained-gpt-4-rival-with-just-2-000-gpus-01-ai-spent-usd3m-compared-to-openais-usd80m-to-usd100m
🎉Love seeing Yi models in @CamelAIOrg! Powerful Yi models join forces with this awesome multi-agent framework. Can't wait to see what AI agents you'll build!#YiLightning #LLM…
RT Rohan PaulChinese startup 01 .ai trains competitive LLM using 95% fewer resources through innovative engineering optimization.01 .ai trained a GPT-4 competitor using just 2,000 GPUs…
Ruff ci (#194) * apply ruff * rename * specify ruff version for CI * also check imports * check formatting
API Documentation (Chinese) HuggingFace Demo ModelScope Demo Introduction After the release of Qwen2.5, we heard the community’s demand for processing longer contexts. In recent months, we…
Seemingly minor technical decisions can have life-or-death effects
GITHUB HUGGING FACE MODELSCOPE KAGGLE DEMO DISCORD Introduction Today, we are excited to open source the “Powerful”, “Diverse”, and “Practical” Qwen2.5-Coder series, dedicated to continuously promoting…
Merge pull request #619 from 01-ai/Anonymitaet-patch-2 Update README.md
Using interpretations of SAE latents to simulate activations.
We’re excited to announce that our blog is moving to its new home! From now on, all our new blog posts will be published directly on…