AI Feed

EleutherAI Open Source August 5, 2024

Mechanistic Anomaly Detection Research Update

Interim report on ongoing work on mechanistic anomaly detection

X · @01AI_Yi China Labs August 5, 2024

🔥 Meet Yi-Large Turbo: the powerful, cost-effective upgrade to Yi-Large. Faster and more affordable at only $0.19 per 1M tokens for input and outpu…

🔥 Meet Yi-Large Turbo: the powerful, cost-effective upgrade to Yi-Large. Faster and more affordable at only $0.19 per 1M tokens for input and output. Ideal for…

The Gradient Newsletters August 3, 2024

We Need Positive Visions for AI Grounded in Wellbeing

IntroductionImagine yourself a decade ago, jumping directly into the present shock of conversing naturally with an encyclopedic AI that crafts images, writes code, and debates philosophy.…

EleutherAI Open Source July 30, 2024

Open Source Automated Interpretability for Sparse Autoencoder Features

Building and evaluating an open-source pipeline for auto-interpretability

Hamel Husain Tech Media July 29, 2024

An Open Course on LLMs, Led by Practitioners

AI Snake Oil (Narayanan) Newsletters July 26, 2024

AI existential risk probabilities are too unreliable to inform policy

How speculation gets laundered through pseudo-quantification

Chip Huyen Tech Media July 25, 2024

Building A Generative AI Platform

After studying how companies deploy generative AI applications, I noticed many similarities in their platforms. This post outlines the common components of a generative AI platform,…

Lilian Weng Tech Media July 7, 2024

Extrinsic Hallucinations in LLMs

Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucination has been somewhat generalized to…

AI Snake Oil (Narayanan) Newsletters July 3, 2024

New paper: AI agents that matter

Rethinking AI agent benchmarking and evaluation

EleutherAI Open Source June 14, 2024

Experiments in Weak-to-Strong Generalization

Writing up results from a recent project

EleutherAI Open Source June 13, 2024

Free Form Least-Squares Concept Erasure Without Oracle Concept Labels

Achieving even more surgical edits than LEACE without concept labels at inference time.

Hamel Husain Tech Media June 1, 2024

What We’ve Learned From A Year of Building with LLMs

EleutherAI Open Source May 22, 2024

VINC-S: Closed-form Optionally-supervised Knowledge Elicitation with Paraphrase Invariance

Writing up results from a project from Spring 2023

The Gradient Newsletters April 20, 2024

Financial Market Applications of LLMs

The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered…

Chip Huyen Tech Media April 17, 2024

Measuring personal growth

My founder friends constantly think about growth. They think about how to measure their business growth and how to get to the next order of magnitude…

Lilian Weng Tech Media April 12, 2024

Diffusion Models for Video Generation

Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video…

The Gradient Newsletters April 8, 2024

A Brief Overview of Gender Bias in AI

A brief overview and discussion on gender bias in AI

Anthropic Red (safety) Frontier Labs March 31, 2024

Coordinated Vulnerability Disclosure Dashboard

A regularly updated record of vulnerabilities found by Anthropic and reported to maintainers with cryptographic commitments to each finding at the time of disclosure.

The Gradient Newsletters March 28, 2024

Mamba Explained

Is Attention all you need? Mamba, a novel AI model based on State Space Models (SSMs), emerges as a formidable alternative to the widely used Transformer…

EleutherAI Open Source March 25, 2024

Yi-34B, Llama 2, and common practices in LLM training: a fact check of the New York Times

Setting the record straight regarding Yi-34B and Llama 2.

Chip Huyen Tech Media March 14, 2024

What I learned from looking at 900 most popular open source AI tools

[Hacker News discussion, LinkedIn discussion, Twitter thread] Update (Feb 2026): The full list of open source AI repos is hosted at Good AI List, updated daily.…

The Gradient Newsletters March 8, 2024

Car-GPT: Could LLMs finally make self-driving cars happen?

Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?

The Gradient Newsletters March 5, 2024

Do text embeddings perfectly encode text?

'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.

LlamaIndex blog Infrastructure March 4, 2024

Unlocking the 3rd Dimension for Generative AI (Part 1)

It would be an understatement to say that generative AI has been taking the world by storm over the past couple of years. While text (1D)…

Latest