Mechanistic Anomaly Detection Research Update
Interim report on ongoing work on mechanistic anomaly detection
Interim report on ongoing work on mechanistic anomaly detection
🔥 Meet Yi-Large Turbo: the powerful, cost-effective upgrade to Yi-Large. Faster and more affordable at only $0.19 per 1M tokens for input and output. Ideal for…
IntroductionImagine yourself a decade ago, jumping directly into the present shock of conversing naturally with an encyclopedic AI that crafts images, writes code, and debates philosophy.…
Building and evaluating an open-source pipeline for auto-interpretability
An Open Course on LLMs, Led by Practitioners
How speculation gets laundered through pseudo-quantification
After studying how companies deploy generative AI applications, I noticed many similarities in their platforms. This post outlines the common components of a generative AI platform,…
Hallucination in large language models usually refers to the model generating unfaithful, fabricated, inconsistent, or nonsensical content. As a term, hallucination has been somewhat generalized to…
Rethinking AI agent benchmarking and evaluation
Writing up results from a recent project
Achieving even more surgical edits than LEACE without concept labels at inference time.
What We’ve Learned From A Year of Building with LLMs
Writing up results from a project from Spring 2023
The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered…
My founder friends constantly think about growth. They think about how to measure their business growth and how to get to the next order of magnitude…
Diffusion models have demonstrated strong results on image synthesis in past years. Now the research community has started working on a harder task—using it for video…
A brief overview and discussion on gender bias in AI
A regularly updated record of vulnerabilities found by Anthropic and reported to maintainers with cryptographic commitments to each finding at the time of disclosure.
Is Attention all you need? Mamba, a novel AI model based on State Space Models (SSMs), emerges as a formidable alternative to the widely used Transformer…
Setting the record straight regarding Yi-34B and Llama 2.
[Hacker News discussion, LinkedIn discussion, Twitter thread] Update (Feb 2026): The full list of open source AI repos is hosted at Good AI List, updated daily.…
Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?
'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.
It would be an understatement to say that generative AI has been taking the world by storm over the past couple of years. While text (1D)…