AI Feed

Distill.pub Papers July 2, 2021

Distill Hiatus

After five years, Distill will be taking a break.

Lilian Weng Tech Media May 31, 2021

Contrastive Representation Learning

The goal of contrastive representation learning is to learn such an embedding space in which similar sample pairs stay close to each other while dissimilar ones…

Distill.pub Papers May 6, 2021

Adversarial Reprogramming of Neural Cellular Automata

Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.

Jay Alammar Tech Media May 4, 2021

Explainable AI Cheat Sheet

Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their predictions. I…

Distill.pub Papers April 8, 2021

Weight Banding

Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.

Distill.pub Papers April 5, 2021

Branch Specialization

When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.

Lilian Weng Tech Media March 21, 2021

Reducing Toxicity in Language Models

Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior and biases from the Internet. Pretrained language…

Distill.pub Papers March 4, 2021

Multimodal Neurons in Artificial Neural Networks

We report the existence of multimodal neurons in artificial neural networks, similar to those found in the human brain.

Distill.pub Papers February 11, 2021

Self-Organising Textures

Neural Cellular Automata learn to generate textures, exhibiting surprising properties.

Distill.pub Papers February 4, 2021

Visualizing Weights

We present techniques for visualizing, contextualizing, and understanding neural network weights.

Distill.pub Papers January 30, 2021

Curve Circuits

Reverse engineering the curve detection algorithm from InceptionV1 and reimplementing it from scratch.

Distill.pub Papers January 27, 2021

High-Low Frequency Detectors

A family of early-vision neurons reacting to directional transitions from high to low spatial frequency.

Jay Alammar Tech Media January 19, 2021

Finding the Words to Say: Hidden State Visualizations for Language Models

By visualizing the hidden state between a model's layers, we can get some clues as to the model's "thought process". Figure: Finding the words to say…

Lilian Weng Tech Media January 2, 2021

Controllable Neural Text Generation

[Updated on 2021-02-01: Updated to version 2.0 with several work added and many typos fixed.] [Updated on 2021-05-26: Add P-tuning and Prompt Tuning in the “prompt…

Jay Alammar Tech Media December 17, 2020

Interfaces for Explaining Transformer Language Models

Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries generated by a…

Distill.pub Papers December 8, 2020

Naturally Occurring Equivariance in Neural Networks

Neural networks naturally learn many transformed copies of the same feature, connected by symmetric weights.

Distill.pub Papers November 17, 2020

Understanding RL Vision

With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.

Lilian Weng Tech Media October 29, 2020

How to Build an Open-Domain Question Answering System?

[Updated on 2020-11-12: add an example on closed-book factual QA using OpenAI API (beta). A model that can answer any question with regard to factual knowledge…

Distill.pub Papers September 11, 2020

Communicating with Interactive Articles

Examining the design of interactive articles by synthesizing theory from disciplines such as education, journalism, and visualization.

Distill.pub Papers August 27, 2020

Thread: Differentiable Self-organizing Systems

A collection of articles and comments with the goal of understanding how to design robust and general purpose self-organizing systems.

Distill.pub Papers August 27, 2020

Self-classifying MNIST Digits

Training an end-to-end differentiable, self-organising cellular automata for classifying MNIST digits.

Lilian Weng Tech Media August 6, 2020

Neural Architecture Search

Although most popular and successful model architectures are designed by human experts, it doesn’t mean we have explored the entire network architecture space and settled down…

Jay Alammar Tech Media July 27, 2020

How GPT3 Works – Visualizations and Animations

Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russian, Turkish The tech world is abuzz with…

Distill.pub Papers June 17, 2020

Curve Detectors

Part one of a three part deep dive into the curve neuron family.

Latest