AI Feed

Lilian Weng Tech Media August 6, 2020

Neural Architecture Search

Although most popular and successful model architectures are designed by human experts, it doesn’t mean we have explored the entire network architecture space and settled down…

Jay Alammar Tech Media July 27, 2020

How GPT3 Works – Visualizations and Animations

Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russian, Turkish The tech world is abuzz with…

Distill.pub Papers June 17, 2020

Curve Detectors

Part one of a three part deep dive into the curve neuron family.

Lilian Weng Tech Media June 7, 2020

Exploration Strategies in Deep Reinforcement Learning

[Updated on 2020-06-17: Add “exploration via disagreement” in the “Forward Dynamics” section. Exploitation versus exploration is a critical topic in Reinforcement Learning. We’d like the RL…

Distill.pub Papers May 5, 2020

Exploring Bayesian Optimization

How to tune hyperparameters for your machine learning model using Bayesian optimization.

Lilian Weng Tech Media April 7, 2020

The Transformer Family

[Updated on 2023-01-27: After almost three years, I did a big refactoring update of this post to incorporate a bunch of new Transformer models since 2020.…

Distill.pub Papers April 1, 2020

An Overview of Early Vision in InceptionV1

An overview of all the neurons in the first five layers of InceptionV1, organized into a taxonomy of 'neuron groups.'

Distill.pub Papers March 16, 2020

Visualizing Neural Networks with the Grand Tour

By focusing on linear dimensionality reduction, we show how to visualize many dynamic phenomena in neural networks.

Distill.pub Papers March 10, 2020

Thread: Circuits

What can we learn if we invest heavily in reverse engineering a single neural network?

Distill.pub Papers March 10, 2020

Zoom In: An Introduction to Circuits

By studying the connections between neurons, we can find meaningful algorithms in the weights of neural networks.

Distill.pub Papers February 11, 2020

Growing Neural Cellular Automata

Training an end-to-end differentiable, self-organising cellular automata model of morphogenesis, able to both grow and regenerate specific patterns.

Lilian Weng Tech Media January 29, 2020

Curriculum for Reinforcement Learning

[Updated on 2020-02-03: mentioning PCG in the “Task-Specific Curriculum” section. [Updated on 2020-02-04: Add a new “curriculum through distillation” section.

Distill.pub Papers January 10, 2020

Visualizing the Impact of Feature Attribution Baselines

Exploring the baseline input hyperparameter, and how it impacts interpretations of neural network behavior.

Distill.pub Papers November 4, 2019

Computing Receptive Fields of Convolutional Neural Networks

Detailed derivations and open-source code to analyze the receptive fields of convnets.

Latest