Adversarial Reprogramming of Neural Cellular Automata
Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.
Reprogramming Neural CA to exhibit novel behaviour, using adversarial attacks.
Introducing the Explainable AI Cheat Sheet, your high-level guide to the set of tools and methods that helps humans understand AI/ML models and their predictions. I…
Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.
When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.
Large pretrained language models are trained over a sizable collection of online data. They unavoidably acquire certain toxic behavior and biases from the Internet. Pretrained language…
We report the existence of multimodal neurons in artificial neural networks, similar to those found in the human brain.
Neural Cellular Automata learn to generate textures, exhibiting surprising properties.
We present techniques for visualizing, contextualizing, and understanding neural network weights.
Reverse engineering the curve detection algorithm from InceptionV1 and reimplementing it from scratch.
A family of early-vision neurons reacting to directional transitions from high to low spatial frequency.
By visualizing the hidden state between a model's layers, we can get some clues as to the model's "thought process". Figure: Finding the words to say…
[Updated on 2021-02-01: Updated to version 2.0 with several work added and many typos fixed.] [Updated on 2021-05-26: Add P-tuning and Prompt Tuning in the “prompt…
Interfaces for exploring transformer language models by looking at input saliency and neuron activation. Explorable #1: Input saliency of a list of countries generated by a…
Neural networks naturally learn many transformed copies of the same feature, connected by symmetric weights.
With diverse environments, we can analyze, diagnose and edit deep reinforcement learning models using attribution.
[Updated on 2020-11-12: add an example on closed-book factual QA using OpenAI API (beta). A model that can answer any question with regard to factual knowledge…
Examining the design of interactive articles by synthesizing theory from disciplines such as education, journalism, and visualization.
A collection of articles and comments with the goal of understanding how to design robust and general purpose self-organizing systems.
Training an end-to-end differentiable, self-organising cellular automata for classifying MNIST digits.
Although most popular and successful model architectures are designed by human experts, it doesn’t mean we have explored the entire network architecture space and settled down…
Discussions: Hacker News (397 points, 97 comments), Reddit r/MachineLearning (247 points, 27 comments) Translations: German, Korean, Chinese (Simplified), Russian, Turkish The tech world is abuzz with…
Part one of a three part deep dive into the curve neuron family.
[Updated on 2020-06-17: Add “exploration via disagreement” in the “Forward Dynamics” section. Exploitation versus exploration is a critical topic in Reinforcement Learning. We’d like the RL…
How to tune hyperparameters for your machine learning model using Bayesian optimization.