- **A Mathematical Framework for Transformer Circuits** – In Transformers, the residual stream is the central object: every layer reads from it and writes back to it (see the sketch after this list).
- **An Overview of Early Vision in InceptionV1** – Feature maps of the different layers of InceptionV1.
- **CLIP-Dissect: Automatic Description of Neuron Representations** – Finds the concepts that activate a neuron using an image dataset (a matching sketch appears below).
- **Leveraged volume sampling for linear regression** – Active learning for linear regression with multiplicative error bounds (a related sampling sketch appears below).
- **Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet** – Scales sparse autoencoders (SAEs) to Claude 3 Sonnet.
- **Towards Monosemanticity: Decomposing Language Models With Dictionary Learning** – How SAEs work (see the SAE sketch below).
- **Zoom In: An Introduction to Circuits** – Investigates vision circuits by studying the connections between neurons.
- **Active Learning Survey** – Active learning for agnostic classification.
- **The True Sample Complexity of Active Learning** – A different definition of active-learning label complexity.
- **Can Large Language Models Explain Their Internal Mechanisms?** – Summary of the paper.
- **Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task** – Summary of the paper.
- **Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)** – Summary of the paper.
- **Labeling Neural Representations with Inverse Recognition** – Summary of the paper.
- **Progress measures for grokking via mechanistic interpretability** – Summary of the paper.
- **What do we learn from inverting CLIP models?** – Summary of the paper.
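
To make the residual-stream picture from *A Mathematical Framework for Transformer Circuits* concrete, here is a minimal NumPy sketch. All sizes and the `tanh` stand-ins for attention and MLP blocks are made up; the only point being illustrated is that each layer reads the shared stream and writes its output back additively.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, seq_len, n_layers = 16, 8, 4  # made-up toy sizes

# Stand-in per-layer weights; real attention/MLP blocks are far richer.
Wa = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(n_layers)]
Wm = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(n_layers)]

x = rng.standard_normal((seq_len, d_model))  # the residual stream
for layer in range(n_layers):
    x = x + np.tanh(x @ Wa[layer])  # "attention" reads the stream, writes back additively
    x = x + np.tanh(x @ Wm[layer])  # the "MLP" does the same; the stream is never overwritten
```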
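
For CLIP-Dissect, a minimal sketch of the matching step, assuming you already have a CLIP image-to-concept similarity matrix and the target neuron's activations over the same probe images (both random stand-ins here). The paper scores concepts with a soft-WPMI-style similarity; plain correlation is substituted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)
n_imgs = 200
concepts = ["dog", "car", "tree", "stripe", "wheel"]  # hypothetical concept words

# Stand-ins: in CLIP-Dissect these come from a real CLIP model and a probe dataset.
clip_sim = rng.standard_normal((n_imgs, len(concepts)))  # similarity of image i to concept j
neuron_act = rng.standard_normal(n_imgs)                 # target neuron's activation per image

def zscore(v):
    return (v - v.mean(axis=0)) / (v.std(axis=0) + 1e-8)

# Score each concept by how well its similarity profile across images matches the
# neuron's activation profile (correlation here, soft-WPMI in the paper).
scores = zscore(clip_sim).T @ zscore(neuron_act) / n_imgs
print("best concept for this neuron:", concepts[int(np.argmax(scores))])
```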
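
The sketch below uses plain i.i.d. leverage-score sampling rather than the paper's leveraged volume sampling (volume sampling draws the subset jointly, which is what yields the unbiased estimator and multiplicative bounds); it only illustrates the shared skeleton: pick informative rows by leverage, query labels only for those rows, and solve a reweighted least-squares problem. All data are synthetic.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 500, 5, 60  # pool size, feature dim, label budget (made up)
X = rng.standard_normal((n, d))
w_true = rng.standard_normal(d)
y = X @ w_true + 0.1 * rng.standard_normal(n)  # labels, observed only when queried

# Leverage scores: l_i = x_i^T (X^T X)^{-1} x_i, the diagonal of the hat matrix.
U, _, _ = np.linalg.svd(X, full_matrices=False)
lev = np.sum(U ** 2, axis=1)
p = lev / lev.sum()

# Query k labels sampled proportionally to leverage; reweight by 1/sqrt(k * p) so the
# weighted least-squares objective is unbiased for the full-data objective.
idx = rng.choice(n, size=k, replace=True, p=p)
w = 1.0 / np.sqrt(k * p[idx])
w_hat, *_ = np.linalg.lstsq(X[idx] * w[:, None], y[idx] * w, rcond=None)
print("estimation error:", np.linalg.norm(w_hat - w_true))
```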
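
Finally, a minimal sketch of the sparse-autoencoder setup from the two monosemanticity papers: a wide ReLU encoder produces sparse feature activations, a linear decoder reconstructs the original activations from the learned dictionary, and training trades off reconstruction error against an L1 sparsity penalty. Dimensions are made up and the weights are random, untrained stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)
d_act, d_dict, n = 64, 512, 1024        # activation dim, dictionary size, sample count (made up)
acts = rng.standard_normal((n, d_act))  # stand-in for a model's MLP activations

# Random, untrained stand-ins for the learned encoder/decoder.
W_enc = rng.standard_normal((d_act, d_dict)) / np.sqrt(d_act)
W_dec = rng.standard_normal((d_dict, d_act)) / np.sqrt(d_dict)
b_enc = np.zeros(d_dict)
b_dec = np.zeros(d_act)

def sae(x):
    f = np.maximum((x - b_dec) @ W_enc + b_enc, 0.0)  # sparse feature activations
    x_hat = f @ W_dec + b_dec                         # reconstruction from the dictionary
    return f, x_hat

f, x_hat = sae(acts)
# Training objective: reconstruction MSE plus an L1 penalty that induces sparsity.
l1_coeff = 1e-3
loss = np.mean(np.sum((acts - x_hat) ** 2, axis=1)) + l1_coeff * np.mean(np.sum(np.abs(f), axis=1))
print(f"loss of the untrained SAE: {loss:.2f}")
```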