Inspectus allows you to create interactive visualizations of attention matrices with just a few lines of Python code. It’s designed to run smoothly in Jupyter notebooks through an easy-to-use Python API. Inspectus provides multiple views to help you understand language model behaviors. If you have any questions, feel free to ask!
On a related note: recently, I released a visualization of all MLP neurons inside the llama3 8B model. Here is an example "derivative" neuron which is triggered when talking about the derivative concept.
Interesting. I think OpenAI here uses sparse autoencoders to map out sparse activation patterns in networks. Comparing them to how a real person reasons about a situations.
Inspectus, on the other hand is a general tool to visualize how transformer models pay attention to different parts of the data they process.
That OpenAI work is more elaborate. It trains an additional network in such a way that it encodes what GPT is doing in terms of activations, but in a more interpretable way (hopefully). Here, as far as I can tell, it's visualizing the activation of the attention layers directly.
I'm not a primary user. Just cleaned up the existing codebase to make it open source. But you could use this to visualise attentions and debug the model.
For an example if you're working on a Q&A model, you can check which tokens in the prompt contributed to the output. It's possible to detect issues like output not paying attention to any important part of the prompt.
xcodevn|1 year ago
https://neuralblog.github.io/llama3-neurons/neuron_viewer.ht...
skulk|1 year ago
vpj|1 year ago
SushiHippie|1 year ago
Golden Gate Claude - https://news.ycombinator.com/item?id=40459543 - (60 comments, 16 days ago)
Extracting Concepts from GPT-4 - https://news.ycombinator.com/item?id=40599749 (144 comments, 2 days ago)
lakshith-403|1 year ago
Inspectus, on the other hand is a general tool to visualize how transformer models pay attention to different parts of the data they process.
dimatura|1 year ago
ravjo|1 year ago
swifthesitation|1 year ago
blackbear_|1 year ago
benf76|1 year ago
lakshith-403|1 year ago
For an example if you're working on a Q&A model, you can check which tokens in the prompt contributed to the output. It's possible to detect issues like output not paying attention to any important part of the prompt.
unknown|1 year ago
[deleted]
JackYoustra|1 year ago