top | item 34816184

Transformer models: an introduction and catalog

188 points| mariuz | 3 years ago |arxiv.org | reply

25 comments

order
[+] adamnemecek|3 years ago|reply
I have recently written a paper on understanding transformer learning via the lens of coinduction & Hopf algebra.

https://arxiv.org/abs/2302.01834v1

The learning mechanism of transformer models was poorly understood however it turns out that a transformer is like a circuit with a feedback.

I argue that autodiff can be replaced with what I call in the paper Hopf coherence.

Furthermore, if we view transformers as Hopf algebras, one can bring convolutional models, diffusion models and transformers under a single umbrella.

I'm working on a next gen Hopf algebra based machine learning framework.

Join my discord if you want to discuss this further https://discord.gg/mr9TAhpyBW

[+] amkkma|3 years ago|reply
Is there any hope of understanding this with just calc and linalg knowledge?
[+] erichocean|3 years ago|reply
> Furthermore, if we view transformers as Hopf algebras, one can bring convolutional models, diffusion models and transformers under a single umbrella.

Have you written any more about this?

[+] erichocean|3 years ago|reply
Hi Adam, can you update your Discord invite? It's now invalid.
[+] visarga|3 years ago|reply
> 2.5.9 ChatGPT is also more than a model since it includes extensions for Memory Store and retrieval similar to BlenderBot3

I don't think this affirmation is factual. There are people who played with this idea, but it is not part of chatGPT.

[+] sva_|3 years ago|reply
> 2.5.5 BERT

> Extension:It can be seen as a generalization of BERT and GPT in that it combines ideas from both in the encoder and decoder

I believe this is an error? Text from BART. And a space missing.

[+] visarga|3 years ago|reply
I have a hunch they used LLM to compile the list.
[+] DerSaidin|3 years ago|reply
It is a shame the figures 5,6,7,8 break up the content of 2.5 Catalog, just to fit the figures onto pages.

Are pages even needed anymore?

[+] h_lezzaik|3 years ago|reply
Good timing, I've been trying to compile a list like this myself to keep track of everything released.
[+] theredlancer|3 years ago|reply
Where's Cliffjumper and Ironside?
[+] zndr|3 years ago|reply
I'm glad I'm not the only one looking for a taxonomy of refugees from the great Cybertron wars
[+] sircastor|3 years ago|reply
When I was younger I would often encounter mentions of electrical transformers, and be quite disappointed when it wasn't related to the toys or the series. Even in my 40s I still have a bit of disappointment about it...
[+] peresthe|3 years ago|reply
"The goal of this paper is to offer a somewhat comprehensive but simple catalog and classification of the most popular Transformer models."

Yet of the 6 comments here, 2 of them are complaining about missing models and three more are arguing about the typesetting on figures.

[+] swyx|3 years ago|reply
figure 5 on page 10 is a ridiculously small font and unreadable. i wish there was a better way to display this kind of info on PDFs
[+] mdp2021|3 years ago|reply
> i wish there was a better way to display this kind of info on PDFs

...Portable Document Format was /born/ to display vector (i.e. you just zoom in)... The error in the page was to embed a raster image of text!

[+] dylan604|3 years ago|reply
From the bottom of the page in question: "Figure 5: You can access the original table at https://docs.google.com/spreadsheets/d/ 1ltyrAB6BL29cOv2fSpNQnnq2vbX8UrHl47d7FkIf6t4 for easier browsing across the different model features."