(no title)
gabriel666smith | 2 months ago
Language architecture is really interesting, I think, for programmers who have bought into the LLM hype in any meaningful way. It's an important field to have a sense of.
Tokenizers, for example, generally have multi-syllabic tokens as their base-level, indivisible unit.
You rarely see this mentioned when LLM capability against non-coding tasks is discussed, despite it being deeply important for prose construction.
Not to mention, putting language models aside, that the vast majority of code is written in language with a logical grammar. The disciplines are highly linked.
regularfry|2 months ago
altairprime|2 months ago