top | item 40143104 (no title) terafo | 1 year ago t5 is an architecture, t5x is a framework for training models that was created with that architecture in mind, but can be used to train other architectures, including decoder-only ones(there is one in examples). discuss order hn newest ma2rten|1 year ago t5x was used to train PaLM 1.
ma2rten|1 year ago