(no title)
thoughtlede | 1 year ago
Also we may have to look for better loss functions than ones that help us predict the next token to train the models if the objective is reasoning.
thoughtlede | 1 year ago
Also we may have to look for better loss functions than ones that help us predict the next token to train the models if the objective is reasoning.
No comments yet.