top | item 44833594 (no title) ludwik | 6 months ago There is "performance" as in "speed and cost" and performance as in "the model returning quality responses, without getting lost in the weeds". Caching only helps with the former. discuss order hn newest otabdeveloper4|6 months ago If the context window is small enough then only the tail of the prompt matters anyways. HardCodedBias|6 months ago "the model returning quality responses, without getting lost in the weeds"I should edit, but that would be disingenuous. This is exactly what I meant.thank you!
otabdeveloper4|6 months ago If the context window is small enough then only the tail of the prompt matters anyways.
HardCodedBias|6 months ago "the model returning quality responses, without getting lost in the weeds"I should edit, but that would be disingenuous. This is exactly what I meant.thank you!
otabdeveloper4|6 months ago
HardCodedBias|6 months ago
I should edit, but that would be disingenuous. This is exactly what I meant.
thank you!