Hey. Its a good practical application, especially to reduce cost. But at about 0.6 similarity, I get some cache hit. Maybe with more examples and for a high use app, the cache hit would increase based on a higher similarity scores.
Still early days on this I guess, but any observations that can help improve the hit rates?
No comments yet.