(no title)
elcomet | 1 year ago
You can use a specific LLM, or a general larger LLM to do this routing.
Also, some work suggest using smaller llms to generate multiple responses and use a stronger and larger model to rank the responses (which is much more efficient than generating them)
No comments yet.