If it's limited to 8k context length then it's not competing with sonnet at all IMO. Sonnet has a 200k context length and it's decent at pulling stuff from it, with just an 8k context length this model won't be great for RAG applications, instead it'll be used for chat and transforming data from one type to another.
CuriouslyC|1 year ago
sroussey|1 year ago
It’s better to make your RAG system work well on small context first anyway.