(no title)
Santisco | 2 years ago
Another thing is that I'm a little bit confused about all kinds of "LLMs". I looked into hugging face but there are so many different models and they may have variant size or structure. Which model can be a good start? I guess gpt4 is too large to handle in a lab environment (sorry I don't really know much about LLM :).
speedgoose|2 years ago
Mistral is a recent strong contender, but they only released a "small" large language model for now.
Depending on your tasks, some BERT models can give better results too.
ChatGPT4 is supposedly very large, but we don't know much about it. It's not open or available outside SaaS on OpenAI or Microsoft Azure.
If you don't plan to do research on the LLMs themselves, but more do applied research with LLMs, ChatGPT4 using the OpenAI API may be a good start. And perhaps don't buy costly lab equipment yet, you can experiment and gain experience on the small quantised models first.