top | item 37827587

(no title)

Santisco | 2 years ago

Thanks so much for the info!

Another thing is that I'm a little bit confused about all kinds of "LLMs". I looked into hugging face but there are so many different models and they may have variant size or structure. Which model can be a good start? I guess gpt4 is too large to handle in a lab environment (sorry I don't really know much about LLM :).

discuss

speedgoose|2 years ago

Llama2 models are the standard models to use those days. But that can change fast.

Mistral is a recent strong contender, but they only released a "small" large language model for now.

Depending on your tasks, some BERT models can give better results too.

ChatGPT4 is supposedly very large, but we don't know much about it. It's not open or available outside SaaS on OpenAI or Microsoft Azure.

If you don't plan to do research on the LLMs themselves, but more do applied research with LLMs, ChatGPT4 using the OpenAI API may be a good start. And perhaps don't buy costly lab equipment yet, you can experiment and gain experience on the small quantised models first.