Show HN: Steiner – An open-source reasoning model inspired by OpenAI o1
83 points | peakji | 1 year ago | medium.com
Blog: https://medium.com/@peakji/a-small-step-towards-reproducing-...
Hugging Face: https://huggingface.co/collections/peakji/steiner-preview-67...
schmeichel|1 year ago
peakji|1 year ago
I haven't used Ollama's Modelfile format myself, but I think converting from GGUF should be relatively easy.
swyx|1 year ago
peakji|1 year ago
mdaniel|1 year ago
Metameh|1 year ago
[deleted]
nxobject|1 year ago
zby|1 year ago
peakji|1 year ago
nwnwhwje|1 year ago
Is this a fine-tuned LLM, for example a drop-in replacement for Llama, etc.?
Or is it some algorithm on top of an LLM that does some chain of reasoning?
peakji|1 year ago
Mr_Bees69|1 year ago
peakji|1 year ago
ActorNightly|1 year ago
I'm wondering if we can push chain of thought further down into the computation level to replace a lot of the matrix multiplication. For example, smaller transformers with fewer parameters, plus more selection of which transformer to use through search.