(no title)
mgreg
|
2 years ago
I very much appreciate that the authors not only published their code (https://github.com/llm-random/llm-random) but included the dataset they used (available on Huggingface - https://huggingface.co/datasets/c4) as well as the training process and hyperparameters they used so others can replicate and build on their work. The only thing really missing is the weights which would be nice to have on huggingface as well.
swells34|2 years ago
chaxor|2 years ago
And the wonderful thing is, every person that used git clone on this repo and ran it would be serving the NN weights.
But alas, this unfortunately hasn't been done yet.
astrange|2 years ago
_ea1k|2 years ago
jakderrida|2 years ago
mgreg|2 years ago
It would be nice if they provided the weights so it could be USABLE without the effort or knowledge required.
We (I think) would all like to see more _truly_ open models (not just the source code) that enable collaboration in the community.