Show HN: Smolmodels – open-source tool to build ML models using natural language
37 points | imaginaryspaces | 1 year ago | github.com
The core idea is that LLMs are overkill for a lot of predictive tasks. Smolmodels automates the trial-and-error process of finding the right model architecture and training approach, letting you build small, specialised models. You can either provide your own training data or have the library generate synthetic data based on your input/output schema requirements. This lets you quickly experiment with different model designs before investing in data collection.
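To make the schema-driven synthetic data idea concrete, here is a rough sketch of what generating rows from an input schema could look like. This is not the smolmodels API: the schema format, `synth_row`, and `synth_dataset` are hypothetical names made up for illustration, and plain random sampling stands in for whatever LLM-driven generation the library actually does.

```python
import random

# Hypothetical schema format, invented for this sketch (not the smolmodels API).
INPUT_SCHEMA = {"age": int, "income": float, "is_member": bool}

def synth_row(schema, rng):
    """Draw one random row whose fields match the declared types."""
    row = {}
    for name, typ in schema.items():
        if typ is bool:
            row[name] = rng.random() < 0.5
        elif typ is int:
            row[name] = rng.randint(0, 100)
        elif typ is float:
            row[name] = rng.uniform(0.0, 1.0)
        else:
            raise TypeError(f"unsupported type for {name}: {typ}")
    return row

def synth_dataset(schema, n, seed=0):
    """Build n rows; a fixed seed makes the dataset reproducible."""
    rng = random.Random(seed)
    return [synth_row(schema, rng) for _ in range(n)]

rows = synth_dataset(INPUT_SCHEMA, 5)
```

Rows like these are only good enough to smoke-test a model design; real or more realistic data is still needed before trusting any accuracy numbers.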
The library handles the full pipeline, from data prep/generation through training to inference code. Everything can be self-hosted and works with major LLM providers.
We would love any thoughts/feedback on the project!
Repo link: https://github.com/plexe-ai/smolmodels
documentparser|1 year ago
imaginaryspaces|1 year ago
However, the concept generalises to other data types very naturally, and we plan to add support for things like images, audio, etc. very soon :)
unknown|1 year ago
[deleted]
binarymuffin|1 year ago
imaginaryspaces|1 year ago
pfrpt|1 year ago
imaginaryspaces|1 year ago
For example: you have an ecommerce site and want to rank relevant products for your users. You want to launch a prototype quickly. You could use ChatGPT as your ranker ("rank products for this user ..."), or you could use smolmodels to generate a more lightweight ranking model like a smaller neural net, etc.
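As a rough idea of what a "more lightweight ranking model" can mean here, below is a minimal sketch of a pointwise ranker: logistic regression trained with plain SGD in pure Python, then used to sort products by predicted click probability. This is not what smolmodels generates. The two features (`price_match`, `category_match`), the synthetic click rule, and the function names are all assumptions made up for the example.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_ranker(examples, epochs=50, lr=0.1):
    """Fit logistic regression with plain SGD; examples are (features, clicked)."""
    w = [0.0] * len(examples[0][0])
    b = 0.0
    for _ in range(epochs):
        for x, y in examples:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
            g = p - y  # gradient of the log loss w.r.t. the logit
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
            b -= lr * g
    return w, b

def rank(products, w, b):
    """Sort product names by predicted click probability, highest first."""
    scored = [
        (sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b), name)
        for name, x in products.items()
    ]
    return [name for _, name in sorted(scored, reverse=True)]

# Synthetic training data: features are [price_match, category_match] in [0, 1].
# The click rule below is an assumption for illustration only.
rng = random.Random(0)
train = []
for _ in range(200):
    x = [rng.random(), rng.random()]
    y = 1 if 0.3 * x[0] + 0.7 * x[1] > 0.5 else 0
    train.append((x, y))

w, b = train_ranker(train)
catalog = {"good": [0.9, 0.9], "mid": [0.5, 0.5], "bad": [0.1, 0.1]}
ranked = rank(catalog, w, b)
```

A model this small scores a product with a handful of multiplications, which is the kind of cost gap versus calling an LLM for every ranking request.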
unknown|1 year ago
[deleted]
Santosho|1 year ago
alienlolkinn|1 year ago
fkmms|1 year ago
[deleted]