(no title)
yosai
|
1 year ago
We have a team of domain expert who do the vetting of the instruction dataset.We do typical RLHF(Reinforcement learning from human feedback) and connect back to our SFT(supervised finetuning) loop.That's why we name ourself as hardware and human in loop.Humans play an important role in ensuring quality and accuracy of our dataset.
novacode007|1 year ago
yosai|1 year ago
yosai|1 year ago