item 45733999 (no title)

melvinmelih|4 months ago
> because they are trained on multilingual data
But they were not trained on government-sanctioned homegrown EU data.

sunaookami|4 months ago
Who in their right mind would use this?

  tensor|4 months ago
  I'd use a model trained on a targeted and curated data set over one trained on all the crap on the internet any day.

saretup|4 months ago
The entirety of the internet vs government-sanctioned homegrown EU data.

tonyhart7|4 months ago
"But they were not trained on government-sanctioned homegrown EU data."
ok what are you implying on this

  mock-possum|4 months ago
  Sidesteps potential legal issues probably

raverbashing|4 months ago
> But they were not trained on government-sanctioned homegrown EU data.
If none of the LLM makers used the very big corpus of EU multilingual data I have an EU regulation bridge to sell it to you