top | item 45733999

(no title)

melvinmelih | 4 months ago

> because they are trained on multilingual data

But they were not trained on government-sanctioned homegrown EU data.

discuss

order

sunaookami|4 months ago

Who in their right mind would use this?

tensor|4 months ago

I'd use a model trained on a targeted and curated data set over one trained on all the crap on the internet any day.

saretup|4 months ago

The entirety of the internet vs government-sanctioned homegrown EU data.

tonyhart7|4 months ago

"But they were not trained on government-sanctioned homegrown EU data."

ok what are you implying on this

mock-possum|4 months ago

Sidesteps potential legal issues probably

raverbashing|4 months ago

> But they were not trained on government-sanctioned homegrown EU data.

If none of the LLM makers used the very big corpus of EU multilingual data I have an EU regulation bridge to sell it to you