top | item 42785630

(no title)

xgstation | 1 year ago

It is really fun and sarcastic to watch all this happening. U.S. Gov. tried to block China from accessing GPU resources very hardly to stop their AI development, but actually helped China to take a leap on developing efficient and more cost-effective LLM model with constraint GPU access.

discuss

order

cadamsdotcom|1 year ago

And then "China" (which is actually a bunch of super generous folks at DeepSeek) decides to release it all back to the US under a permissive MIT license.

They could've just exposed an API and kept the model to themselves but they didn't!

They could've not published their research paper, but they did, again and again - and each time they publish they discuss not just the techniques that DO work, but those that don't - saving researchers everywhere from loads of dead ends.

That is pure awesome. Thank you DeepSeek engineers for your gift to humanity.

JTyQZSnP3cQGa8B|1 year ago

Do they have models that try to downplay what happened on Tiananmen Square? That would be a sneaky way to shape our future in some way (and no whataboutism, we do it too).

austin-starks|1 year ago

I couldn't agree more. China has: * weaker GPUs * a smaller model * started with nothing

And now they're building better, faster, and cheaper models at a fraction of the cost. It's hilarious and exciting.

glimshe|1 year ago

Speaking of sarcasm, and thinking about your argument, should we start selling them cutting edge GPUs to slow down their research?

seanmcdirmid|1 year ago

I doubt you ur GPU sanctions have had much of an influence one way or the other. They can get their resources from third countries even if they can’t get them directly from the USA. I wonder if the USA will eventually try to lock down higher end NVIDIA GPUs and prevent export all together.