top | item 38911241


madlag | 2 years ago

I love the idea, that's the future. However, you should be aware that the explanation of the second law of thermodynamics generated by the LLM in your App Store screenshot is wrong: the LLM has it backwards. Energy transfers from less stable states to more stable states, not the reverse. (I use LLMs for science education apps like https://apps.apple.com/fr/app/explayn-learn-chemistry/id6448..., so I am quite used to spotting that kind of error in LLM outputs...)



kkielhofner | 2 years ago

Strongly agree.

Local, app-embedded, purpose-built experts are clearly the future in my mind, for a variety of reasons. Looking at the TPUs in Android devices and the Neural Engine in Apple hardware, it's pretty clear.

Xcode already has an ML studio, for example, that can not only embed and integrate models in apps but also fine-tune them, etc. It's obvious to me that at some point most apps will have embedded models in the app (or on the device) for specific purposes.

No AI can compare to humans, and even we specialize. You wouldn't hire a plumber to perform brain surgery, and you wouldn't hire a neurosurgeon to fix your toilet. Mixture-of-experts is a thing with AI models, of course, but when we look at how we primarily interact with technology and the functionality it provides, it's generally pretty well siloed into specific purposes.

A small model trained or tuned for a specific domain and context, doing stuff on your on-device data, would likely do nearly as well as (if not better than) even ChatGPT for some applications. Think of the next version of device keyboards doing RAG+LLM over your text messages to generate replies. Stack it up with speech-to-text, vision, multimodal models, and who knows what else, and yeah, it gets interesting.
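That keyboard idea can be sketched in a few lines. This is a toy illustration of the retrieval half of RAG over past messages, with a bag-of-words "embedding" standing in for a real embedding model; the final `local_llm(...)` call is hypothetical, since no specific on-device model API is described here:

```python
# Toy sketch of "RAG over your texts": retrieve the most similar past
# messages, then hand them to an on-device model as reply context.
# embed() is a deliberately crude stand-in for a real embedding model.
from collections import Counter
from math import sqrt

def embed(text: str) -> Counter:
    """Stand-in embedding: a bag-of-words term-frequency vector."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, messages: list[str], k: int = 2) -> list[str]:
    """Return the k messages most similar to the query."""
    q = embed(query)
    ranked = sorted(messages, key=lambda m: cosine(q, embed(m)), reverse=True)
    return ranked[:k]

messages = [
    "Dinner at 7 tonight?",
    "The meeting moved to Thursday",
    "Can you pick up milk on the way home",
]
context = retrieve("what time is dinner", messages)
prompt = "Context:\n" + "\n".join(context) + "\nReply to: what time is dinner"
# A real keyboard would now call something like local_llm(prompt) on-device;
# the retrieval step above is what keeps the prompt small and private.
```

A production version would swap `embed` for a sentence-embedding model and add a vector index, but the shape (embed, retrieve top-k, stuff into the prompt) stays the same.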

Throw in the automatic scaling, latency, and privacy and the wins really stack up.

Some random app developer can integrate a model in their application and scale higher with better performance than ChatGPT without setting money on fire.

jorvi | 2 years ago

> Local, app-embedded, purpose-built experts are clearly the future in my mind, for a variety of reasons. Looking at the TPUs in Android devices and the Neural Engine in Apple hardware, it's pretty clear.

I think that’s only true for delay-intolerant or privacy-focused features. For most situations, a remote model running on an external server will outperform a local model: there is no thermal, battery, or memory headroom for the local model to ever do better. The cost is a mere hundred milliseconds of delay at most.

I expect most models triggered on consumer devices to run remotely, with a degraded local service option in case of connection problems.

spyUlovedM3 | 2 years ago

> when we look at how we primarily interact with technology and the functionality it provides it's generally pretty well siloed to specific purposes.

Yes, but the silos in this case will get much bigger, e.g. ChatGPT vs. DALL-E.

Horffupolde | 2 years ago

How do you define stability in that context?

madlag | 2 years ago

Stability is actually defined by having a lower energy level. That explains why energy can only flow from a less stable system to a more stable system: the more stable system does not have the available energy to give.
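The rule above reduces to a one-line comparison. This is a purely illustrative sketch (the function name and numbers are made up, and "energy" here is just the energy level that defines stability in the comment above):

```python
# Illustration of the point above: energy flows spontaneously from the
# less stable (higher-energy) system to the more stable (lower-energy)
# one, never the reverse.
def flows_from_to(energy_a: float, energy_b: float) -> tuple[str, str]:
    """Return (source, sink) for spontaneous energy transfer between
    systems A and B with the given energy levels."""
    # The higher-energy side is less stable, so it is the source.
    return ("A", "B") if energy_a > energy_b else ("B", "A")

# A hot object next to a cold one: the hot side (higher thermal energy,
# less stable) is the source, matching everyday experience.
source, sink = flows_from_to(energy_a=100.0, energy_b=10.0)
```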