jastuk | 5 months ago

You've mentioned in the docs that:

> Gemini leverages native video understanding for direct analysis, while Local models reconstruct understanding from individual frame descriptions - resulting in dramatically different processing complexity.

For people like me who haven't dabbled much with AI video processing and have no intuition for it, could you clarify the drawbacks of the local-only approach compared with what Gemini offers? I don't mean the performance or power/battery impact (that part is clear), just what the practical differences are in terms of end result and quality.

I'm in the 100%-offline-only camp here, but I'd like to know what I'm missing out on, since I won't even try Gemini.
