top | item 46841130

(no title)

_diyar | 29 days ago

Are any real world self-driving models (Waymo, Tesla, any others I should know?) really using VLM?

discuss

order

bijant|29 days ago

No! No one in their right mind would even consider using them for guidance and if they are used for OCR (not too my knowledge but could make sense in certain scenarios) then their output would be treated the way you'd treat any untrusted string.

godelski|29 days ago

You are confidently wrong

  > Powered by Gemini, a multimodal large language model developed by Google, EMMA employs a unified, end-to-end trained model to generate future trajectories for autonomous vehicles directly from sensor data. Trained and fine-tuned specifically for autonomous driving, EMMA leverages Gemini’s extensive world knowledge to better understand complex scenarios on the road. 
https://waymo.com/blog/2024/10/introducing-emma/