(no title)
BrandiATMuhkuh | 4 months ago
Since the first multimodal llms came out, I'm using this approach when I deal with documents. It makes the code much simpler because everything is an image and it's surprisingly robust.
Works also for embeddings (cohere embed v4)
No comments yet.