(no title)
pbasp
|
2 months ago
Personally I don't believe much in AI recommendations. The problem is the data. AI isn't magic, if the AI doesn't have the data, then it will hallucinate the data. I've discussed with ChatGPT about my movie tastes and asked it to give me recommendations... At first it was a quite interesting conversation, but it couldn't go very far because it knows a lot of details about the blockbuster movies, but strictly nothing about the remaining 98% movies. In comparison, collaborative filtering has access to way more data.
nicola_alessi|2 months ago
Our approach with lumigo.tv is different by necessity, and it's a direct response to the problem you've nailed. We don't use an LLM for knowledge.
Here's the technical split:
The LLM is strictly a query translator. Its only job is to take your messy, natural language prompt ("a gloomy noir set in a rainy city") and convert it into a structured set of searchable tags, genres, and metadata filters. It is forbidden from generating or hallucinating movie titles, actors, or plots. The recommendations come from a structured database. Those translated filters are executed against a traditional database of movies/shows (we've integrated with TMDB and similar sources). The results are ranked by existing metrics like popularity, rating, and release date. The LLM never invents a result; it can only return what exists in the connected data. You're right that pure collaborative filtering (like Netflix's) has a massive data advantage for mainstream tastes. Where it falls short is for edge cases and specific intent. If you want "movies like the third act of Parasite," a collaborative filter has no vector for that. Our hypothesis is that a human can describe that intent, an LLM can map it to tags (e.g., "class tension," "thriller," "dark comedy"), and a database can find matches.
So, it's not AI vs. collaborative filtering. It's AI as a natural-language front-end to a traditional database. The AI handles the "what I want" translation; the database handles the "what exists" retrieval. This avoids the hallucination problem but still allows for queries that a "Because you watched..." algorithm could never process.
Does that distinction make sense? It's an attempt to use each tool for what it's best at.
pbasp|2 months ago
pbasp|2 months ago