Ask HN: What data model are you using for RAG prototyping?
1 point | throwawaystress | 1 year ago
This is the kind of stuff that I think needs to be stored:
- Prompt templates (i.e., versioning my prompts)
- Final inputs to and outputs from the LLM provider (and associated metadata)
- Chunks of all my documents to be used in RAG
- The chunks that were retrieved for a given prompt, so that I can evaluate the performance of the retrieval step
- Conversations (or chains?) for when there might be multiple requests sent to an LLM for a given "question"
- Experiments. This is for the purposes of evaluation. It would associate an experiment ID with a series of inputs/outputs for an evaluation set of questions.
I can't be the first person to hit this issue. I started off with a simple SQLite database with a handful of tables, and now that I'm about to incorporate RAG into the application (and probably agentic stuff soon), I really want to learn from someone else's experience so I don't repeat all the same mistakes.
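To make the question concrete, here is a rough sketch of how the items listed above might map onto SQLite tables. Every table, column, and function name here (`llm_calls`, `retrievals`, `rank_in_results`, `open_db`, `log_call`, and so on) is a hypothetical choice for illustration, not an established schema, and it assumes one row per request to the LLM provider with the retrieved chunks stored in a separate join table.

```python
import json
import sqlite3

# Hypothetical schema: one table per item in the list above.
SCHEMA = """
CREATE TABLE IF NOT EXISTS prompt_templates (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    version INTEGER NOT NULL,
    body TEXT NOT NULL,
    UNIQUE (name, version)              -- each edit becomes a new version row
);
CREATE TABLE IF NOT EXISTS chunks (
    id INTEGER PRIMARY KEY,
    document TEXT NOT NULL,             -- source document identifier
    seq INTEGER NOT NULL,               -- chunk order within the document
    body TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS experiments (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS conversations (
    id INTEGER PRIMARY KEY,
    question TEXT NOT NULL,
    experiment_id INTEGER REFERENCES experiments(id)  -- NULL outside evals
);
CREATE TABLE IF NOT EXISTS llm_calls (
    id INTEGER PRIMARY KEY,
    conversation_id INTEGER NOT NULL REFERENCES conversations(id),
    template_id INTEGER REFERENCES prompt_templates(id),
    model TEXT NOT NULL,
    input_text TEXT NOT NULL,           -- final rendered prompt
    output_text TEXT,
    metadata TEXT NOT NULL DEFAULT '{}',  -- JSON blob: tokens, latency, ...
    created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE IF NOT EXISTS retrievals (
    call_id INTEGER NOT NULL REFERENCES llm_calls(id),
    chunk_id INTEGER NOT NULL REFERENCES chunks(id),
    rank_in_results INTEGER NOT NULL,   -- 1 = top hit
    score REAL,
    PRIMARY KEY (call_id, chunk_id)
);
"""


def open_db(path=":memory:"):
    """Open a database and create the tables if they are missing."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves this off by default
    conn.executescript(SCHEMA)
    return conn


def log_call(conn, conversation_id, template_id, model,
             input_text, output_text, retrieved, metadata=None):
    """Store one LLM request plus the chunks retrieved for it.

    `retrieved` is a list of (chunk_id, score) pairs, best hit first.
    """
    with conn:  # commit on success, roll back on error
        cur = conn.execute(
            "INSERT INTO llm_calls (conversation_id, template_id, model,"
            " input_text, output_text, metadata) VALUES (?, ?, ?, ?, ?, ?)",
            (conversation_id, template_id, model, input_text, output_text,
             json.dumps(metadata or {})),
        )
        call_id = cur.lastrowid
        conn.executemany(
            "INSERT INTO retrievals (call_id, chunk_id, rank_in_results, score)"
            " VALUES (?, ?, ?, ?)",
            [(call_id, chunk_id, rank, score)
             for rank, (chunk_id, score) in enumerate(retrieved, start=1)],
        )
    return call_id
```

With this layout, evaluating the retrieval step for an experiment would be a join from `experiments` through `conversations` and `llm_calls` to `retrievals`. Whether multi-step chains need their own table, rather than just several `llm_calls` rows per conversation, is exactly the part I'm unsure about.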
No comments yet.