samwho|2 months ago
I was wondering about this when I was reading around the topic. I can’t personally think of a reason you would need to segregate, though it wouldn’t surprise me if they do for some sort of compliance reason. I’m not sure, though; I would love to hear something first-party.
weird-eye-issue|2 months ago
With OpenAI at least you can specify the cache key and they even have this in the docs:
Use the prompt_cache_key parameter consistently across requests that share common prefixes. Select a granularity that keeps each unique prefix-prompt_cache_key combination below 15 requests per minute to avoid cache overflow.
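To make the granularity advice concrete, here is a rough sketch of one way to pick a key. It is not from the docs: the helper names `shard_count` and `cache_key`, the per-user hashing, and the idea of estimating traffic up front are all my own assumptions. It splits traffic for one shared prefix into enough buckets that each bucket should stay under the quoted 15 requests per minute, assuming users are spread roughly evenly.

```python
import hashlib

# Figure taken from the quoted docs; everything else here is hypothetical.
LIMIT_RPM = 15


def shard_count(expected_rpm: int, limit_rpm: int = LIMIT_RPM) -> int:
    """Smallest number of buckets that keeps each one strictly below limit_rpm,
    assuming requests are spread evenly across buckets."""
    return expected_rpm // limit_rpm + 1


def cache_key(prefix_name: str, user_id: str, expected_rpm: int) -> str:
    """Derive a stable key: the same user always lands in the same bucket."""
    shards = shard_count(expected_rpm)
    # Use a stable hash; Python's built-in hash() is salted per process.
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    shard = int.from_bytes(digest[:8], "big") % shards
    return f"{prefix_name}-{shard}"
```

The resulting string is what you would pass as prompt_cache_key on each request that shares that prefix. With low traffic everything collapses into a single bucket, so you only split when you actually need to.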
ambicapter|2 months ago
Why below a certain number? Usually in caches a high number of requests keeps the cached bit from expiring or being replaced, no?
psadri|2 months ago
samwho|2 months ago
gwern|2 months ago
gunalx|2 months ago
dustfinger|2 months ago
- Product logic / decision rules, such as: when to refund, how to triage tickets
- Internal taxonomies, schemas, or tool interfaces
- Safety and policy guardrails (which adversaries could try to route around)
- Brand voice, strategy, or proprietary workflows
That is just off the top of my head.
unknown|2 months ago
[deleted]