miketheman | 2 months ago
Search is an unbounded context and does not lend itself to caching very well, as every search can contain anything.
bastawhiz|2 months ago
And anyway, hit rates are going to be pretty good. You're not taking arbitrary queries, the domain is pretty narrow. Half the queries are going to be for requests, pytorch, numpy, httpx, and the other usual suspects.
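To make the hit-rate argument concrete, here is a rough simulation rather than a measurement. It assumes query popularity follows a Zipf-like curve, which is a guess and not real PyPI traffic; the term names, cache size, and query counts are all made up for illustration. It replays the queries through a small LRU cache and compares against uniformly random queries.

```python
import random
from collections import OrderedDict


def simulate_hit_rate(queries, capacity):
    """Replay queries through an LRU cache; return the fraction served from cache."""
    cache = OrderedDict()
    hits = 0
    for q in queries:
        if q in cache:
            hits += 1
            cache.move_to_end(q)  # mark as most recently used
        else:
            cache[q] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used entry
    return hits / len(queries)


rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical search terms; pkg0 stands in for the most popular query.
terms = [f"pkg{i}" for i in range(10_000)]

# Assumption: popularity falls off as 1/rank (Zipf-like).
zipf_weights = [1 / (rank + 1) for rank in range(len(terms))]

skewed = rng.choices(terms, weights=zipf_weights, k=50_000)
uniform = rng.choices(terms, k=50_000)  # the "every search can contain anything" case

skewed_rate = simulate_hit_rate(skewed, capacity=500)
uniform_rate = simulate_hit_rate(uniform, capacity=500)

print(f"skewed queries:  {skewed_rate:.0%} hit rate")
print(f"uniform queries: {uniform_rate:.0%} hit rate")
```

With a cache holding only 5% of the distinct terms, the skewed workload should hit far more often than the uniform one. How closely real search traffic matches either shape is the open question.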
froh|2 months ago
woodruffw|2 months ago
(Which isn’t to say I disagree with you about scale not being the main issue, just to offer some nuance. Another piece of nuance is the fact that distributions are the source of metadata but users think in terms of projects/releases.)
Kwpolska|2 months ago
firesteelrain|2 months ago
The moment you expose that same service to a ubiquitous CLI like pip, the workload changes qualitatively.
PyPI has the /simple endpoint that the CDN can handle.
It’s PyPI’s philosophy that search happens on the website, and pip has aligned with that. Understandably, pip doesn’t want to become a web scraper, so its search function remains disabled.