top | item 39729015

(no title)

Areading314 | 1 year ago

It seems unlikely that this C++ library written by a solo dev is somehow more secure than the Python standard library would be for such a security-sensitive task.

discuss

order

TkTech|1 year ago

Hi, can_ada (but not ada!) dev here. Ada is over 20k lines of well-tested and fuzzed source by 25+ developers, along with an accompanying research paper. It is the parser used in node.js and parses billions of URLs a day.

can_ada is simply a 60-line glue and packaging making it available with low overhead to Python.

Areading314|1 year ago

Ah, that makes more sense -- it might be a good idea to integrate with the upstream library as a submodule rather than lifting the actual .cpp/.h files into the bindings repo. That way people know the upstream C++ code is from a much more active project.

Despite my snarky comments, thank you for contributing to the python ecosystem, this does seem like a cool project for high performance URL parsing!

masklinn|1 year ago

Not in the sense of differential vulnerabilities, since the standard library refuses to match any sort of modern standard.

It's also

1. not a solo dev

2. Daniel Lemire

3. a serious engineering and research effort: https://arxiv.org/pdf/2311.10533.pdf