(no title)
jepler
|
1 year ago
My mind just balks at the idea of having so much source that a 2020s computer could take hours to index it. ctags is nothing special (both in terms of optimization but also the level of detail it gets to: just global function identifiers) and looks like it runs at about 400MB/s on a single core of an i5-1235U. But still it looks ctags could process about 100TB in 4 hours across 16 threads on a workstation class CPU...
DylanSp|1 year ago
menaerus|1 year ago
That said, Glean seems to be reusing the indexer from LLVM/clang for C and C++.
> The C++ indexer ("the clang indexer") is a wrapper over clang. The clang indexer is a drop in replacement for the C++ compiler that emits Glean facts instead of code. The wrapper is linked against libclang and libllvm.
[1] https://glean.software/docs/indexer/cxx
UltraSane|1 year ago
phyrex|1 year ago
kllrnohj|1 year ago
Try doing the same with C++ and more indexing options enabled, such as with something like universal-ctags, and a larger code base, say Android's repository aught to do it. Are you still getting 400MB/s? Nope.