gunta | 6 years ago

It's sad to see these "moved from X language to Y language and it became faster!" posts every time.

If the CPU is the biggest bottleneck, the best answer is not to optimize the algorithm by moving to a lower-level language, but rather to invest in a different architecture such as GPGPU or FPGA.

For example, this paper shows a significant speed-up for the PIP (point-in-polygon) algorithm, going from 15 hours (CPU) to a mere 11 seconds (GPU) in task load time. https://pdfs.semanticscholar.org/1e51/e3c681e1afc908a41ac253...
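
For anyone unfamiliar with PIP, here is a minimal CPU sketch of the usual ray-casting (even-odd) test. This is only an illustration, not the paper's code; the function name `point_in_polygon` and the sample square are made up for the example. The point to notice is that each query is independent of the others, which is why a large batch of them maps well onto a GPU.

```python
def point_in_polygon(px, py, polygon):
    """Ray-casting (even-odd) test; polygon is a list of (x, y) vertices."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point count.
        if (y1 > py) != (y2 > py):
            # x-coordinate where the edge crosses that horizontal line.
            x_cross = x1 + (py - y1) * (x2 - x1) / (y2 - y1)
            # Count crossings strictly to the right of the point.
            if px < x_cross:
                inside = not inside
    return inside


# Hypothetical sample data: an axis-aligned square and three query points.
square = [(0, 0), (4, 0), (4, 4), (0, 4)]
points = [(2, 2), (5, 2), (-1, 1)]

# Each query is independent, so this loop is trivially parallel.
results = [point_in_polygon(x, y, square) for x, y in points]
```

Points lying exactly on an edge are not handled specially in this sketch.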

justincormack | 6 years ago

That is a high-throughput design, not a low-latency design, which is what they are optimising for here. It is a very different design space.