Well, the core issue is still drawing. I took another look at some profiles, and it seems like it's not the renderer limiting this to 27k! I still had some stupid scene-graph traversal... Clustering and culling take 53us and 33us respectively, but the draw is 7ms. So a frame is like 7ms on the GPU side, and some 100-200us on the CPU side. Should really dive deeper and update the measurements for final results...
godelski|9 months ago
But also, really good work! You should be proud of this! Squeezing that much out of that hardware is no easy feat.