YOLOA model has knowledge for dozens of objects. At every frame it is computing lots of parameters not related to your case. This is very wasteful and the main reason you can't achieve your performance goals.
Already checked that and have some data for it, just hoped getting people positions is such a typical usecase someone would have done a better model than I could here
atoav|1 year ago
nasir|1 year ago
ekabod|1 year ago
atoav|1 year ago