top | item 46425840 (no title) dmarwicke | 2 months ago does this do continuous batching or just static? couldn't tell from the code discuss order hn newest ubermenchh|2 months ago yes it does continous batching along with paged attention and prefix caching. i am also goint to be adding some more inference techniques
ubermenchh|2 months ago yes it does continous batching along with paged attention and prefix caching. i am also goint to be adding some more inference techniques
ubermenchh|2 months ago