top | item 46425840

(no title)

dmarwicke | 2 months ago

does this do continuous batching or just static? couldn't tell from the code

discuss

order

ubermenchh|2 months ago

yes it does continous batching along with paged attention and prefix caching. i am also goint to be adding some more inference techniques