Practical Llama 3 inference in Java | github.com
4 points | mukel | 1 year ago | 1 comment | item 40384732
mukel|1 year ago
Llama3.java: featuring .GGUF file format support, Q8_0 and Q4_0 quantizations, fast matrix/vector multiplication routines using Java's Vector API; served by a simple CLI with a --chat mode to interact with the Llama 3 models.
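
For readers unfamiliar with Q8_0: in GGML-style formats it groups weights into blocks of 32 signed 8-bit values that share one scale, so a dot product can accumulate per block and apply the scale once. The following is a minimal sketch of that idea, not the code from Llama3.java. The class and method names (`Q8Sketch`, `quantize`, `dot`) are made up for illustration, the scale is kept as a 32-bit `float` for simplicity (the on-disk format stores it as a 16-bit float), and the inner loop is plain scalar Java rather than the Vector API.

```java
/** Illustrative sketch of Q8_0-style block quantization (hypothetical names). */
final class Q8Sketch {
    static final int BLOCK_SIZE = 32; // values per block in Q8_0

    /**
     * Quantizes x into int8 values plus one scale per block.
     * Assumes x.length is a multiple of BLOCK_SIZE,
     * qs.length == x.length and scales.length == x.length / BLOCK_SIZE.
     */
    static void quantize(float[] x, byte[] qs, float[] scales) {
        for (int b = 0; b < x.length / BLOCK_SIZE; b++) {
            int base = b * BLOCK_SIZE;
            float maxAbs = 0f;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                maxAbs = Math.max(maxAbs, Math.abs(x[base + i]));
            }
            float d = maxAbs / 127f;            // scale: largest magnitude maps to +/-127
            float inv = d == 0f ? 0f : 1f / d;  // avoid dividing by zero for all-zero blocks
            scales[b] = d;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                qs[base + i] = (byte) Math.round(x[base + i] * inv);
            }
        }
    }

    /** Dot product of quantized weights with a float vector, scaling once per block. */
    static float dot(byte[] qs, float[] scales, float[] v) {
        float sum = 0f;
        for (int b = 0; b < scales.length; b++) {
            int base = b * BLOCK_SIZE;
            float blockSum = 0f;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                blockSum += qs[base + i] * v[base + i];
            }
            sum += scales[b] * blockSum;
        }
        return sum;
    }

    public static void main(String[] args) {
        int n = 64;
        float[] w = new float[n];
        float[] v = new float[n];
        float exact = 0f;
        for (int i = 0; i < n; i++) {
            w[i] = (float) Math.sin(i * 0.1);
            v[i] = (float) Math.cos(i * 0.05);
            exact += w[i] * v[i];
        }
        byte[] qs = new byte[n];
        float[] scales = new float[n / BLOCK_SIZE];
        quantize(w, qs, scales);
        System.out.println("exact dot     = " + exact);
        System.out.println("quantized dot = " + dot(qs, scales, v));
    }
}
```

The per-block inner loop over 32 contiguous values is the part that SIMD-style code such as the Vector API would typically target. Q4_0 follows the same block idea with 4-bit values packed two per byte.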
unknown|1 year ago
[deleted]