top | item 45854075 (no title) lordofgibbons | 3 months ago At what quantization? And if it is in fact quantized below fp8, how is the performance impacted on all the various benchmarks? discuss order hn newest antonvs|3 months ago They claim they don't use quantization.The reason for their speed is this chip: https://www.cerebras.ai/chip
antonvs|3 months ago They claim they don't use quantization.The reason for their speed is this chip: https://www.cerebras.ai/chip
antonvs|3 months ago
The reason for their speed is this chip: https://www.cerebras.ai/chip