(no title)
throwawayish | 9 years ago
- 1S/2S is obviously where the pie is. Few servers are 4S.
- 8 DDR4 channels per socket is twice the memory bandwidth of 2011, and still more than LGA-36712312whateverthenumberwas
- First x86 server platform with SHA1/2 acceleration
- 128 PCIe lanes in a 1S system is unprecedented
All in all Naples seems like a very interesting platform for throughput-intensive applications. Overall it seems that Sun with it's Niagara-approach (massive number of threads, lots of I/O on-chip) was just a few years too early (and likely a few thousands / system to expensive ;)
semi-extrinsic|9 years ago
Yes, definitely drooling at this. Assuming a workload that doesn't eat too much CPU, this would make for a relatively cheap and hassle-free non-blocking 8 GPU @ 16x PCIe workstation. I wants one.
sorenjan|9 years ago
AnthonyMouse|9 years ago
This one will be interesting. The current Ryzen (like most of the Intel desktop range) has two channels, but everyone has been benchmarking it against the i7-6900K because they both have eight cores. The i7-6900K is the workstation LGA 2011 with four channels. If the workstation Ryzen will have eight channels...
gigatexal|9 years ago
binarycrusader|9 years ago
By comparison, the performance of sparc substantially improved moving from the T1, T2 to T3+. The T1 used a round-robin policy to issue instructions from the next active thread each cycle, supporting up to 8 fine-grained threads in total. That made it more like a barrel processor.
Starting with the T3, two of the threads could be executed simultaneously. Then, starting with the T4, sparc added dynamic threading and out-of-order execution. Later versions are even faster and clock speeds have also risen considerably.
alimbada|9 years ago
tyingq|9 years ago
agumonkey|9 years ago
kev009|9 years ago
Every mainframe interface is basically an offload interface.. "computers" DMAing and processing to the CPs and each other. Every I/O device has a command processor, so it can handle channel errors and integrated pcie errors in a way PCs cannot.
A PC with Chelsio NICs doing TCP offload with direct data placement or RDMA as well as Fiber Channel storage would be mini/mainframe-ish.
PeCaN|9 years ago
mtgx|9 years ago
And AMD should dump SHA1 acceleration in the next generation.
drzaiusapelord|9 years ago
The cost to have that on silicon is probably close to zero. If you think SHA1 is just going to magically disappear because you want it to, well, you'll be in for a SHA1 sized surprise. Our grandkids will still have SHA1 acceleration.
>ARMv8 has had it for like 2-3 years now...
Because ARM cores don't remotely have the CPU heft an Intel x86/64 chip has, so ARM needs all this acceleration because its typically used in very low power mobile scenarios. On top of that, Intel claims AES-NI can be used to accelerate SHA1.
https://software.intel.com/en-us/articles/improving-the-perf...
throwawayish|9 years ago
yuhong|9 years ago
gankedfrank|9 years ago
[deleted]
tw04|9 years ago
https://en.wikipedia.org/wiki/Intel_SHA_extensions
>There are seven new SSE-based instructions, four supporting SHA-1 and three for SHA-256:
>SHA1RNDS4, SHA1NEXTE, SHA1MSG1, SHA1MSG2, SHA256RNDS2, SHA256MSG1, SHA256MSG2
throwawayish|9 years ago
floatboth|9 years ago
The only processors so far with these extensions are low power Goldmont chips.
https://github.com/weidai11/cryptopp/issues/139