In this deck, Numascale describes how the company achieved a world-record on the McCalpin STREAM benchmark using their innovative scale-out to scale-up architecture. The benchmark measures sustainable memory bandwidth and the corresponding computation rate for simple vector kernels.
Numascale’s cache coherent shared memory system, which was targeted for big data analytics, reached 10.06 TBytes/second for the Scale function. This feat ranked 53% higher than the next most scalable system on the list, which was only able to achieve 6.59 TBytes/second.
Learn more: http://numascale.com
Watch the video presentation: http://wp.me/p3RLHQ-ehv
14. 14
Stream
Benchmark
• 108
nodes
• 6
x
6
x
3
Torus
• 5
184
CPU
cores
• 58
TFlops
• 20.7
TBytes
Shared
Memory
• Single
Image
OS
• 10TB/s
Memory
BW
15. 15
McCalpin
Stream
Benchmark
--------------------------------------------------------------------------------------------
Sub. Date Machine ID ncpus COPY SCALE ADD TRIAD
--------------------------------------------------------------------------------------------
2015.04.08 NumaConnect_648node 1296 9139226.2 10062237.6 8985643.0 8871850.0
COPY:
9.1
3TBytes/s
SCALE:
10.06TBytes/s
ADD:
8.98TBytes/s
TRIAD:
8.87TBytes/s
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
Function
Best
Rate
MB/s
Avg
time
Min
time
Max
time
Copy:
9139226.2
0.240336
0.236344
0.246761
Scale:
10062237.6
0.217982
0.214664
0.220526
Add:
8985643.0
0.361473
0.360575
0.363363
Triad:
8871850.0
0.366032
0.365200
0.366646
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
16. 16
Standard
Cluster
Architecture
Caches
CPUs
I/O
Memory
Caches Caches
CPUs
I/O
Memory
Caches Caches
CPUs
I/O
Memory
Caches Caches
CPUs
I/O
Memory
Caches
Network
Switch
Standard
Server
17. 17
Scale-Up and Scale-Out Capacity
• Single
System
Image
or
Multiple
Partitions
in
one
Fabric
• Max
Numbers
- 256
TeraBytes
Physical
Address
Space
- 4096
Nodes
- 196
608
cores