SlideShare a Scribd company logo
1 of 37
Download to read offline
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
1
Lustre File System on
ARM
September 2017
Architecture Evaluation v1.1
Carlos Thomaz
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
2 Motivations
▶ ARM momentum
• 64bit evolution
• Recent debuts on HPC
• Traction in new areas such as Machine Learning and AI
▶ Another option in the market
• Intel established as de-facto standard
• Market needs competitors; cost reduction
▶ Technical reasons
• Potential high bandwidth, high throughput processor
• Low power consumption option
• The Technical challenge
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
3 The Cavium ThunderX Architecture
▶ SoC architecture
• ISA: ARMV8
root@s167:/proc# lscpu
Architecture: aarch64
Byte Order: Little
Endian
CPU(s): 96
On-line CPU(s) list: 0-95
Thread(s) per core: 1
Core(s) per socket: 48
Socket(s): 2
NUMA node(s): 2
L1d cache: 32K
L1i cache: 78KL2
cache: 16384K
NUMA node0 CPU(s): 0-47
NUMA node1 CPU(s): 48-95
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
4 ARM ThunderX and Intel Xeon
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
5 ARM Ecosystem
Courtesy of ARM – http://arm.com/hpc
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
6 DDN Goals evaluating ARM
▶ Understand if it is a viable option for mid/long
term future products
▶ Understand what’s the effort necessary to
make Lustre running optimally on ARM (client
and server-side)
▶ Understand how Lustre and general I/O
behaves on ARM SoC architecture
▶ Contribute to the community
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
7 Test Environment used for the study
IB-SRP FDR (56Gbps)
SFA7700KX Block
Flash Array
ES7K IB
Embedded NL-SAS 7.2K
IB Switch 36p
IBFDR56Gbps
IBFDR56Gbps
IB FDR (56Gbps)
s164
Es7k-vm1 Es7k-vm1
s165
s166
s167
IB-SRPFDR(56Gbps)
IB FDR
IB FDR
IB FDR
IBFDR
IBFDR
IBFDR
s161
s162
s163
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
8 Test environment
▶ 4 x Gigabyte, Cavium ThunderX2 ARM servers
• 128GB RAM, 3 x 40GbE | 4 x 10GbE, 1 x IB FDR 56Gbps
▶ 1 x SFA7700-IB (ib-srp)
• Full flash array, 8 x RAID6 LUNs (200GB SSDs)
▶ 1 x ES7KE-IB (Intel based, DDN appliance)
• Embedded Lustre appliance, 2 controllers, 8 RAID6 pools
(OSTs), 2 SSD RAID1 pools for MDT
▶ 3 x DELL R620 servers
• 2 sockets, 12 cores total, 64GB RAM
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
9 Lustre File System configuration
▶ ARM Servers and Clients
• OS: Ubuntu 16.04.03 LTS – Xenial Xerus
• Kernel: Linux s166 4.4.0-31-generic #50-Ubuntu SMP Wed Jul 13
00:06:30 UTC 2016 aarch64 aarch64 aarch64 GNU/Linux
• Lustre: 2.10.0.0 + patches
o LU-9950, LU-9951, LU-9564 (backported for Ubuntu/debian)
▶ X86 clients
• OS: CentOS Linux release 7.2.1511 (Core)
• Kernel: Linux s162 3.10.0-327.el7.x86_64 #1 SMP Thu Nov 19 22:10:57
UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
• Lustre: 2.10.0.0
▶ ES7K Embedded Lustre Server
• OS: CentOS Linux release 7.3.1611 (Core)
• Kernel: Linux vm01-es7k01 3.10.0-514.21.2.el7.x86_64.lustre #1 SMP
Wed Jun 21 03:34:21 PDT 2017 x86_64 x86_64 x86_64 GNU/Linux
• Lustre: DDN Lustre 2.7.x + Patches
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
10
10
Stand alone ARM servers
Baseline Performance
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
11
Single ARM Server – first glimpse
Memory Bandwidth (stream)
ARM# gcc-6 –O3 -march=ARMv8.1-a -fopenmp -mcmodel=large  -DSTREAM_ARRAY_SIZE=2600000000 -Wall stream.c -o stream_h
DELL# gcc -Ofast -fopenmp stream.c -Wall -m64 -mcmodel=medium -DSTREAM_ARRAY_SIZE=1100000000 -o stream_h
0
20000
40000
60000
80000
100000
120000
140000
160000
180000
200000
Copy Scale Add Triad
Memory bandwidth - Absolute
results
ARM DELL Project ThunderX2
0
500
1000
1500
2000
2500
3000
3500
4000
Copy Scale Add Triad
Memory bandwidth -
Normalized (per core) results
ARM DELL Project ThunderX2
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
12
IB RDMA Network test with IB_SEND_RW
Sanity tests
root@s167:~# ib_send_bw -a -b -c UC -z 192.168.0.185
root@s165:~# ib_send_bw -a -b -c UC –z
************************************
*.Waiting for client to connect... *
************************************
---------------------------------------------------------------------------------------
Send Bidirectional BW Test
Dual-port : OFF Device : mlx4_0
Number of qps : 1 Transport type : IB
Connection type : UC Using SRQ : OFF
TX depth : 128
RX depth : 1000
CQ Moderation : 100
Mtu : 2048[B]
Link type : IB
Max inline data : 0[B]
rdma_cm QPs : OFF
Data ex. method : rdma_cm
---------------------------------------------------------------------------------------
local address: LID 0x25 QPN 0x0255 PSN 0xfe41b5
remote address: LID 0x26 QPN 0x47d1 PSN 0x8db89f
---------------------------------------------------------------------------------------
#bytes #iterations BW peak[MB/sec] BW average[MB/sec] MsgRate[Mpps]
2 1000 9.01 7.98 4.185542
1000 16.95 16.42 4.303677
1000 33.17 32.75 4.292886
1000 66.34 65.39 4.285470
32 1000 132.68 130.73 4.283882
64 1000 265.36 262.16 4.295241
<SNIP>
131072 1000 8250.48 8244.47 0.065956
262144 1000 8263.43 8256.41 0.033026
524288 1000 8252.15 8246.37 0.016493
1048576 1000 8256.10 8248.31 0.008248
2097152 1000 8254.87 8251.13 0.004126
4194304 1000 8256.12 8251.63 0.002063
8388608 1000 8138.98 8138.19 0.001017
---------------------------------------------------------------------------------------
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
13
ARM Server – point to point IB BW
MPI OSU BW and BIBW
root@s165:/mnt/lustre# mpirun_rsh -hostfile
/mnt/lustre/bin/mach -n 2 /mnt/lustre/mvapich/bin/osu_bw
# OSU MPI Bandwidth Test v5.3.2
# Size Bandwidth (MB/s)
1 0.22
2 0.84
4 2.14
8 4.28
16 8.56
32 17.05
64 33.76
128 65.66
256 115.18
512 245.05
1024 399.22
2048 735.48
4096 1090.75
8192 1235.69
16384 2838.55
32768 4864.02
65536 4688.12
131072 5270.83
262144 5333.59
524288 5297.62
1048576 5387.66
2097152 5400.73
4194304 5415.79
root@s165:/mnt/lustre# mpirun_rsh -hostfile /mnt/lustre/bin/mach
-n 2 /mnt/lustre/mvapich/bin/osu_bibw
# OSU MPI Bi-Directional Bandwidth Test v5.3.2
# Size Bandwidth (MB/s)
1 0.28
2 1.25
4 2.51
8 4.97
16 4.02
32 19.13
64 38.35
128 72.64
256 142.76
512 269.86
1024 519.32
2048 872.59
4096 1261.87
8192 1449.39
16384 2608.61
32768 4686.06
65536 7477.37
131072 8310.94
262144 8436.23
524288 8539.24
1048576 8597.96
2097152 8624.46
4194304 8472.55
Unidirectional BW Bidirectional BW
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
14 Single ARM server – storage backend
▶ Simple test to evaluate storage backend – FIO
• 1 x ARM server connected to SFA7700X via IB-SRP (FDR)
root@s165:/sys/block# fio --name=foo --rw=read --bs=1m --runtime=30 --time_based --ioengine=libaio --iodepth=64 --direct=1 --numjobs=8 --
group_reporting --filename=/dev/sdb --filename=/dev/sdc
foo: (g=0): rw=read, bs=(R) 1024KiB-1024KiB, (W) 1024KiB-1024KiB, (T) 1024KiB-1024KiB, ioengine=libaio, iodepth=64
...
fio-3.0
Starting 8 processes
Jobs: 8 (f=16): [R(8)][100.0%][r=5260MiB/s,w=0KiB/s][r=5260,w=0 IOPS][eta 00m:00s]
foo: (groupid=0, jobs=8): err= 0: pid=37191: Tue Sep 26 18:51:52 2017
read: IOPS=5426, BW=5427MiB/s (5690MB/s)(159GiB/30074msec)
slat (usec): min=191, max=101843, avg=1202.10, stdev=5434.79
clat (usec): min=342, max=562238, avg=92961.48, stdev=54905.53
lat (usec): min=807, max=562616, avg=94164.77, stdev=55509.54
clat percentiles (msec):
| 1.00th=[ 7], 5.00th=[ 17], 10.00th=[ 31], 20.00th=[ 48],
| 30.00th=[ 63], 40.00th=[ 75], 50.00th=[ 86], 60.00th=[ 96],
| 70.00th=[ 112], 80.00th=[ 136], 90.00th=[ 165], 95.00th=[ 190],
| 99.00th=[ 251], 99.50th=[ 284], 99.90th=[ 510], 99.95th=[ 527],
| 99.99th=[ 550]
bw ( KiB/s): min=153600, max=983040, per=12.50%, avg=694524.80, stdev=105260.97, samples=480
iops : min= 150, max= 960, avg=678.12, stdev=102.78, samples=480
lat (usec) : 500=0.01%, 750=0.01%, 1000=0.01%
lat (msec) : 2=0.06%, 4=0.31%, 10=1.87%, 20=3.92%, 50=15.35%
lat (msec) : 100=41.89%, 250=35.57%, 500=0.90%, 750=0.11%
cpu : usr=0.34%, sys=31.73%, ctx=19114, majf=0, minf=21757
<SNIP>
Run status group 0 (all jobs):
READ: bw=5427MiB/s (5690MB/s), 5427MiB/s-5427MiB/s (5690MB/s-5690MB/s), io=159GiB (171GB), run=30074-30074msec
Disk stats (read/write):
sdb: ios=55827/0, merge=55472/0, ticks=2579464/0, in_queue=2582500, util=94.10%
sdc: ios=53850/0, merge=57636/0, ticks=2780888/0, in_queue=2793204, util=95.68%
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
15
15
Part 1 – ARM Server
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
16
0
500
1000
1500
2000
2500
3000
3500
4000
1 2 4 8 12 16 24 32 64
IOR - Single Client Performance - 4MB RPCs
/mnt/arm/bin/ior.arm.mvapich -a POSIX -b 1g -r -w -F -B -t 4m -o
/mnt/arm/file.out
Writes 4M - Real IO Reads 4M - Real IO
IOR Single Client Performance – Multiple
Threads
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
17
IOR Single Client Performance – Multiple
Threads
0
1000
2000
3000
4000
5000
6000
1 2 4 8 12 16 24 32 64
IOR Single Client Performance - 4MB RPCs - REGULAR vs FAKE IO
/mnt/arm/bin/ior.arm.mvapich -a POSIX -b 1g -r -w -F -B -t 4m -o /mnt/arm/file.out
Writes 4M - Fake IO Reads 4M - Fake IO Writes 4M - Real IO Reads 4M - Real IO
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
18
IOR Results – end to end multiple clients
(Real I/O)
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
2 4 6 8 10 12 14 16 24 32 48 64 96 128
Multiple Client Perofmance - 2 to 128 threads, 16MB RPC
Command line used: /mnt/arm/bin/ior.arm.mvapich -a POSIX -b 1g -r -w -B -F -t 16m -
o /mnt/arm/file-out -vv
Reads (4MB) Writes (4MB) Reads (16MB) Writes (16MB)
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
19
X86 clients against Lustre ARM server
IOR Sequential Performance
0
1000
2000
3000
4000
5000
6000
0 5 10 15 20 25 30 35 40
Multiple Client IOR Performance - x86 Clients against ARM Server
/mnt/arm/bin/ior.x86.mvapich -a POSIX -b 1g -r -w -F -B -t 16m -o
/mnt/arm/file.out
x86 Writes (16MB) x86 Reads (16MB)
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
20
ARM and x86 Clients comparison
IOR, multiple clients - Sequential
0
1000
2000
3000
4000
5000
6000
0 10 20 30 40 50 60 70
ARM and x86 Clients - IOR Sequential Reads / Writes (ARM Server)
ARM Writes (16MB) ARM Reads (16MB) x86 Writes (16MB) x86 Reads (16MB)
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
21
Sniplet from brw_stats during a set of
runs (2 to 128 threads)
Ltest-OST0000
<snip>
read | write
disk I/O size ios % cum % | ios % cum %
4K: 127 0 0 | 1 0 0
8K: 146 0 1 | 0 0 0
16K: 403 2 4 | 0 0 0
32K: 681 4 8 | 0 0 0
64K: 1590 10 19 | 0 0 0
128K: 1565 10 29 | 0 0 0
256K: 631 4 33 | 0 0 0
512K: 89 0 34 | 0 0 0
1M: 9905 65 100 | 169184 99 100
Ltest-OST0001
<snip>
read | write
disk I/O size ios % cum % | ios % cum %
4K: 44 0 0 | 1 0 0
8K: 44 0 0 | 0 0 0
16K: 119 0 1 | 0 0 0
32K: 245 1 3 | 0 0 0
64K: 461 3 7 | 0 0 0
128K: 452 3 10 | 0 0 0
256K: 148 1 12 | 0 0 0
512K: 30 0 12 | 0 0 0
1M: 10940 87 100 | 168096 99 100
Very much the same for all other OSTs ltest-OST000[0-6]
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
22
22
Part II – ARM Clients
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
23 Single Client Performance comparison
0
1000
2000
3000
4000
5000
6000
1 2 4 6 8 10 12 16 24 32 64 72 96
Single Client Performance (ARM x x86) - ES7K
/mnt/arm/bin/ior.x86.mvapich -a POSIX -b 1g -r -w -F -B -t 16m -o
/mnt/es7k/file.out
x86 Writes (16MB) ARM Writes (16MB) x86 Reads (16MB) ARM Reads (16MB)
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
24 Multiple Client performance comparison
0
1000
2000
3000
4000
5000
6000
7000
2 3 4 6 8 9 12 15 16 18 21 24 27 30 32 33 36 64
Multiple Client Performance (ARM x x86) - ES7K
/mnt/arm/bin/ior.x86.mvapich -a POSIX -b 1g -r -w -F -B -t 16m -o /mnt/es7k/file.ou
x86 Writes (16MB) ARM Writes (16MB) x86 Reads (16MB) ARM Reads (16MB)
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
25
25
Preliminary Conclusions
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
26 ARM Server - RAW vs Lustre
▶ RAW performance indicates the ARM systems
could potentially sustain high bandwidth
• We achieved about 7GB/sec reading/writing into and from
a Flash based storage that is capable of doing 10GB/sec
I/O.
• The bottleneck is the IB-FDR used with IB-SRP as
connection
• Concurrent Infiniband traffic also performs well. Tests
executed demonstrated about 6GB/sec unidirectional BW
and about 9GB/sec bi-directional on both IB_RDMA calls
(ib_send_bw) and also on MPI layer.
• Memory bandwith per core is much lower than other x86
architecture that probably will affect Lustre IO too.
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
27
”Noise” - Unpredictability on the Server
side
▶ We observed noise and unpredictable server
behavior when scaling up the IO workload thus
increasing the number of OSS service threads.
• Such behavior is related to the highly scalable number of
cores on two NUMA domains.
• Changing LNET partitions plays a little but yet visible effect on
server performance.
• Lustre PIO _should_ help since the effects we are seeing on
ARM servers are similar to KNL nodes (high core count, low
frequency) – Avoiding serialization should help.
▶ The best numbers are observed when using 24 to
32 cores
• More than 32 cores causes noisy and the results become
unpredictable. This effect is known, specially on high count
core SoC architecture.
• No L3 cache line and all coherent helps to minimize the effect
• 4 LNET partitions seems to be optimal for the tested CPU
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
28 Server Performance
▶ Reads seems reasonable, writes needs
improvement
• Lustre back-end write performance is limited to 3-
3.5GB/sec
o It’s about 50% of RAW I/O performance
o Client concurrency slow down to 2-2.5GB/sec
– Increasing the default number of OSS service threads didn’t take
much effect (default 360).
• Lustre back-end read performance seems to be max out to
5-5.5GB/sec
o Compared to other Lustre back-end, Read performance seems
good.
o Ext4 can provide maximum of 6-6.5GB/sec (for this test
environment).
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
29 Minimizing NUMA effects
root@s165:~# cat /proc/sys/lnet/cpu_partition_table
0 : 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
1 : 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47
2 : 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
3 : 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95
root@s165:~# cat /etc/modprobe.d/lustre.conf
options lnet networks=o2ib(ib0)options
libcfs cpu_npartitions=4options
libcfs cpu_pattern="”
Change LNET partition table
- Initially set to 8 partitions, brought the inflexion point lower
- 4 partitions was the setting that provides better and more reliable
performance
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
30 ARM Lustre Clients
▶ Overall Performance equivalent to OLD Xeons,
but likely to be half of the current ones.
• 24 core ARM matches the 12 core Haswells (reads and
writes)
• Ability to write faster on an optimized DDN ES7K also
helps to blame ARM server for lower numbers
▶ Similar type of NUMA issues found on client,
but harder to understand and tune.
• Benefits of LNET partitions and other NUMA tuning still not
clear
• Applications can probably have better behavior using
numactl
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
31 Lustre
▶ Build procedure required three patches
• LU-9950 and LU-9951
o Build process, not really a Lustre code change
o Patches on JIRA
• LU-9564 backported (in order to build server on
Ubuntu/debian)
• Not very complicate, but require some cleanup in the
process (built on Ubuntu - caused some library
incompatibilities)
▶ The process overall is easy and straight
forward
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
32 What next?
▶ Study still in very preliminary stage
▶ More research on the server side
• We are interesting on alternatives for the current offerings
• Evaluate SoC features for better utilization (crypto, RAID
engine, virtualization)
• Profile IO and general workloads
▶ Need to test 40GbE
• Native chips and embedded Switch on SoC is supposedly to
deliver better I/O balance (opposed to utilization of single IB
card)
▶ Experiment in larger scale
• Looking for large environments willing to cooperate
▶ Lustre side
• P0: Run tests with PIO and compare results
• Profile writes
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
33
33
Thank you
Carlos Thomaz
Thanks for the help from Frank Leers, Gu Zheng and rest of the team.
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
34
34
Extra slides
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
35 Building Lustre
root@s164:~ git clone http://kernel.ubuntu.com/git-repos/ubuntu/ubuntu-xenial.git/ ubuntu-kernel
root@s164:~/ubuntu-kernel# uname -r 4.4.0-93-generic
root@s164:~/ubuntu-kernel# git tag | grep 4.4.0-93 Ubuntu-4.4.0-93.116
root@s164:~/ubuntu-kernel# git checkout Ubuntu-4.4.0-93.116
▶ Prepare kernel source
▶ Configure Kernel source
root@s164:~/ubuntu-kernel# touch .scmversion
root@s164:~/ubuntu-kernel# cp /boot/config-`uname -r` .config
root@s164:~/ubuntu-kernel# cp /usr/src/linux-headers-`uname -r`/Module.symvers
▶ Submitted patches in JIRA
• https://review.whamcloud.com/#/c/27323/
• https://jira.hpdd.intel.com/browse/LU-9950
• https://jira.hpdd.intel.com/browse/LU-9951
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
36 Building Lustre
▶ Patch Makefile
root@s164:~/ubuntu-kernel# git diff
diff --git a/Makefile b/Makefile
index f1fee0c..5f235dc 100644
--- a/Makefile
+++ b/Makefile
@@ -1,7 +1,8 @@
VERSION = 4
PATCHLEVEL = 4
-SUBLEVEL = 79
-EXTRAVERSION =
+SUBLEVEL = 0
+EXTRAVERSION = -93-generic
+
NAME = Blurry Fish Butt
# *DOCUMENTATION*
root@s164:~/ubuntu-kernel# make modules_prepare
▶ Patch Lustre
• LU-9950, LU-9951, review.whamcloud.com/#/c/27323/
▶ Build Lustre
bash autogen.sh && ./configure --enable-server --enable-ldiskfs --with-zfs=no --with-o2ib=/usr/src/ofa_kernel/default/ 
--with-linux=/root/ubuntu-kernel/ --enable-module && make debs
ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change.
37 Installing e2fsprogs
▶ Build and replace e2fsprogs
git clone git://git.hpdd.intel.com/tools/e2fsprogs.git
cd e2fsprogs git checkout v1.42.13.wc6 -b v1.42.13.wc6
wget -P ../ http://archive.ubuntu.com/ubuntu/pool/main/e/e2fsprogs/e2fsprogs_1.42.13-1ubuntu1.debian.tar.xz
tar --exclude "debian/changelog" -xf ../e2fsprogs_1.42.13-1ubuntu1.debian.tar.xz
sed -i 's/ext2_types-wrapper.h$//g' lib/ext2fs/Makefile.in
dpkg-buildpackage -b -us -uc
dpkg -i libcomerr2_1.42.13-1_arm64.deb libss2_1.42.13-1_arm64.deb e2fsck-static_1.42.13-1_arm64.deb e2fslibs_1.42.13-1_arm64.deb
e2fsprogs_1.42.13-1_arm64.deb

More Related Content

What's hot

AMD EPYC 7002 World Records
AMD EPYC 7002 World RecordsAMD EPYC 7002 World Records
AMD EPYC 7002 World RecordsAMD
 
Pvma32xx series-raid
Pvma32xx series-raidPvma32xx series-raid
Pvma32xx series-raidTTEC
 
How To Train Your Calxeda EnergyCore
How To Train Your  Calxeda EnergyCoreHow To Train Your  Calxeda EnergyCore
How To Train Your Calxeda EnergyCoreNaoto MATSUMOTO
 
Data Center: Cloud & Convergencia
Data Center: Cloud & ConvergenciaData Center: Cloud & Convergencia
Data Center: Cloud & ConvergenciaLogicalis Latam
 
Network Configuration Example: Junos OS NAT Configuration Examples for Screen...
Network Configuration Example: Junos OS NAT Configuration Examples for Screen...Network Configuration Example: Junos OS NAT Configuration Examples for Screen...
Network Configuration Example: Junos OS NAT Configuration Examples for Screen...Juniper Networks
 
Dom0less - Xen Developer Summit 2019
Dom0less  - Xen Developer Summit 2019Dom0less  - Xen Developer Summit 2019
Dom0less - Xen Developer Summit 2019Stefano Stabellini
 
In-Network Acceleration with FPGA (MEMO)
In-Network Acceleration with FPGA (MEMO)In-Network Acceleration with FPGA (MEMO)
In-Network Acceleration with FPGA (MEMO)Naoto MATSUMOTO
 
Gdc Sa 2100 A Brochure Final
Gdc Sa 2100 A Brochure FinalGdc Sa 2100 A Brochure Final
Gdc Sa 2100 A Brochure Finalguestd1c2eb7
 
PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...
PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...
PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...PROIDEA
 
Avb pov 2017 v2
Avb pov 2017 v2Avb pov 2017 v2
Avb pov 2017 v2Jeff Green
 
SLES Performance Enhancements for Large NUMA Systems
SLES Performance Enhancements for Large NUMA SystemsSLES Performance Enhancements for Large NUMA Systems
SLES Performance Enhancements for Large NUMA SystemsDavidlohr Bueso
 
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...The Linux Foundation
 
從INTEL技術談網路卡
從INTEL技術談網路卡從INTEL技術談網路卡
從INTEL技術談網路卡zman
 

What's hot (20)

AMD EPYC 7002 World Records
AMD EPYC 7002 World RecordsAMD EPYC 7002 World Records
AMD EPYC 7002 World Records
 
Juniper 40G and 100G
Juniper 40G and 100GJuniper 40G and 100G
Juniper 40G and 100G
 
Pvma32xx series-raid
Pvma32xx series-raidPvma32xx series-raid
Pvma32xx series-raid
 
How To Train Your Calxeda EnergyCore
How To Train Your  Calxeda EnergyCoreHow To Train Your  Calxeda EnergyCore
How To Train Your Calxeda EnergyCore
 
Data Center: Cloud & Convergencia
Data Center: Cloud & ConvergenciaData Center: Cloud & Convergencia
Data Center: Cloud & Convergencia
 
Deploying Carrier Ethernet features on ASR 9000
Deploying Carrier Ethernet features on ASR 9000Deploying Carrier Ethernet features on ASR 9000
Deploying Carrier Ethernet features on ASR 9000
 
Network Configuration Example: Junos OS NAT Configuration Examples for Screen...
Network Configuration Example: Junos OS NAT Configuration Examples for Screen...Network Configuration Example: Junos OS NAT Configuration Examples for Screen...
Network Configuration Example: Junos OS NAT Configuration Examples for Screen...
 
Dom0less - Xen Developer Summit 2019
Dom0less  - Xen Developer Summit 2019Dom0less  - Xen Developer Summit 2019
Dom0less - Xen Developer Summit 2019
 
Supercomputers and Cloud Games
Supercomputers and Cloud GamesSupercomputers and Cloud Games
Supercomputers and Cloud Games
 
In-Network Acceleration with FPGA (MEMO)
In-Network Acceleration with FPGA (MEMO)In-Network Acceleration with FPGA (MEMO)
In-Network Acceleration with FPGA (MEMO)
 
SUN
SUNSUN
SUN
 
knoSYS_Hardware
knoSYS_HardwareknoSYS_Hardware
knoSYS_Hardware
 
Gdc Sa 2100 A Brochure Final
Gdc Sa 2100 A Brochure FinalGdc Sa 2100 A Brochure Final
Gdc Sa 2100 A Brochure Final
 
PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...
PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...
PLNOG 5: Piotr Szołkowski - Data Center i nie tylko...
 
Avb pov 2017 v2
Avb pov 2017 v2Avb pov 2017 v2
Avb pov 2017 v2
 
SLES Performance Enhancements for Large NUMA Systems
SLES Performance Enhancements for Large NUMA SystemsSLES Performance Enhancements for Large NUMA Systems
SLES Performance Enhancements for Large NUMA Systems
 
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...OSSJP/ALS19:  The Road to Safety Certification: Overcoming Community Challeng...
OSSJP/ALS19: The Road to Safety Certification: Overcoming Community Challeng...
 
Linux PV on HVM
Linux PV on HVMLinux PV on HVM
Linux PV on HVM
 
從INTEL技術談網路卡
從INTEL技術談網路卡從INTEL技術談網路卡
從INTEL技術談網路卡
 
Snapdragon
SnapdragonSnapdragon
Snapdragon
 

Similar to Lustre File System on ARM

Cisco EuroMPI'13 vendor session presentation
Cisco EuroMPI'13 vendor session presentationCisco EuroMPI'13 vendor session presentation
Cisco EuroMPI'13 vendor session presentationJeff Squyres
 
Cisco usNIC: how it works, how it is used in Open MPI
Cisco usNIC: how it works, how it is used in Open MPICisco usNIC: how it works, how it is used in Open MPI
Cisco usNIC: how it works, how it is used in Open MPIJeff Squyres
 
High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...
High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...
High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...LinuxCon ContainerCon CloudOpen China
 
5 Things You Need to Know About Enterprise Fl
 5 Things You Need to Know About Enterprise Fl 5 Things You Need to Know About Enterprise Fl
5 Things You Need to Know About Enterprise FlWestern Digital
 
Presentation deploying cloud based services
Presentation   deploying cloud based servicesPresentation   deploying cloud based services
Presentation deploying cloud based servicesxKinAnx
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Ontico
 
Deep Dive On Intel Optane SSDs And New Server Platforms
Deep Dive On Intel Optane SSDs And New Server PlatformsDeep Dive On Intel Optane SSDs And New Server Platforms
Deep Dive On Intel Optane SSDs And New Server PlatformsNEXTtour
 
Ceph Day New York 2014: Ceph, a physical perspective
Ceph Day New York 2014: Ceph, a physical perspective Ceph Day New York 2014: Ceph, a physical perspective
Ceph Day New York 2014: Ceph, a physical perspective Ceph Community
 
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStorDelivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStorRebekah Rodriguez
 
3. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 20133. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 2013Taldor Group
 
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptxQ1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptxMemory Fabric Forum
 
POLYTEDA PowerDRC/LVS overview
POLYTEDA PowerDRC/LVS overviewPOLYTEDA PowerDRC/LVS overview
POLYTEDA PowerDRC/LVS overviewAlexander Grudanov
 
2014/09/02 Cisco UCS HPC @ ANL
2014/09/02 Cisco UCS HPC @ ANL2014/09/02 Cisco UCS HPC @ ANL
2014/09/02 Cisco UCS HPC @ ANLdgoodell
 
Q1 Memory Fabric Forum: VMware Memory Vision
Q1 Memory Fabric Forum: VMware Memory VisionQ1 Memory Fabric Forum: VMware Memory Vision
Q1 Memory Fabric Forum: VMware Memory VisionMemory Fabric Forum
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSCeph Community
 
Oow2007 performance
Oow2007 performanceOow2007 performance
Oow2007 performanceRicky Zhu
 
Supermicro X12 Performance Update
Supermicro X12 Performance UpdateSupermicro X12 Performance Update
Supermicro X12 Performance UpdateRebekah Rodriguez
 
DPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingDPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingMichelle Holley
 
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral ProgramBig Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Programinside-BigData.com
 

Similar to Lustre File System on ARM (20)

Cisco EuroMPI'13 vendor session presentation
Cisco EuroMPI'13 vendor session presentationCisco EuroMPI'13 vendor session presentation
Cisco EuroMPI'13 vendor session presentation
 
Cisco usNIC: how it works, how it is used in Open MPI
Cisco usNIC: how it works, how it is used in Open MPICisco usNIC: how it works, how it is used in Open MPI
Cisco usNIC: how it works, how it is used in Open MPI
 
High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...
High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...
High Performance Linux Virtual Machine on Microsoft Azure: SR-IOV Networking ...
 
5 Things You Need to Know About Enterprise Fl
 5 Things You Need to Know About Enterprise Fl 5 Things You Need to Know About Enterprise Fl
5 Things You Need to Know About Enterprise Fl
 
Presentation deploying cloud based services
Presentation   deploying cloud based servicesPresentation   deploying cloud based services
Presentation deploying cloud based services
 
Open v ran
Open v ranOpen v ran
Open v ran
 
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
Dataplane networking acceleration with OpenDataplane / Максим Уваров (Linaro)
 
Deep Dive On Intel Optane SSDs And New Server Platforms
Deep Dive On Intel Optane SSDs And New Server PlatformsDeep Dive On Intel Optane SSDs And New Server Platforms
Deep Dive On Intel Optane SSDs And New Server Platforms
 
Ceph Day New York 2014: Ceph, a physical perspective
Ceph Day New York 2014: Ceph, a physical perspective Ceph Day New York 2014: Ceph, a physical perspective
Ceph Day New York 2014: Ceph, a physical perspective
 
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStorDelivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
 
3. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 20133. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 2013
 
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptxQ1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
 
POLYTEDA PowerDRC/LVS overview
POLYTEDA PowerDRC/LVS overviewPOLYTEDA PowerDRC/LVS overview
POLYTEDA PowerDRC/LVS overview
 
2014/09/02 Cisco UCS HPC @ ANL
2014/09/02 Cisco UCS HPC @ ANL2014/09/02 Cisco UCS HPC @ ANL
2014/09/02 Cisco UCS HPC @ ANL
 
Q1 Memory Fabric Forum: VMware Memory Vision
Q1 Memory Fabric Forum: VMware Memory VisionQ1 Memory Fabric Forum: VMware Memory Vision
Q1 Memory Fabric Forum: VMware Memory Vision
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
 
Oow2007 performance
Oow2007 performanceOow2007 performance
Oow2007 performance
 
Supermicro X12 Performance Update
Supermicro X12 Performance UpdateSupermicro X12 Performance Update
Supermicro X12 Performance Update
 
DPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingDPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet Processing
 
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral ProgramBig Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
 

More from inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 

Lustre File System on ARM

  • 1. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 1 Lustre File System on ARM September 2017 Architecture Evaluation v1.1 Carlos Thomaz
  • 2. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 2 Motivations ▶ ARM momentum • 64bit evolution • Recent debuts on HPC • Traction in new areas such as Machine Learning and AI ▶ Another option in the market • Intel established as de-facto standard • Market needs competitors; cost reduction ▶ Technical reasons • Potential high bandwidth, high throughput processor • Low power consumption option • The Technical challenge
  • 3. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 3 The Cavium ThunderX Architecture ▶ SoC architecture • ISA: ARMV8 root@s167:/proc# lscpu Architecture: aarch64 Byte Order: Little Endian CPU(s): 96 On-line CPU(s) list: 0-95 Thread(s) per core: 1 Core(s) per socket: 48 Socket(s): 2 NUMA node(s): 2 L1d cache: 32K L1i cache: 78KL2 cache: 16384K NUMA node0 CPU(s): 0-47 NUMA node1 CPU(s): 48-95
  • 4. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 4 ARM ThunderX and Intel Xeon
  • 5. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 5 ARM Ecosystem Courtesy of ARM – http://arm.com/hpc
  • 6. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 6 DDN Goals evaluating ARM ▶ Understand if it is a viable option for mid/long term future products ▶ Understand what’s the effort necessary to make Lustre running optimally on ARM (client and server-side) ▶ Understand how Lustre and general I/O behaves on ARM SoC architecture ▶ Contribute to the community
  • 7. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 7 Test Environment used for the study IB-SRP FDR (56Gbps) SFA7700KX Block Flash Array ES7K IB Embedded NL-SAS 7.2K IB Switch 36p IBFDR56Gbps IBFDR56Gbps IB FDR (56Gbps) s164 Es7k-vm1 Es7k-vm1 s165 s166 s167 IB-SRPFDR(56Gbps) IB FDR IB FDR IB FDR IBFDR IBFDR IBFDR s161 s162 s163
  • 8. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 8 Test environment ▶ 4 x Gigabyte, Cavium ThunderX2 ARM servers • 128GB RAM, 3 x 40GbE | 4 x 10GbE, 1 x IB FDR 56Gbps ▶ 1 x SFA7700-IB (ib-srp) • Full flash array, 8 x RAID6 LUNs (200GB SSDs) ▶ 1 x ES7KE-IB (Intel based, DDN appliance) • Embedded Lustre appliance, 2 controllers, 8 RAID6 pools (OSTs), 2 SSD RAID1 pools for MDT ▶ 3 x DELL R620 servers • 2 sockets, 12 cores total, 64GB RAM
  • 9. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 9 Lustre File System configuration ▶ ARM Servers and Clients • OS: Ubuntu 16.04.03 LTS – Xenial Xerus • Kernel: Linux s166 4.4.0-31-generic #50-Ubuntu SMP Wed Jul 13 00:06:30 UTC 2016 aarch64 aarch64 aarch64 GNU/Linux • Lustre: 2.10.0.0 + patches o LU-9950, LU-9951, LU-9564 (backported for Ubuntu/debian) ▶ X86 clients • OS: CentOS Linux release 7.2.1511 (Core) • Kernel: Linux s162 3.10.0-327.el7.x86_64 #1 SMP Thu Nov 19 22:10:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux • Lustre: 2.10.0.0 ▶ ES7K Embedded Lustre Server • OS: CentOS Linux release 7.3.1611 (Core) • Kernel: Linux vm01-es7k01 3.10.0-514.21.2.el7.x86_64.lustre #1 SMP Wed Jun 21 03:34:21 PDT 2017 x86_64 x86_64 x86_64 GNU/Linux • Lustre: DDN Lustre 2.7.x + Patches
  • 10. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 10 10 Stand alone ARM servers Baseline Performance
  • 11. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 11 Single ARM Server – first glimpse Memory Bandwidth (stream) ARM# gcc-6 –O3 -march=ARMv8.1-a -fopenmp -mcmodel=large -DSTREAM_ARRAY_SIZE=2600000000 -Wall stream.c -o stream_h DELL# gcc -Ofast -fopenmp stream.c -Wall -m64 -mcmodel=medium -DSTREAM_ARRAY_SIZE=1100000000 -o stream_h 0 20000 40000 60000 80000 100000 120000 140000 160000 180000 200000 Copy Scale Add Triad Memory bandwidth - Absolute results ARM DELL Project ThunderX2 0 500 1000 1500 2000 2500 3000 3500 4000 Copy Scale Add Triad Memory bandwidth - Normalized (per core) results ARM DELL Project ThunderX2
  • 12. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 12 IB RDMA Network test with IB_SEND_RW Sanity tests root@s167:~# ib_send_bw -a -b -c UC -z 192.168.0.185 root@s165:~# ib_send_bw -a -b -c UC –z ************************************ *.Waiting for client to connect... * ************************************ --------------------------------------------------------------------------------------- Send Bidirectional BW Test Dual-port : OFF Device : mlx4_0 Number of qps : 1 Transport type : IB Connection type : UC Using SRQ : OFF TX depth : 128 RX depth : 1000 CQ Moderation : 100 Mtu : 2048[B] Link type : IB Max inline data : 0[B] rdma_cm QPs : OFF Data ex. method : rdma_cm --------------------------------------------------------------------------------------- local address: LID 0x25 QPN 0x0255 PSN 0xfe41b5 remote address: LID 0x26 QPN 0x47d1 PSN 0x8db89f --------------------------------------------------------------------------------------- #bytes #iterations BW peak[MB/sec] BW average[MB/sec] MsgRate[Mpps] 2 1000 9.01 7.98 4.185542 1000 16.95 16.42 4.303677 1000 33.17 32.75 4.292886 1000 66.34 65.39 4.285470 32 1000 132.68 130.73 4.283882 64 1000 265.36 262.16 4.295241 <SNIP> 131072 1000 8250.48 8244.47 0.065956 262144 1000 8263.43 8256.41 0.033026 524288 1000 8252.15 8246.37 0.016493 1048576 1000 8256.10 8248.31 0.008248 2097152 1000 8254.87 8251.13 0.004126 4194304 1000 8256.12 8251.63 0.002063 8388608 1000 8138.98 8138.19 0.001017 ---------------------------------------------------------------------------------------
  • 13. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 13 ARM Server – point to point IB BW MPI OSU BW and BIBW root@s165:/mnt/lustre# mpirun_rsh -hostfile /mnt/lustre/bin/mach -n 2 /mnt/lustre/mvapich/bin/osu_bw # OSU MPI Bandwidth Test v5.3.2 # Size Bandwidth (MB/s) 1 0.22 2 0.84 4 2.14 8 4.28 16 8.56 32 17.05 64 33.76 128 65.66 256 115.18 512 245.05 1024 399.22 2048 735.48 4096 1090.75 8192 1235.69 16384 2838.55 32768 4864.02 65536 4688.12 131072 5270.83 262144 5333.59 524288 5297.62 1048576 5387.66 2097152 5400.73 4194304 5415.79 root@s165:/mnt/lustre# mpirun_rsh -hostfile /mnt/lustre/bin/mach -n 2 /mnt/lustre/mvapich/bin/osu_bibw # OSU MPI Bi-Directional Bandwidth Test v5.3.2 # Size Bandwidth (MB/s) 1 0.28 2 1.25 4 2.51 8 4.97 16 4.02 32 19.13 64 38.35 128 72.64 256 142.76 512 269.86 1024 519.32 2048 872.59 4096 1261.87 8192 1449.39 16384 2608.61 32768 4686.06 65536 7477.37 131072 8310.94 262144 8436.23 524288 8539.24 1048576 8597.96 2097152 8624.46 4194304 8472.55 Unidirectional BW Bidirectional BW
  • 14. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 14 Single ARM server – storage backend ▶ Simple test to evaluate storage backend – FIO • 1 x ARM server connected to SFA7700X via IB-SRP (FDR) root@s165:/sys/block# fio --name=foo --rw=read --bs=1m --runtime=30 --time_based --ioengine=libaio --iodepth=64 --direct=1 --numjobs=8 -- group_reporting --filename=/dev/sdb --filename=/dev/sdc foo: (g=0): rw=read, bs=(R) 1024KiB-1024KiB, (W) 1024KiB-1024KiB, (T) 1024KiB-1024KiB, ioengine=libaio, iodepth=64 ... fio-3.0 Starting 8 processes Jobs: 8 (f=16): [R(8)][100.0%][r=5260MiB/s,w=0KiB/s][r=5260,w=0 IOPS][eta 00m:00s] foo: (groupid=0, jobs=8): err= 0: pid=37191: Tue Sep 26 18:51:52 2017 read: IOPS=5426, BW=5427MiB/s (5690MB/s)(159GiB/30074msec) slat (usec): min=191, max=101843, avg=1202.10, stdev=5434.79 clat (usec): min=342, max=562238, avg=92961.48, stdev=54905.53 lat (usec): min=807, max=562616, avg=94164.77, stdev=55509.54 clat percentiles (msec): | 1.00th=[ 7], 5.00th=[ 17], 10.00th=[ 31], 20.00th=[ 48], | 30.00th=[ 63], 40.00th=[ 75], 50.00th=[ 86], 60.00th=[ 96], | 70.00th=[ 112], 80.00th=[ 136], 90.00th=[ 165], 95.00th=[ 190], | 99.00th=[ 251], 99.50th=[ 284], 99.90th=[ 510], 99.95th=[ 527], | 99.99th=[ 550] bw ( KiB/s): min=153600, max=983040, per=12.50%, avg=694524.80, stdev=105260.97, samples=480 iops : min= 150, max= 960, avg=678.12, stdev=102.78, samples=480 lat (usec) : 500=0.01%, 750=0.01%, 1000=0.01% lat (msec) : 2=0.06%, 4=0.31%, 10=1.87%, 20=3.92%, 50=15.35% lat (msec) : 100=41.89%, 250=35.57%, 500=0.90%, 750=0.11% cpu : usr=0.34%, sys=31.73%, ctx=19114, majf=0, minf=21757 <SNIP> Run status group 0 (all jobs): READ: bw=5427MiB/s (5690MB/s), 5427MiB/s-5427MiB/s (5690MB/s-5690MB/s), io=159GiB (171GB), run=30074-30074msec Disk stats (read/write): sdb: ios=55827/0, merge=55472/0, ticks=2579464/0, in_queue=2582500, util=94.10% sdc: ios=53850/0, merge=57636/0, ticks=2780888/0, in_queue=2793204, util=95.68%
  • 15. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 15 15 Part 1 – ARM Server
  • 16. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 16 0 500 1000 1500 2000 2500 3000 3500 4000 1 2 4 8 12 16 24 32 64 IOR - Single Client Performance - 4MB RPCs /mnt/arm/bin/ior.arm.mvapich -a POSIX -b 1g -r -w -F -B -t 4m -o /mnt/arm/file.out Writes 4M - Real IO Reads 4M - Real IO IOR Single Client Performance – Multiple Threads
  • 17. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 17 IOR Single Client Performance – Multiple Threads 0 1000 2000 3000 4000 5000 6000 1 2 4 8 12 16 24 32 64 IOR Single Client Performance - 4MB RPCs - REGULAR vs FAKE IO /mnt/arm/bin/ior.arm.mvapich -a POSIX -b 1g -r -w -F -B -t 4m -o /mnt/arm/file.out Writes 4M - Fake IO Reads 4M - Fake IO Writes 4M - Real IO Reads 4M - Real IO
  • 18. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 18 IOR Results – end to end multiple clients (Real I/O) 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 2 4 6 8 10 12 14 16 24 32 48 64 96 128 Multiple Client Perofmance - 2 to 128 threads, 16MB RPC Command line used: /mnt/arm/bin/ior.arm.mvapich -a POSIX -b 1g -r -w -B -F -t 16m - o /mnt/arm/file-out -vv Reads (4MB) Writes (4MB) Reads (16MB) Writes (16MB)
  • 19. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 19 X86 clients against Lustre ARM server IOR Sequential Performance 0 1000 2000 3000 4000 5000 6000 0 5 10 15 20 25 30 35 40 Multiple Client IOR Performance - x86 Clients against ARM Server /mnt/arm/bin/ior.x86.mvapich -a POSIX -b 1g -r -w -F -B -t 16m -o /mnt/arm/file.out x86 Writes (16MB) x86 Reads (16MB)
  • 20. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 20 ARM and x86 Clients comparison IOR, multiple clients - Sequential 0 1000 2000 3000 4000 5000 6000 0 10 20 30 40 50 60 70 ARM and x86 Clients - IOR Sequential Reads / Writes (ARM Server) ARM Writes (16MB) ARM Reads (16MB) x86 Writes (16MB) x86 Reads (16MB)
  • 21. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 21 Sniplet from brw_stats during a set of runs (2 to 128 threads) Ltest-OST0000 <snip> read | write disk I/O size ios % cum % | ios % cum % 4K: 127 0 0 | 1 0 0 8K: 146 0 1 | 0 0 0 16K: 403 2 4 | 0 0 0 32K: 681 4 8 | 0 0 0 64K: 1590 10 19 | 0 0 0 128K: 1565 10 29 | 0 0 0 256K: 631 4 33 | 0 0 0 512K: 89 0 34 | 0 0 0 1M: 9905 65 100 | 169184 99 100 Ltest-OST0001 <snip> read | write disk I/O size ios % cum % | ios % cum % 4K: 44 0 0 | 1 0 0 8K: 44 0 0 | 0 0 0 16K: 119 0 1 | 0 0 0 32K: 245 1 3 | 0 0 0 64K: 461 3 7 | 0 0 0 128K: 452 3 10 | 0 0 0 256K: 148 1 12 | 0 0 0 512K: 30 0 12 | 0 0 0 1M: 10940 87 100 | 168096 99 100 Very much the same for all other OSTs ltest-OST000[0-6]
  • 22. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 22 22 Part II – ARM Clients
  • 23. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 23 Single Client Performance comparison 0 1000 2000 3000 4000 5000 6000 1 2 4 6 8 10 12 16 24 32 64 72 96 Single Client Performance (ARM x x86) - ES7K /mnt/arm/bin/ior.x86.mvapich -a POSIX -b 1g -r -w -F -B -t 16m -o /mnt/es7k/file.out x86 Writes (16MB) ARM Writes (16MB) x86 Reads (16MB) ARM Reads (16MB)
  • 24. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 24 Multiple Client performance comparison 0 1000 2000 3000 4000 5000 6000 7000 2 3 4 6 8 9 12 15 16 18 21 24 27 30 32 33 36 64 Multiple Client Performance (ARM x x86) - ES7K /mnt/arm/bin/ior.x86.mvapich -a POSIX -b 1g -r -w -F -B -t 16m -o /mnt/es7k/file.ou x86 Writes (16MB) ARM Writes (16MB) x86 Reads (16MB) ARM Reads (16MB)
  • 25. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 25 25 Preliminary Conclusions
  • 26. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 26 ARM Server - RAW vs Lustre ▶ RAW performance indicates the ARM systems could potentially sustain high bandwidth • We achieved about 7GB/sec reading/writing into and from a Flash based storage that is capable of doing 10GB/sec I/O. • The bottleneck is the IB-FDR used with IB-SRP as connection • Concurrent Infiniband traffic also performs well. Tests executed demonstrated about 6GB/sec unidirectional BW and about 9GB/sec bi-directional on both IB_RDMA calls (ib_send_bw) and also on MPI layer. • Memory bandwith per core is much lower than other x86 architecture that probably will affect Lustre IO too.
  • 27. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 27 ”Noise” - Unpredictability on the Server side ▶ We observed noise and unpredictable server behavior when scaling up the IO workload thus increasing the number of OSS service threads. • Such behavior is related to the highly scalable number of cores on two NUMA domains. • Changing LNET partitions plays a little but yet visible effect on server performance. • Lustre PIO _should_ help since the effects we are seeing on ARM servers are similar to KNL nodes (high core count, low frequency) – Avoiding serialization should help. ▶ The best numbers are observed when using 24 to 32 cores • More than 32 cores causes noisy and the results become unpredictable. This effect is known, specially on high count core SoC architecture. • No L3 cache line and all coherent helps to minimize the effect • 4 LNET partitions seems to be optimal for the tested CPU
  • 28. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 28 Server Performance ▶ Reads seems reasonable, writes needs improvement • Lustre back-end write performance is limited to 3- 3.5GB/sec o It’s about 50% of RAW I/O performance o Client concurrency slow down to 2-2.5GB/sec – Increasing the default number of OSS service threads didn’t take much effect (default 360). • Lustre back-end read performance seems to be max out to 5-5.5GB/sec o Compared to other Lustre back-end, Read performance seems good. o Ext4 can provide maximum of 6-6.5GB/sec (for this test environment).
  • 29. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 29 Minimizing NUMA effects root@s165:~# cat /proc/sys/lnet/cpu_partition_table 0 : 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 1 : 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 2 : 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 3 : 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 root@s165:~# cat /etc/modprobe.d/lustre.conf options lnet networks=o2ib(ib0)options libcfs cpu_npartitions=4options libcfs cpu_pattern="” Change LNET partition table - Initially set to 8 partitions, brought the inflexion point lower - 4 partitions was the setting that provides better and more reliable performance
  • 30. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 30 ARM Lustre Clients ▶ Overall Performance equivalent to OLD Xeons, but likely to be half of the current ones. • 24 core ARM matches the 12 core Haswells (reads and writes) • Ability to write faster on an optimized DDN ES7K also helps to blame ARM server for lower numbers ▶ Similar type of NUMA issues found on client, but harder to understand and tune. • Benefits of LNET partitions and other NUMA tuning still not clear • Applications can probably have better behavior using numactl
  • 31. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 31 Lustre ▶ Build procedure required three patches • LU-9950 and LU-9951 o Build process, not really a Lustre code change o Patches on JIRA • LU-9564 backported (in order to build server on Ubuntu/debian) • Not very complicate, but require some cleanup in the process (built on Ubuntu - caused some library incompatibilities) ▶ The process overall is easy and straight forward
  • 32. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 32 What next? ▶ Study still in very preliminary stage ▶ More research on the server side • We are interesting on alternatives for the current offerings • Evaluate SoC features for better utilization (crypto, RAID engine, virtualization) • Profile IO and general workloads ▶ Need to test 40GbE • Native chips and embedded Switch on SoC is supposedly to deliver better I/O balance (opposed to utilization of single IB card) ▶ Experiment in larger scale • Looking for large environments willing to cooperate ▶ Lustre side • P0: Run tests with PIO and compare results • Profile writes
  • 33. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 33 33 Thank you Carlos Thomaz Thanks for the help from Frank Leers, Gu Zheng and rest of the team.
  • 34. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 34 34 Extra slides
  • 35. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 35 Building Lustre root@s164:~ git clone http://kernel.ubuntu.com/git-repos/ubuntu/ubuntu-xenial.git/ ubuntu-kernel root@s164:~/ubuntu-kernel# uname -r 4.4.0-93-generic root@s164:~/ubuntu-kernel# git tag | grep 4.4.0-93 Ubuntu-4.4.0-93.116 root@s164:~/ubuntu-kernel# git checkout Ubuntu-4.4.0-93.116 ▶ Prepare kernel source ▶ Configure Kernel source root@s164:~/ubuntu-kernel# touch .scmversion root@s164:~/ubuntu-kernel# cp /boot/config-`uname -r` .config root@s164:~/ubuntu-kernel# cp /usr/src/linux-headers-`uname -r`/Module.symvers ▶ Submitted patches in JIRA • https://review.whamcloud.com/#/c/27323/ • https://jira.hpdd.intel.com/browse/LU-9950 • https://jira.hpdd.intel.com/browse/LU-9951
  • 36. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 36 Building Lustre ▶ Patch Makefile root@s164:~/ubuntu-kernel# git diff diff --git a/Makefile b/Makefile index f1fee0c..5f235dc 100644 --- a/Makefile +++ b/Makefile @@ -1,7 +1,8 @@ VERSION = 4 PATCHLEVEL = 4 -SUBLEVEL = 79 -EXTRAVERSION = +SUBLEVEL = 0 +EXTRAVERSION = -93-generic + NAME = Blurry Fish Butt # *DOCUMENTATION* root@s164:~/ubuntu-kernel# make modules_prepare ▶ Patch Lustre • LU-9950, LU-9951, review.whamcloud.com/#/c/27323/ ▶ Build Lustre bash autogen.sh && ./configure --enable-server --enable-ldiskfs --with-zfs=no --with-o2ib=/usr/src/ofa_kernel/default/ --with-linux=/root/ubuntu-kernel/ --enable-module && make debs
  • 37. ddn.com© 2016 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 37 Installing e2fsprogs ▶ Build and replace e2fsprogs git clone git://git.hpdd.intel.com/tools/e2fsprogs.git cd e2fsprogs git checkout v1.42.13.wc6 -b v1.42.13.wc6 wget -P ../ http://archive.ubuntu.com/ubuntu/pool/main/e/e2fsprogs/e2fsprogs_1.42.13-1ubuntu1.debian.tar.xz tar --exclude "debian/changelog" -xf ../e2fsprogs_1.42.13-1ubuntu1.debian.tar.xz sed -i 's/ext2_types-wrapper.h$//g' lib/ext2fs/Makefile.in dpkg-buildpackage -b -us -uc dpkg -i libcomerr2_1.42.13-1_arm64.deb libss2_1.42.13-1_arm64.deb e2fsck-static_1.42.13-1_arm64.deb e2fslibs_1.42.13-1_arm64.deb e2fsprogs_1.42.13-1_arm64.deb