Child pages
  • Benchmark Mar_26_2018
Skip to end of metadata
Go to start of metadata

References

Tensorflow Official Benchmarks (May 2017, GitHub source): https://www.tensorflow.org/performance/benchmarks

IBM Power9 benchmark results (Nov 2017, 1.4.0): https://developer.ibm.com/linuxonpower/perfcol/perfcol-mldl/

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour, Facebook (Jun 2017): https://research.fb.com/wp-content/uploads/2017/06/imagenet1kin1h5.pdf

Benchmark Source Code

https://github-dev.cs.illinois.edu/kindrtnk/DL

 

Official TF Benchmark System Characteristics

  • Instance type: NVIDIA® DGX-1™
  • GPU: 8x NVIDIA® Tesla® P100
  • OS: Ubuntu 16.04 LTS with tests run via Docker
  • CUDA / cuDNN: 8.0 / 5.1
  • TensorFlow GitHub hash: b1e174e
  • Benchmark GitHub hash: 9165a70
  • Build Command: bazel build -c opt --copt=-march="haswell" --config=cuda //tensorflow/tools/pip_package:build_pip_package
  • Disk: Local SSD
  • DataSet: ImageNet
  • Test Date: May 2017

Our System Characteristics (more details in GitHub Repo)

  • Instance type: IBM Power9, 8335-GTG AC922 server
  • CPU: 2x 20-core IBM POWER9 CPU @ 2.00GHz
  • SDRAM: 512G DDR4
  • GPU: 4x NVIDIA® Tesla® V100, 5120 cores, 16 GB HBM 2
  • Disk: Local SSD
  • OS: Red Hat Enterprise Linux Server release 7.4
  • Python Distribution: Anaconda python 3.6.2
  • CUDA / cuDNN: 9.1/7.0.5
  • TensorFLow Version: 1.5.0
  • DataSet: ImageNet (synthetic)
  • Precision: floating point 32 and 16
  • Test Date: Mar 25 2018

The following table is the result of running with the same configurations as the official Tensorflow benchmark mentioned in "Reference" section above:

 base_modelFP_typebatch_sizen_gpusvariable_updatelocal_parameter_deviceimage_per_sec
0alexnet165121replicatedN/A8092.12
1alexnet165122replicatedN/A15289.09
2alexnet165124replicatedN/A25295.36
3alexnet325121replicatedN/A4692.38
4alexnet325122replicatedN/A8993.17
5alexnet325124replicatedN/A15868.53
6inception316641parameter_servercpu_parameterDevice423.90
7inception316642parameter_servercpu_parameterDevice823.06
8inception316644parameter_servercpu_parameterDevice1561.19
9inception332641parameter_servercpu_parameterDevice245.36
10inception332642parameter_servercpu_parameterDevice472.07
11inception332644parameter_servercpu_parameterDevice918.71
12resnet15216641parameter_servercpu_parameterDevice283.59
13resnet15216642parameter_servercpu_parameterDevice531.70
14resnet15216644parameter_servercpu_parameterDevice975.86
15resnet15232641parameter_servercpu_parameterDevice148.61
16resnet15232642parameter_servercpu_parameterDevice287.02
17resnet15232644parameter_servercpu_parameterDevice567.19
18resnet5016641parameter_servercpu_parameterDevice618.52
19resnet5016642parameter_servercpu_parameterDevice1174.06
20resnet5016644parameter_servercpu_parameterDevice2294.24
21resnet5032641parameter_servercpu_parameterDevice364.54
22resnet5032642parameter_servercpu_parameterDevice719.03
23resnet5032644parameter_servercpu_parameterDevice1402.16
24vgg1616641replicatedN/A405.58
25vgg1616642replicatedN/A766.43
26vgg1616644replicatedN/A1264.35
27vgg1632641replicatedN/A234.22
28vgg1632642replicatedN/A451.64
29vgg1632644replicatedN/A809.41

 

This figure compares the result we get with Tensorflow official ones.

Green bars stand for our benchmark results using floating point 16.

Red bars are the official Tensorflow result.

Blue bars stand for our benchmark results using floating point 32.

This figure shows the performance ratio of our floating point 16 and 32 benchmarks with respect to Tensorflow official results:

The following table provides a more comprehensive benchmark results on our system:

 

 base_modelFP_typebatch_sizen_gpusvariable_updatelocal_parameter_deviceimage_per_sec
0alexnet165121parameter_serverN/A8176.13
1alexnet165121replicatedN/A8092.12
2alexnet165122parameter_serverN/A15724.07
3alexnet165122replicatedN/A15289.09
4alexnet165124parameter_serverN/A26709.08
5alexnet165124replicatedN/A25295.36
6alexnet325121parameter_serverN/A4645.89
7alexnet325121replicatedN/A4692.38
8alexnet325122parameter_serverN/A8994.28
9alexnet325122replicatedN/A8993.17
10alexnet325124parameter_serverN/A15563.43
11alexnet325124replicatedN/A15868.53
12inception316321parameter_servergpu_parameterDevice368.16
13inception316321parameter_servercpu_parameterDevice289.93
14inception316321replicatedcpu_parameterDevice345.22
15inception316321replicatedgpu_parameterDevice336.22
16inception316322parameter_servergpu_parameterDevice569.06
17inception316322parameter_servercpu_parameterDevice593.43
18inception316322replicatedcpu_parameterDevice610.20
19inception316322replicatedgpu_parameterDevice594.11
20inception316324parameter_servergpu_parameterDevice972.40
21inception316324parameter_servercpu_parameterDevice1030.18
22inception316324replicatedgpu_parameterDevice1076.02
23inception316324replicatedcpu_parameterDevice1064.68
24inception316641parameter_servercpu_parameterDevice423.90
25inception316641parameter_servergpu_parameterDevice440.50
26inception316641replicatedgpu_parameterDevice436.09
27inception316641replicatedcpu_parameterDevice436.16
28inception316642parameter_servercpu_parameterDevice823.06
29inception316642parameter_servergpu_parameterDevice811.04
30inception316642replicatedcpu_parameterDevice842.72
31inception316642replicatedgpu_parameterDevice848.60
32inception316644parameter_servercpu_parameterDevice1561.19
33inception316644parameter_servergpu_parameterDevice1502.41
34inception316644replicatedcpu_parameterDevice1701.83
35inception316644replicatedgpu_parameterDevice1605.17
36inception3161281replicatedno_parameterDevice491.12
37inception3161282replicatedno_parameterDevice972.62
38inception3161284replicatedno_parameterDevice1926.57
39inception3162561replicatedno_parameterDevice521.54
40inception3162562replicatedno_parameterDevice1032.37
41inception3162564replicatedno_parameterDevice2043.78
42inception332321parameter_servergpu_parameterDevice224.14
43inception332321parameter_servercpu_parameterDevice217.93
44inception332321replicatedcpu_parameterDevice224.14
45inception332321replicatedgpu_parameterDevice225.85
46inception332322parameter_servergpu_parameterDevice414.47
47inception332322parameter_servercpu_parameterDevice424.24
48inception332322replicatedcpu_parameterDevice431.49
49inception332322replicatedgpu_parameterDevice439.14
50inception332324parameter_servercpu_parameterDevice788.87
51inception332324parameter_servergpu_parameterDevice747.39
52inception332324replicatedcpu_parameterDevice856.59
53inception332324replicatedgpu_parameterDevice829.10
54inception332641parameter_servergpu_parameterDevice247.98
55inception332641parameter_servercpu_parameterDevice245.36
56inception332641replicatedcpu_parameterDevice247.98
57inception332641replicatedgpu_parameterDevice247.96
58inception332642parameter_servercpu_parameterDevice472.07
59inception332642parameter_servergpu_parameterDevice451.69
60inception332642replicatedgpu_parameterDevice491.18
61inception332642replicatedcpu_parameterDevice486.19
62inception332644parameter_servergpu_parameterDevice919.57
63inception332644parameter_servercpu_parameterDevice918.71
64inception332644replicatedgpu_parameterDevice963.25
65inception332644replicatedcpu_parameterDevice962.94
66resnet15216321parameter_servercpu_parameterDevice201.65
67resnet15216321parameter_servergpu_parameterDevice229.96
68resnet15216321replicatedgpu_parameterDevice218.05
69resnet15216321replicatedcpu_parameterDevice193.48
70resnet15216322parameter_servercpu_parameterDevice332.26
71resnet15216322parameter_servergpu_parameterDevice341.70
72resnet15216322replicatedcpu_parameterDevice329.15
73resnet15216322replicatedgpu_parameterDevice375.69
74resnet15216324parameter_servergpu_parameterDevice537.87
75resnet15216324parameter_servercpu_parameterDevice593.27
76resnet15216324replicatedcpu_parameterDevice664.84
77resnet15216324replicatedgpu_parameterDevice624.26
78resnet15216641parameter_servergpu_parameterDevice290.69
79resnet15216641parameter_servercpu_parameterDevice283.59
80resnet15216641replicatedcpu_parameterDevice287.45
81resnet15216641replicatedgpu_parameterDevice287.39
82resnet15216642parameter_servercpu_parameterDevice531.70
83resnet15216642parameter_servergpu_parameterDevice510.79
84resnet15216642replicatedcpu_parameterDevice562.41
85resnet15216642replicatedgpu_parameterDevice574.11
86resnet15216644parameter_servercpu_parameterDevice975.86
87resnet15216644parameter_servergpu_parameterDevice894.71
88resnet15216644replicatedcpu_parameterDevice1075.84
89resnet15216644replicatedgpu_parameterDevice945.07
90resnet152161281replicatedno_parameterDevice330.65
91resnet152161282replicatedno_parameterDevice648.43
92resnet152161284replicatedno_parameterDevice1288.11
93resnet15232321parameter_servercpu_parameterDevice131.59
94resnet15232321parameter_servergpu_parameterDevice137.46
95resnet15232321replicatedgpu_parameterDevice137.48
96resnet15232321replicatedcpu_parameterDevice137.46
97resnet15232322parameter_servercpu_parameterDevice252.06
98resnet15232322parameter_servergpu_parameterDevice258.02
99resnet15232322replicatedcpu_parameterDevice266.06
100resnet15232322replicatedgpu_parameterDevice269.09
101resnet15232324parameter_servercpu_parameterDevice475.65
102resnet15232324parameter_servergpu_parameterDevice428.43
103resnet15232324replicatedcpu_parameterDevice531.89
104resnet15232324replicatedgpu_parameterDevice510.74
105resnet15232641parameter_servercpu_parameterDevice148.61
106resnet15232641parameter_servergpu_parameterDevice153.30
107resnet15232641replicatedcpu_parameterDevice152.38
108resnet15232641replicatedgpu_parameterDevice153.30
109resnet15232642parameter_servergpu_parameterDevice297.53
110resnet15232642parameter_servercpu_parameterDevice287.02
111resnet15232642replicatedgpu_parameterDevice304.75
112resnet15232642replicatedcpu_parameterDevice302.79
113resnet15232644parameter_servercpu_parameterDevice567.19
114resnet15232644parameter_servergpu_parameterDevice546.67
115resnet15232644replicatedcpu_parameterDevice591.06
116resnet15232644replicatedgpu_parameterDevice587.99
117resnet152_v216321parameter_servercpu_parameterDevice210.39
118resnet152_v216321parameter_servergpu_parameterDevice234.22
119resnet152_v216321replicatedcpu_parameterDevice218.01
120resnet152_v216321replicatedgpu_parameterDevice225.86
121resnet152_v216322parameter_servercpu_parameterDevice319.55
122resnet152_v216322parameter_servergpu_parameterDevice357.69
123resnet152_v216322replicatedgpu_parameterDevice377.47
124resnet152_v216322replicatedcpu_parameterDevice384.85
125resnet152_v216324parameter_servercpu_parameterDevice652.52
126resnet152_v216324parameter_servergpu_parameterDevice550.14
127resnet152_v216324replicatedcpu_parameterDevice673.32
128resnet152_v216324replicatedgpu_parameterDevice632.01
129resnet152_v216641parameter_servergpu_parameterDevice294.10
130resnet152_v216641parameter_servercpu_parameterDevice290.34
131resnet152_v216641replicatedgpu_parameterDevice294.56
132resnet152_v216641replicatedcpu_parameterDevice294.12
133resnet152_v216642parameter_servergpu_parameterDevice532.29
134resnet152_v216642parameter_servercpu_parameterDevice542.82
135resnet152_v216642replicatedgpu_parameterDevice575.56
136resnet152_v216642replicatedcpu_parameterDevice574.35
137resnet152_v216644parameter_servercpu_parameterDevice1030.82
138resnet152_v216644parameter_servergpu_parameterDevice944.88
139resnet152_v216644replicatedgpu_parameterDevice1032.10
140resnet152_v216644replicatedcpu_parameterDevice1159.66
141resnet152_v2161281replicatedno_parameterDevice335.02
142resnet152_v2161282replicatedno_parameterDevice661.07
143resnet152_v2161284replicatedno_parameterDevice1296.60
144resnet152_v232321parameter_servercpu_parameterDevice132.99
145resnet152_v232321parameter_servergpu_parameterDevice139.09
146resnet152_v232321replicatedgpu_parameterDevice140.51
147resnet152_v232321replicatedcpu_parameterDevice138.95
148resnet152_v232322parameter_servergpu_parameterDevice252.95
149resnet152_v232322parameter_servercpu_parameterDevice257.58
150resnet152_v232322replicatedgpu_parameterDevice271.95
151resnet152_v232322replicatedcpu_parameterDevice271.85
152resnet152_v232324parameter_servergpu_parameterDevice447.54
153resnet152_v232324parameter_servercpu_parameterDevice484.61
154resnet152_v232324replicatedgpu_parameterDevice532.23
155resnet152_v232324replicatedcpu_parameterDevice532.04
156resnet152_v232641parameter_servergpu_parameterDevice155.19
157resnet152_v232641parameter_servercpu_parameterDevice151.34
158resnet152_v232641replicatedgpu_parameterDevice154.24
159resnet152_v232641replicatedcpu_parameterDevice154.24
160resnet152_v232642parameter_servercpu_parameterDevice298.92
161resnet152_v232642parameter_servergpu_parameterDevice295.79
162resnet152_v232642replicatedgpu_parameterDevice308.46
163resnet152_v232642replicatedcpu_parameterDevice306.41
164resnet152_v232644parameter_servercpu_parameterDevice557.58
165resnet152_v232644parameter_servergpu_parameterDevice526.94
166resnet152_v232644replicatedcpu_parameterDevice601.99
167resnet152_v232644replicatedgpu_parameterDevice595.10
168resnet5016321parameter_servercpu_parameterDevice486.00
169resnet5016321parameter_servergpu_parameterDevice521.21
170resnet5016321replicatedcpu_parameterDevice471.64
171resnet5016321replicatedgpu_parameterDevice516.20
172resnet5016322parameter_servergpu_parameterDevice824.17
173resnet5016322parameter_servercpu_parameterDevice891.88
174resnet5016322replicatedcpu_parameterDevice981.99
175resnet5016322replicatedgpu_parameterDevice953.31
176resnet5016324parameter_servercpu_parameterDevice1627.05
177resnet5016324parameter_servergpu_parameterDevice1502.83
178resnet5016324replicatedcpu_parameterDevice1834.62
179resnet5016324replicatedgpu_parameterDevice1598.16
180resnet5016641parameter_servercpu_parameterDevice618.52
181resnet5016641parameter_servergpu_parameterDevice641.96
182resnet5016641replicatedcpu_parameterDevice632.34
183resnet5016641replicatedgpu_parameterDevice638.60
184resnet5016642parameter_servercpu_parameterDevice1174.06
185resnet5016642parameter_servergpu_parameterDevice1221.21
186resnet5016642replicatedcpu_parameterDevice1245.83
187resnet5016642replicatedgpu_parameterDevice1239.34
188resnet5016644parameter_servercpu_parameterDevice2294.24
189resnet5016644parameter_servergpu_parameterDevice2199.00
190resnet5016644replicatedcpu_parameterDevice2489.61
191resnet5016644replicatedgpu_parameterDevice2376.21
192resnet50161281replicatedno_parameterDevice712.54
193resnet50161282replicatedno_parameterDevice1419.40
194resnet50161284replicatedno_parameterDevice2733.84
195resnet50162561replicatedno_parameterDevice749.41
196resnet50162562replicatedno_parameterDevice1477.06
197resnet50162564replicatedno_parameterDevice2932.09
198resnet5032321parameter_servercpu_parameterDevice324.01
199resnet5032321parameter_servergpu_parameterDevice336.18
200resnet5032321replicatedcpu_parameterDevice332.84
201resnet5032321replicatedgpu_parameterDevice332.86
202resnet5032322parameter_servergpu_parameterDevice642.03
203resnet5032322parameter_servercpu_parameterDevice621.98
204resnet5032322replicatedcpu_parameterDevice658.53
205resnet5032322replicatedgpu_parameterDevice648.60
206resnet5032324parameter_servergpu_parameterDevice1053.87
207resnet5032324parameter_servercpu_parameterDevice1201.09
208resnet5032324replicatedgpu_parameterDevice1246.13
209resnet5032324replicatedcpu_parameterDevice1282.85
210resnet5032641parameter_servercpu_parameterDevice364.54
211resnet5032641parameter_servergpu_parameterDevice371.99
212resnet5032641replicatedcpu_parameterDevice371.97
213resnet5032641replicatedgpu_parameterDevice372.02
214resnet5032642parameter_servercpu_parameterDevice719.03
215resnet5032642parameter_servergpu_parameterDevice722.70
216resnet5032642replicatedcpu_parameterDevice722.34
217resnet5032642replicatedgpu_parameterDevice736.44
218resnet5032644parameter_servergpu_parameterDevice1317.22
219resnet5032644parameter_servercpu_parameterDevice1402.16
220resnet5032644replicatedcpu_parameterDevice1424.48
221resnet5032644replicatedgpu_parameterDevice1424.51
222resnet50_v216321parameter_servercpu_parameterDevice510.22
223resnet50_v216321parameter_servergpu_parameterDevice537.68
224resnet50_v216321replicatedgpu_parameterDevice505.88
225resnet50_v216321replicatedcpu_parameterDevice516.14
226resnet50_v216322parameter_servergpu_parameterDevice953.22
227resnet50_v216322parameter_servercpu_parameterDevice942.64
228resnet50_v216322replicatedgpu_parameterDevice1011.77
229resnet50_v216322replicatedcpu_parameterDevice1000.48
230resnet50_v216324parameter_servercpu_parameterDevice1643.04
231resnet50_v216324parameter_servergpu_parameterDevice1517.55
232resnet50_v216324replicatedgpu_parameterDevice1703.01
233resnet50_v216324replicatedcpu_parameterDevice1778.21
234resnet50_v216641parameter_servercpu_parameterDevice647.49
235resnet50_v216641parameter_servergpu_parameterDevice655.20
236resnet50_v216641replicatedgpu_parameterDevice648.63
237resnet50_v216641replicatedcpu_parameterDevice655.02
238resnet50_v216642parameter_servercpu_parameterDevice1232.88
239resnet50_v216642parameter_servergpu_parameterDevice1200.12
240resnet50_v216642replicatedcpu_parameterDevice1283.14
241resnet50_v216642replicatedgpu_parameterDevice1297.01
242resnet50_v216644parameter_servergpu_parameterDevice2176.56
243resnet50_v216644parameter_servercpu_parameterDevice2347.95
244resnet50_v216644replicatedcpu_parameterDevice2553.59
245resnet50_v216644replicatedgpu_parameterDevice2492.30
246resnet50_v2161281replicatedno_parameterDevice733.20
247resnet50_v2161282replicatedno_parameterDevice1445.30
248resnet50_v2161284replicatedno_parameterDevice2771.44
249resnet50_v2162561replicatedno_parameterDevice766.46
250resnet50_v2162562replicatedno_parameterDevice1532.91
251resnet50_v2162564replicatedno_parameterDevice2997.32
252resnet50_v232321parameter_servercpu_parameterDevice332.66
253resnet50_v232321parameter_servergpu_parameterDevice347.81
254resnet50_v232321replicatedcpu_parameterDevice344.23
255resnet50_v232321replicatedgpu_parameterDevice345.27
256resnet50_v232322parameter_servergpu_parameterDevice672.42
257resnet50_v232322parameter_servercpu_parameterDevice650.97
258resnet50_v232322replicatedcpu_parameterDevice665.32
259resnet50_v232322replicatedgpu_parameterDevice683.57
260resnet50_v232324parameter_servercpu_parameterDevice1248.53
261resnet50_v232324parameter_servergpu_parameterDevice1101.25
262resnet50_v232324replicatedgpu_parameterDevice1258.33
263resnet50_v232324replicatedcpu_parameterDevice1303.18
264resnet50_v232641parameter_servercpu_parameterDevice371.81
265resnet50_v232641parameter_servergpu_parameterDevice383.19
266resnet50_v232641replicatedgpu_parameterDevice377.57
267resnet50_v232641replicatedcpu_parameterDevice383.30
268resnet50_v232642parameter_servercpu_parameterDevice732.40
269resnet50_v232642parameter_servergpu_parameterDevice747.31
270resnet50_v232642replicatedgpu_parameterDevice751.50
271resnet50_v232642replicatedcpu_parameterDevice751.11
272resnet50_v232644parameter_servergpu_parameterDevice1353.14
273resnet50_v232644parameter_servercpu_parameterDevice1423.04
274resnet50_v232644replicatedgpu_parameterDevice1445.11
275resnet50_v232644replicatedcpu_parameterDevice1444.34
276vgg1616321parameter_serverN/A379.42
277vgg1616321replicatedN/A379.52
278vgg1616322parameter_serverN/A722.64
279vgg1616322replicatedN/A676.75
280vgg1616324parameter_serverN/A991.95
281vgg1616324replicatedN/A1001.89
282vgg1616641parameter_serverN/A403.87
283vgg1616641replicatedN/A405.58
284vgg1616642parameter_serverN/A778.24
285vgg1616642replicatedN/A766.43
286vgg1616644parameter_serverN/A1264.47
287vgg1616644replicatedN/A1264.35
288vgg16161281replicatedN/A425.12
289vgg16161282replicatedN/A822.49
290vgg16161284replicatedN/A1466.10
291vgg16162561replicatedN/A396.79
292vgg16162562replicatedN/A778.23
293vgg16162564replicatedN/A1371.82
294vgg1632321parameter_serverN/A225.86
295vgg1632321replicatedN/A225.86
296vgg1632322parameter_serverN/A424.41
297vgg1632322replicatedN/A418.82
298vgg1632324parameter_serverN/A692.95
299vgg1632324replicatedN/A683.66
300vgg1632641parameter_serverN/A236.39
301vgg1632641replicatedN/A234.22
302vgg1632642parameter_serverN/A455.67
303vgg1632642replicatedN/A451.64
304vgg1632644parameter_serverN/A815.81
305vgg1632644replicatedN/A809.41

 

POWER8 (p8)

 

 

  • No labels