Tau Leng, Ph.D. VP, HPC July, 2014 @ GTC Tokyo

–
GPU
Tau Leng, Ph.D.
VP, HPC
July, 2014 @ GTC Tokyo
Confidential © Supermicro 2014
Supermicro
Tokyo, Japan
:
80
:
1993
21
:
:
,
:
, VAR, OEM
,
,
Performance
Energy Efficiency
2014, < 0.33 MWatts/PFLOP
EFLOPs
HISTORICAL CONTEXT 1.5 MWatts /PFLOP
PetaFLOPs
Challenge
PFLOPs
Scalability and
Performance / $$$
TeraFLOPs
Challenge
TFLOPs
10K Watts /TFLOP
Challenge
GigaFLOPs
Challenge
GFLOPs
Efficiency & Density
Performance / Watt / FT²
SMP / MPP
MFLOPs
1990
Hybrid System
CPU + GPU
Proprietary Solutions
1995
Commodity Components
App. Optimized
General Purpose, off the shelf
PC cluster
Blades or High Density, High
Efficiency Servers
2000
2005
2010
2015
Top500 / Green500
GPU/
Top500
Source: Top500 / Green500
 HPC
: > 80%
/
HPC
 Moving to the Top:Green500
/
(IDC: 2011
GPU
28.8%
2013
)

TSUBAME-KFC: 1
4 MFLOPS
 Wide Adoption: GPU
(
Green500
76.9%
17
): 2014
6
GPU/
(VDI
)
GPU
HPC
/
•
•
•
•
•
•
•
•
&
•
•
•
•
•
•
• EDA
(CFD)
超並列
アーキテクチャー
により科学・工学用
•
• Excel
•
アプリケーションを
高速化
OLAP
•
•
•
&
&
•
•
•
•
•
•
&
&
(
)
/
/
Supermicro
GPU
3 GPUs Blade
Mainstream
4U 8-GPUs
Density
Efficiency
GPGPU
Where it
started…
Hybrid
Computing
Pioneer
FatTwin™ 2-node
6 GPUs or MICs per
node
7U GPU Blades
20 CPUs + 20 GPUs
Ultra High
Efficiency
The fastest 1U server
in the world
Telsa S1070
FatTwin™ 4-node
3 GPUs per node
4 GPUs Workstation /
4U
2U 4-GPU
1U 4-GPU
Standalone box
2U GPU w/ QDR
IB onboard
PCI-E x16
1U Twin™
X9 (UP) 1U 2-GPUs
1U 3-GPU
2U Twin
The most
powerful PSC
2U 6-GPUs
1U 4-GPUs
2008
2009
2010
2011
2012
2013
2014
GPU
3U
 SuperServer
(
)
1U/2U 2CPU –
2
4
3
7047GR-TPRF
FatTwin
2
6
4
1U 1CPU –
,
–
1027GR-TQF
1027GR-TQFT
2027GR-TRFH
2027GR-TRFHT
1017GR-TF
3
4
2
5017GR-TF
1027GR-TRF
1027GR-TSF
1027GR-TRFT
2027GR-TRF
2027GR-TRFT
6037R-72RFT+
2

1
(
5037A-iL
2
5037A-i
SuperBlade
)
1
7037A-iL
2
7037A-i
2
7047A-T
4
7047GR-TRF
&


GPU GRID
,
NVIDIA GRID
,
GPU
GPU
GPU

 PCIe
PCB
 CPU
I/O







&
−
高効率 1800W 電源
x16
x16
x16
x16
GPU
CPU/GPU
PEARL




95%
&
& BBP(
)

(
)
Efficiency


/
(95%+)
Max. Power Requirements
2000w
1800w
1600w
1400w
1200w
1000w
800w
600w
400w
200w
No GPU
電源負荷
20%
1 GPU
2 GPUs
40%
3 GPUs
4 GPUs
60%
5 GPUs
80%
(95+%)
*
6 GPUs


(240W ~ 2000W)
1+1 or 2+1

&

/
&







&
GPU Workstation / 4U Server
(




/
)
Rack DCLC AHx™ -
Multiple
Rack CHx
CoolIT®
AHx
•二重化されたファン
•集中型ポンプアーキテクチャー
•CoolIT コマンドセンターによる
水冷システム状況の監視
•PDUの様な形状のスチールボディー
•金属製ドライブレークのクイックコネクト
•パッシブコールドプレート技術
•金属製ドライブレークのクイックコネクト
FatTwin™ GPU
CPU

: FatTwin GPU
3 GPU

1U, 2U, 4U… GPU
 CPU

GPU
4U 4
–
GPU
–

 SSD

“GPU
Supermicro Servers”
&
BIOS

Supermicro 1U


PUE ~ 1.12

25
(
–
)
~25kW
KW / rack
Green500 #1
Green500
GPU
17
http://www.supermicro.com/products/nfo/Green500.cfm
Top500 #311 (~4.5GFLOPS
)
– GPU /




GPU
Engineering Challenge


,
Supermicro
GPU
GPU
(http://www.supermicro.com/GPU/)
Tau Leng, Ph.D.
[email protected]