0% found this document useful (0 votes)

751 views18 pages

High Performance Computing in CST Studio Suite: Felix Wolfheimer

CST slides

Uploaded by

Pragash Sangaran

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

751 views18 pages

High Performance Computing in CST Studio Suite: Felix Wolfheimer

CST slides

Uploaded by

Pragash Sangaran

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

High Performance Computing

in
CST STUDIO SUITE
Felix Wolfheimer

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

GPU Computing Performance
GPU computing performance has
Speedup of Solver Loop been improved for CST STUDIO
18
SUITE 2014 as CPU and GPU
16 Promo offer for EUC resources are used in parallel.
14 participants:
25% discount for K40 cards
12
Speedup

GPU
10
8
CPU
6
CST STUDIO SUITE 2013
4
CST STUDIO SUITE 2014
2
0
0 1 2 3 4
Number of GPUs (Tesla K40)
Benchmark performed on system equipped with dual Xeon E5-2630 v2 (Ivy Bridge EP) processors, and four Tesla K40 cards. Model has 80 million mesh cells.

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

Typical GPU System Configurations
Entry level Professional level Enterprise level

Cluster system with high-

Workstation with 1 GPU card Workstation/server with speed interconnect.
multiple internal or
 Available "off the shelf“ High flexibility: Can
external GPU cards
 Good acceleration for handle extremely large
smaller models  Many configurations available models using MPI
 Limited model size  Good acceleration for medium Computing and also a lot
(depends on available GPU size and large models of parallel simulation
memory and features used)  Limited model size tasks using Distributed
(depends on available GPU Computing (DC)
memory and features used)  Administrative overhead
 Higher price
CST engineers are available to discuss with you which configuration makes sense for your applications and usage scenario.

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

MPI Computing — Area of Application
MPI Computing is a way to handle very large models efficiently
Some application examples for MPI Computing:

Electrically very large structures Extremely complex structures

(e.g. RCS calculation, lightning strike) (e.g.SI simulation for a full package)
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
MPI Computing — Working Principle
Subdomain boundary CST STUDIO SUITE®
Frontend

connects to

MPI Client Nodes

Domain decomposition is
shown in mesh view. High speed/low latency interconnection network (optional)

 Based on a domain decomposition of the simulation domain.

 Each cluster computer works on its part of the domain.
 Automatic load balancing ensures an equal distribution of the workload.
 It works cross-platform on Windows and Linux systems.
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
MPI Matrix Computation
The performance of the matrix computation step has been improved significantly for the
new version of CST STUDIO SUITE.

Performance Results (for two cluster nodes):*

Matrix Comp. Matrix Comp. Speedup Speedup
Model
Time/s (2013) Time/s (2014) (Matrix Comp.)** (Total Sim.)**

10,301 1,217 8.46 2.63

340M cells
Matrix computation is
CPU CPU single-threaded in case of
MPI up to version 2013.
12,921 4,018 3.22 1.85 Core Core

CPU CPU Version 2014 uses all

47M cells Core Core available cores on all
cluster nodes.

* =System configuration: Compute nodes are equipped with dual eight core Xeon E5-2650 processors, 4xK20 GPUs, and Infiniband FDR interconnect.
**=Speedup between version 2013 and 2014 of CST STUDIO SUITE.

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

MPI Calculation Example
2 GHz blade antenna positioned on aircraft

2 GHz
17.4 x 4.5 x 16.2 m
116 x 30 x 108 λ
375,840 λ3

660 million cells

4 node MPI cluster
4 Tesla K20 GPU on each node
Total of 16 GPUs with 6GB RAM at 60% Memory
Total memory: < 100 GB

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

MPI Calculation Example
2 GHz blade antenna positioned on aircraft

2 GHz
17.4 x 4.5 x 16.2 m
116 x 30 x 108 λ
375,840 λ3

660 million cells

4 node MPI cluster
4 Tesla K20 GPU on each node
Total of 16 GPUs with 6GB RAM at 60% Memory
Total memory: < 100 GB Broadband calculation time ~ 4h
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
Sub-Volume Monitors
Sub-volume monitors allow to record field data only in a region of interest allowing for a reduction of
data. This is especially important for large models which have hundreds of millions mesh cells.

Field data is only stored in the

sub-volume defined by the box

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

Distributed Computing
CST STUDIO SUITE®
Frontend

“Jobs” could be: DC Main Controller

 port excitations*
excitations
 frequency points*
points
 parameter variations connects to
 optimization iterations
*2 in parallel included
with standard license DC Solver Servers

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

 Model has 16 ports
 Only 8 ports need to be computed if defining symmetry conditions
 Distribute the 8 simulation runs to different solver servers with
GPU acceleration
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
DC Simulation Time Improvement
Speedup (total time)
30

25 CPU

1 GPU (Tesla 20)

20
Speedup

0
1 2 4 8
Number of DC Solver Servers

Dual Intel Xeon X5675 CPUs (3.06 GHz), fastest memory configuration, 1 Tesla 20 GPU
per node, 1 Gb Ethernet interconnect, 40 million mesh cells
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
DC Main Controller
The DC Main Controller gives you a complete overview about what is happening on your cluster.
Job Status

Machine Status
Essential resources (RAM usage
and disk space) are monitored
as well in the 2014 version.

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

GPU Assignment
Users who have
smaller jobs can start
multiple solver servers
and assign each GPU
to a separate server.
This allows for a more
efficient use of multi-
GPU hardware

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

Supported Acceleration Methods
Acceleration methods supported by the solvers of CST STUDIO SUITE.
Solver Multithreading GPU Computing Distributed Computing MPI Computing

on one
GPU card

Most other solvers support Multithreading and Distributed Computing for parameter sweeps and
optimization.
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
Choose the Right Acceleration Method
Number of
Solver Model Size Acceleration Technique
Simulations
Transient
below memory limit of GPU
low GPU Computing
hardware

Transient
below memory limit of GPU
medium/high GPU Computing on a DC Cluster (Distributed Excitations)
hardware

Transient
above memory limit of GPU
- MPI or combined MPI+GPU Computing
hardware

Frequency Domain
can be handled by a single
medium/high Distributed Computing (Distributed Frequency Points)
machine

Integral Equation
can't be handled by a single
- MPI Computing
machine

Integral Equation
can be handled by a single
medium/high Distributed Computing (Distributed Frequency Points)
machine

Parameter
n/a medium/high Distributed Computing
Sweep/Optimization

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

HPC in the Cloud
CST is working together with HPC hardware and service providers to enable easy
access to large computing power for challenging simulations which can't be run
on in-house hardware.
Users rent a CST license for the resources they need and pay the HPC provider
for the required hardware.
+
HPC system provider

Currently supported providers hosting CST STUDIO SUITE:

More information can be found in the HPC section of our website:

https://www.cst.com/Products/HPC/Cloud-Computing
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com
HPC Hardware Design Process
A general hardware recommendation is available on our website which helps you to
configure standard systems (e.g. workstations) for CST STUDIO SUITE.
For HPC systems (multi-GPU systems, clusters) our hardware experts are available to guide
you through the whole process of system design and benchmarking to ensure that your new
system is compatible with CST STUDIO SUITE and delivers the expected performance.
HPC System Design Process

Benchmarking of designed
Personal contact with CST computing solution in the Buy the machine if it fulfills your
engineers to design solution. hardware test center of the expectations.
preferred vendor.
CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

CST Microwave & RF Component Design
No ratings yet
CST Microwave & RF Component Design
4 pages
CST Magic Tee Workflow1
No ratings yet
CST Magic Tee Workflow1
19 pages
CST S2 2014 Final Web
No ratings yet
CST S2 2014 Final Web
24 pages
Low Freq Simulation 2013
No ratings yet
Low Freq Simulation 2013
33 pages
CST STUDIO SUITE - Thermal and Mechanical Simulation
No ratings yet
CST STUDIO SUITE - Thermal and Mechanical Simulation
78 pages
CST Studio Suite - High Frequency Simulation
No ratings yet
CST Studio Suite - High Frequency Simulation
116 pages
CTS MWS AdvancedTopics PDF
100% (1)
CTS MWS AdvancedTopics PDF
155 pages
Introduction to CST Software Basics
No ratings yet
Introduction to CST Software Basics
7 pages
Talk 5-3-1 CST Euc 2012
No ratings yet
Talk 5-3-1 CST Euc 2012
175 pages
Frequency Selective Surfaces Analysis
100% (1)
Frequency Selective Surfaces Analysis
28 pages
Microwave Power Divider Design Report
No ratings yet
Microwave Power Divider Design Report
44 pages
2013EMCTraining Lightning
No ratings yet
2013EMCTraining Lightning
35 pages
Talk 6-3-2 CST EUC 2012
No ratings yet
Talk 6-3-2 CST EUC 2012
43 pages
Radio Wave Propagation Basics
No ratings yet
Radio Wave Propagation Basics
40 pages
Patch Antenna Design with CST Studio
No ratings yet
Patch Antenna Design with CST Studio
6 pages
Recent Research Results by Using CST Microwave Studio at Antenna Lab., POSTECH
100% (1)
Recent Research Results by Using CST Microwave Studio at Antenna Lab., POSTECH
15 pages
Feeding Techniques for Microstrip Antennas
No ratings yet
Feeding Techniques for Microstrip Antennas
7 pages
Antenna and EM Modeling With MATLAB
No ratings yet
Antenna and EM Modeling With MATLAB
21 pages
Epoxy Hybrid Composites for EMI Shielding
No ratings yet
Epoxy Hybrid Composites for EMI Shielding
8 pages
Software Lab1
No ratings yet
Software Lab1
4 pages
CST Thermal1
No ratings yet
CST Thermal1
16 pages
CST Application Note Designing Phased Array Antenna
No ratings yet
CST Application Note Designing Phased Array Antenna
6 pages
Hybrid Multi Band Antenna Array Overview
No ratings yet
Hybrid Multi Band Antenna Array Overview
3 pages
CST Suite
No ratings yet
CST Suite
105 pages
CST Assignment
No ratings yet
CST Assignment
25 pages
CST Microwave Studio Overview
No ratings yet
CST Microwave Studio Overview
16 pages
Thermal Co Simulation CST
No ratings yet
Thermal Co Simulation CST
18 pages
Capstone Project Report
No ratings yet
Capstone Project Report
29 pages
Dual-Frequency Microstrip Antenna Design
No ratings yet
Dual-Frequency Microstrip Antenna Design
1 page
Mini 90° Hybrid Coupler for QPSK
100% (2)
Mini 90° Hybrid Coupler for QPSK
4 pages
180° Lumped Element Hybrid Design
No ratings yet
180° Lumped Element Hybrid Design
4 pages
Optical Ring-Coupler Simulation Using CST MICROWAVE STUDIO
No ratings yet
Optical Ring-Coupler Simulation Using CST MICROWAVE STUDIO
3 pages
Design of Frequency Selective Surface Radome Over A Frequency Range
No ratings yet
Design of Frequency Selective Surface Radome Over A Frequency Range
6 pages
Lab Experiment Manual Heterodyne & Superheterodyne Receiver (Lab Session 5)
0% (1)
Lab Experiment Manual Heterodyne & Superheterodyne Receiver (Lab Session 5)
31 pages
8-Port Power Combiner
No ratings yet
8-Port Power Combiner
21 pages
Magic Tee: Microwave Engineering Guide
No ratings yet
Magic Tee: Microwave Engineering Guide
6 pages
Lecture Rectangular Waveguide
No ratings yet
Lecture Rectangular Waveguide
34 pages
Inductive Output Tubes: A Comparative Analysis
No ratings yet
Inductive Output Tubes: A Comparative Analysis
9 pages
R. Ludwig and G. Bogdanov "RF Circuit Design: Theory and Applications" 2 Edition Figures For Chapter 1
0% (1)
R. Ludwig and G. Bogdanov "RF Circuit Design: Theory and Applications" 2 Edition Figures For Chapter 1
29 pages
Microstrip Patch Antenna - Basics
100% (3)
Microstrip Patch Antenna - Basics
133 pages
Frequency Selective Surfaces - Ansoft
100% (1)
Frequency Selective Surfaces - Ansoft
34 pages
SIW-Integrated Parasitic DRA Array: Analysis, Design and Measurement
No ratings yet
SIW-Integrated Parasitic DRA Array: Analysis, Design and Measurement
5 pages
Microwave Propagation in Ferrites
No ratings yet
Microwave Propagation in Ferrites
16 pages
Skin Effect Analysis in Microstrip Lines
No ratings yet
Skin Effect Analysis in Microstrip Lines
9 pages
Radar Receiver Mixer Basics
50% (2)
Radar Receiver Mixer Basics
23 pages
Digital Communications: Bajibabu Mutte
100% (1)
Digital Communications: Bajibabu Mutte
24 pages
EMA3D Full Wave EM Simulation For Lightning Protection
No ratings yet
EMA3D Full Wave EM Simulation For Lightning Protection
56 pages
DFT and IDFT in Signal Processing
No ratings yet
DFT and IDFT in Signal Processing
29 pages
CST Microwave Studio: General Purpose Solver 3d-Volume
100% (1)
CST Microwave Studio: General Purpose Solver 3d-Volume
30 pages
PCAAD7 Manual
No ratings yet
PCAAD7 Manual
120 pages
CST MWS GPU Computing 2
No ratings yet
CST MWS GPU Computing 2
2 pages
CST Studio Suite 2011 Brochure
No ratings yet
CST Studio Suite 2011 Brochure
20 pages
Carterfest UlrichBecker
No ratings yet
Carterfest UlrichBecker
75 pages
CST STUDIO SUITE - High Frequency Simulation PDF
100% (2)
CST STUDIO SUITE - High Frequency Simulation PDF
128 pages
CST Studio Suite - High Frequency Simulation PDF
No ratings yet
CST Studio Suite - High Frequency Simulation PDF
108 pages
CST Studio Suite 2016
No ratings yet
CST Studio Suite 2016
28 pages
CST Studio Suite 2016
No ratings yet
CST Studio Suite 2016
28 pages
CST Studio Suite 2016
No ratings yet
CST Studio Suite 2016
28 pages
CST Hardware Acceleration
No ratings yet
CST Hardware Acceleration
2 pages
CST Studio Suite - Thermal and Mechanical Simulation
100% (1)
CST Studio Suite - Thermal and Mechanical Simulation
72 pages
Link Budget PDF
No ratings yet
Link Budget PDF
28 pages
ADS Momentum Filter Design Tutorial
No ratings yet
ADS Momentum Filter Design Tutorial
18 pages
Link Budget
No ratings yet
Link Budget
25 pages
Darling Ton Synthesis Revisited
No ratings yet
Darling Ton Synthesis Revisited
8 pages
CPCT Exam Overview and Details
No ratings yet
CPCT Exam Overview and Details
81 pages
Lesson Plan: Configuring Network Adapters
No ratings yet
Lesson Plan: Configuring Network Adapters
20 pages
History of Computers Explained
No ratings yet
History of Computers Explained
33 pages
Os 1-100
No ratings yet
Os 1-100
8 pages
CUDA 6.0 Overview and Features
No ratings yet
CUDA 6.0 Overview and Features
13 pages
CS3351-DPCO Lesson Plan
No ratings yet
CS3351-DPCO Lesson Plan
5 pages
Basic Computer Course Test
No ratings yet
Basic Computer Course Test
2 pages
DOS Printing to Windows Printers Guide
No ratings yet
DOS Printing to Windows Printers Guide
3 pages
Q-Word-Form 1 Computer Studies Topical Revision 1
No ratings yet
Q-Word-Form 1 Computer Studies Topical Revision 1
5 pages
c06367535 PDF
No ratings yet
c06367535 PDF
100 pages
Supreme Stickers in PC - Buscar Con Google
No ratings yet
Supreme Stickers in PC - Buscar Con Google
1 page
Apple Product Price List
No ratings yet
Apple Product Price List
1 page
IBM AS/400 Technical Overview
100% (1)
IBM AS/400 Technical Overview
86 pages
Arm Assembly Language Programming
100% (4)
Arm Assembly Language Programming
170 pages
Operating Systems: Unit 6
No ratings yet
Operating Systems: Unit 6
8 pages
Enhance RAM with USB Drive Guide
No ratings yet
Enhance RAM with USB Drive Guide
8 pages
2010 AcadNet Computers Quiz for Grades 11-12
No ratings yet
2010 AcadNet Computers Quiz for Grades 11-12
5 pages
Bootloader Basics for Microcontrollers
No ratings yet
Bootloader Basics for Microcontrollers
19 pages
ASIC Design for ORCA RISC Core
No ratings yet
ASIC Design for ORCA RISC Core
1 page
Microprocessor Architecture and Functions
No ratings yet
Microprocessor Architecture and Functions
2 pages
Fpga Problem Statement
No ratings yet
Fpga Problem Statement
3 pages
Understanding CPU Registers and Operations
No ratings yet
Understanding CPU Registers and Operations
16 pages
Dq45ek Desktop Board Executive Series Motherboard
No ratings yet
Dq45ek Desktop Board Executive Series Motherboard
84 pages
IT Theory Chapter 1
No ratings yet
IT Theory Chapter 1
20 pages
USB Virtual COM (VCOM) Driver Guide: Dated Jan. 15, 2020
No ratings yet
USB Virtual COM (VCOM) Driver Guide: Dated Jan. 15, 2020
7 pages
Computer Specifications
No ratings yet
Computer Specifications
3 pages
Installing Ram
No ratings yet
Installing Ram
11 pages
P4V88
No ratings yet
P4V88
36 pages
Service M5X0G SM
No ratings yet
Service M5X0G SM
98 pages

High Performance Computing in CST Studio Suite: Felix Wolfheimer

Uploaded by

High Performance Computing in CST Studio Suite: Felix Wolfheimer

Uploaded by

High Performance Computing

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

Cluster system with high-

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

Electrically very large structures Extremely complex structures

MPI Client Nodes

 Based on a domain decomposition of the simulation domain.

Performance Results (for two cluster nodes):*

10,301 1,217 8.46 2.63

CPU CPU Version 2014 uses all

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

660 million cells

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

660 million cells

Field data is only stored in the

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

“Jobs” could be: DC Main Controller

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

1 GPU (Tesla 20)

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com

Currently supported providers hosting CST STUDIO SUITE:

More information can be found in the HPC section of our website:

You might also like