Speaker: Marc Hamilton, VP of Solutions Architecture and Engineering, NVIDIA Panelists: Nicola Rieke, Dion Harris, Timothy Costa, Gilad Shainer, Geetika Gupta
by NVIDIA CEO Jensen Huang’s Teratec Keynote: The Industrial HPC Revolution
Taken from Sharcnet HPC
The video considers two cases: GPUs inside a single node, and the multi-node case.
Join the NVIDIA Jetson team for the latest episode of our AMA-style live stream, Jetson AI Labs.
There is a question I wanted to answer about deep learning: which system, network, and protocol features speed up training and/or inferencing? Training and inferencing do not necessarily need the same level of requirements. I received the following information during an NVIDIA presentation:
- Scalability requires ultra-fast networking
- Same hardware needs as HPC
- Extreme network bandwidth
- SHARP (Mellanox Scalable Hierarchical Aggregation and Reduction Protocol)
- GPUDirect (https://developer.nvidia.com/gpudirect)
- Fast Access Storage
- Highly Transactional
- Ultra-low Latency
- Instant Network Response
- PeerDirect, GPUDirect
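
To make the "aggregation and reduction" in SHARP a bit more concrete, here is a toy sketch of a hierarchical all-reduce in plain Python. It is only an illustration of the idea of summing data up a tree and sending the result back to every worker. It is not the SHARP API or NVIDIA's implementation, and the function name `tree_allreduce` and the `fanout` parameter are my own assumptions for this example.

```python
from typing import List


def tree_allreduce(grads: List[List[float]], fanout: int = 2) -> List[List[float]]:
    """Toy hierarchical all-reduce (hypothetical helper, not a real library call).

    Sums the per-worker vectors up a tree, then gives every worker the total.
    """
    if fanout < 2:
        raise ValueError("fanout must be at least 2")
    if not grads:
        return []

    # Copy the inputs so the callers' lists are left untouched.
    level = [list(g) for g in grads]

    # Reduction phase: each simulated aggregation point sums the vectors
    # of up to `fanout` children, element by element.
    while len(level) > 1:
        level = [
            [sum(vals) for vals in zip(*level[i:i + fanout])]
            for i in range(0, len(level), fanout)
        ]

    total = level[0]

    # Broadcast phase: every worker receives the same reduced vector.
    return [list(total) for _ in grads]


# Three workers, each holding a two-element "gradient".
workers = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
print(tree_allreduce(workers))  # each worker ends up with [9.0, 12.0]
```

In a real cluster the summing would happen in the network fabric or in a communication library rather than in a Python loop, which is why the list above stresses network bandwidth and latency.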
From NVIDIA Developer. An interesting YouTube video.
An interesting blog post that explains the difference between a DPU, a CPU, and a GPU.
So What Makes a DPU Different?
A DPU is a new class of programmable processor that combines three key elements. A DPU is a system on a chip, or SOC, that combines:
- An industry standard, high-performance, software programmable, multi-core CPU, typically based on the widely-used Arm architecture, tightly coupled to the other SOC components
- A high-performance network interface capable of parsing, processing, and efficiently transferring data at line rate, or the speed of the rest of the network, to GPUs and CPUs
- A rich set of flexible and programmable acceleration engines that offload and improve applications performance for AI and Machine Learning, security, telecommunications, and storage, among others
For more information, do take a look at What’s a DPU? …And what’s the difference between a DPU, a CPU, and a GPU?