May 3, 2021 by kittycool only

Analyzing Memory and Threading Correctness for GPU-Offloaded Code

Modern workloads are diverse, and so are architectures. No single architecture is best for every workload. Maximizing performance takes a mix of scalar, vector, matrix, and spatial architectures deployed in CPUs, GPUs, FPGAs, and other accelerators. This heterogeneity adds complexity that can be difficult to debug. The article introduces the new features of Intel® Inspector that support memory and threading analysis of code that is offloaded to accelerators.

For more information: Analyzing Memory and Threading Correctness for GPU-Offloaded Code

May 3, 2021 by kittycool only

On a Mission of Disaster Management & Scientific Discoveries


The Linux Cluster

Linux Cluster Blog is a collection of how-tos and tutorials for Linux Cluster and Enterprise Linux.

Day: May 3, 2021
