Analyzing Memory and Threading Correctness for GPU-Offloaded Code

Modern workloads are diverse—and so are architectures. No single architecture is best for every workload. Maximizing performance takes a mix of scalar, vector, matrix, and spatial architectures deployed in CPU, GPU, FPGA, and other future accelerators. Heterogeneity adds complexity that can be difficult to debug. This article introduces the new features of Intel® Inspector that support the analysis of code that’s offloaded to accelerators.

For more information: Analyzing Memory and Threading Correctness for GPU-Offloaded Code

INTEL® FPGA PAC can Filter, Aggregate, Sort, and Convert files faster than software alone

This article is taken from DATA PROCESSING TESTS BY NTT DATA SUGGEST THAT AN INTEL® FPGA PAC CAN FILTER, AGGREGATE, SORT, AND CONVERT FILES 4X FASTER THAN SOFTWARE ALONE from Intel

Nearly 80% of total data processing time is spent on tasks such as filtering, aggregation, sorting, and format conversion. NTT Data conducted proof-of-concept tests aimed at improving data processing performance for these tasks. The tests employed an Intel® FPGA Programmable Acceleration Card (Intel® FPGA PAC) to process Linux audit logs, resulting in processing speeds more than four times faster than the same processing done in exclusively in software.

Two factors drove this exercise:

  1. The advent of Intel FPGA PACs and other associated technologies have now made it far easier for companies to incorporate FPGAs as processing elements in data center servers.
  2. HLS technology—which enables engineers to use programming languages with C-like syntaxes for application development targeting FPGAs—makes it easier for software engineers to develop applications that target FPGAs.

For more information, do take a look at DATA PROCESSING TESTS BY NTT DATA SUGGEST THAT AN INTEL® FPGA PAC CAN FILTER, AGGREGATE, SORT, AND CONVERT FILES 4X FASTER THAN SOFTWARE ALONE

How to Install Intel® oneAPI Toolkits

In this four-minute video, Intel’s David Liu provides installation instructions for an offline/local installation of Intel’s oneAPI Toolkits. David shows you where to find the Intel oneAPI Toolkits, how to download the toolkits, and the various ways to install the toolkits: from the GUI or from the command line (a silent install without the GUI, an interactive install, or a custom install).

Graphics Trace Analyzer Deep Dive (Part 1)

Graphics Trace Analyzer Deep Dive | Part 1 | Configure and Capture a Trace | Intel Software

This video will demonstrate fundamental trace capture capabilities in Graphics Monitor that will aid in the configuration of a variety of performance data available for graphics application analysis trace files in Graphics Trace Analyzer.