HPC

New C++ Sender Library Enables Portable Asynchrony

NVIDIA is excited to announce the release of the stdexec library on GitHub and in the 22.11 release of the NVIDIA HPC Software Development Kit. The stdexec library is a proof-of-concept implementation of the Sender model (or Senders) for asynchronous computing, which is expected to become part of the C++ Standard in C++26. Senders help you separate the logic of your algorithm (the work you actually care about) from the details of where and how it gets executed. Unlike the C++17 parallel algorithms, Senders let you chain multiple asynchronous computations (like GPU kernels) without unnecessary synchronization.

Leveraging Standards-Based Parallel Programming in HPC Applications

Last month I discussed why standards-based parallel programming should be in your HPC toolbox. Now I highlight the successes of developers who have already made standards-based parallelism an integral part of their strategy. As you will see, success with standards-based programming isn’t limited to mini-apps.

Why Standards-Based Parallel Programming Should be in Your HPC Toolbox

HPC application developers have long relied on programming abstractions that were developed and used almost exclusively within the realm of traditional HPC. In this post I discuss why you should write parallel code natively in ISO C++ and ISO Fortran.

Using Fortran Standard Parallel Programming for GPU Acceleration

In this post we discuss the benefits of using Fortran DO CONCURRENT loops for GPU programming.
