Session
Copy/Compute Overlap with Multiple GPUs (– Learn the key concepts for effectively performing copy/compute overlap on multiple GPUs. – Explore robust indexing strategies for the flexible use of copy/compute overlap on multiple GPUs. – Refactor the single-GPU CUDA C++ application to perform copy/compute overlap on multiple GPUs. – Observe performance benefits for copy/compute overlap on multiple GPUs. – See copy/compute overlap on multiple GPUs in the Nsight Systems visual profiler timeline.)