StefanG3D
Posted January 4

Download now (same driver as in Update 1)

General CUDA

New Features

- CUDA User Mode Driver, CUDA Runtime libraries, and cuBLAS now come with obfuscated symbol names and with frame pointers enabled. Frame pointers are also enabled for other libraries in the CUDA Toolkit: NVIDIA Management Library, CUDA Profiling Tools Interface, cuBLAS, and the compiler libraries (NVRTC, PTXJIT compiler, nvJitLink, and libnvvm). This allows better runtime visibility and traceability, and makes it easier to exchange runtime information with NVIDIA when needed for debugging purposes. See https://developer.nvidia.com/blog/cuda-toolkit-symbol-server/ for information on how to use obfuscated symbols. The symbol server address is https://cudatoolkit-symbols.nvidia.com/.
- Lazy loading enabled by default on Windows: Brings the significant memory savings and load-time reductions of lazy loading to Windows by default. Additionally, this makes the behavior equivalent between Linux and Windows.
- Single-step CUDA uninstall for Windows: It is no longer necessary to uninstall multiple components of the CUDA Toolkit individually to upgrade or uninstall CUDA. This can now be done in a single step.
- CUDA Graphs: Graph edge data, allowing modified dependencies between nodes. Programmatic Dependent Launch may now be described natively in CUDA Graphs.
- Launch completion events: Allows a dependency on scheduling, but not completion, of all blocks in a kernel, enabling tighter control of scheduling.
- MPS: Added a CUDA API to query whether or not MPS is running.
- Added a driver API to return the name of a kernel function.
- Added an API to libnvJitLink to return the nvJitLink version.
- Added support for reading kernel parameters in device functions.
- Enabled querying the return type of __device__ lambdas with a trailing return type. Fixes uncommon failures when using device-side lambdas.
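For anyone who wants to compare behaviour with and without the new lazy-loading default on Windows, here is a minimal sketch in Python. It assumes the mode is selected through the CUDA_MODULE_LOADING environment variable with the values LAZY and EAGER, and that the variable has to be in place before the CUDA application starts. The helper names env_with_module_loading and run_with_module_loading are hypothetical, and the child command is only a placeholder that echoes the variable, not a real CUDA program.

```python
import os
import subprocess
import sys

# Values assumed for CUDA_MODULE_LOADING in this sketch.
VALID_MODES = {"LAZY", "EAGER"}


def env_with_module_loading(mode, base_env=None):
    """Return a copy of the environment with CUDA_MODULE_LOADING set to `mode`.

    `base_env` defaults to the current process environment. The original
    mapping is never modified.
    """
    mode = mode.upper()
    if mode not in VALID_MODES:
        raise ValueError(f"unsupported module loading mode: {mode!r}")
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_MODULE_LOADING"] = mode
    return env


def run_with_module_loading(cmd, mode):
    """Run `cmd` in a child process whose environment selects `mode`.

    Setting the variable on the child, rather than in the current process,
    ensures it is present before the child initializes CUDA.
    """
    return subprocess.run(cmd, env=env_with_module_loading(mode), check=False)


if __name__ == "__main__":
    # Placeholder child: a real use would launch the CUDA application instead.
    child = [
        sys.executable,
        "-c",
        "import os; print(os.environ['CUDA_MODULE_LOADING'])",
    ]
    run_with_module_loading(child, "EAGER")
```

Running the same application once with "LAZY" and once with "EAGER" is one way to check the memory and load-time difference on your own workload.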
NVML / nvidia-smi:

- Added a metric for front-end context switch (FECS) utilization.
- Added metrics for Ada Lovelace AV1 codec utilization.
- Added support for GPU monitoring on Tegra.
- Added an NVML API to expose H100 PCIe counters, and a corresponding PCIe section in nvidia-smi.