2025-03-07, 09:40 AM
That’s a clever workaround. It’s not an ideal solution, but at least it keeps things running. Have you looked into whether the GPU is being reset or if there’s a driver issue? Maybe checking dmesg or nvidia-smi logs could provide more clues. Also, does the issue happen under specific workloads, or is it random? Curious if there’s a more permanent fix.