Error: For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

2025/8/14 3:45:09 / Source: https://blog.csdn.net/xiao_lxl/article/details/131228291

The error message is as follows:

./aten/src/ATen/native/cuda/NLLLoss2d.cu:103: nll_loss2d_forward_kernel: block: [29,0,0], thread: [707,0,0] Assertion t >= 0 && t < n_classes failed.
......
RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call, so
the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.

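Training runs normally up to epoch=9 and then fails with the error above.
Deleting the cached files under models/__pycache__ and rerunning on the dataset does not help; the error still occurs.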
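The hint in the error message can be followed from inside the training script itself. The snippet below is a minimal sketch, assuming it is placed at the very top of the entry script, before torch is imported and before any CUDA call is made; the environment variable is read when CUDA is initialized, so setting it later is not expected to have an effect.

```python
import os

# Force synchronous kernel launches so the Python stack trace points at the
# call that actually triggered the device-side assert.
# Assumption: this runs before `import torch` and before any CUDA work.
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"

print(os.environ["CUDA_LAUNCH_BLOCKING"])
```

Synchronous launches slow training down noticeably, so this setting is best used only while tracking down the failure.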

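Solution:
The labels are the problem. The label of one image is corrupted: it contains class values beyond the configured number of classes, which is exactly what the assertion t >= 0 && t < n_classes checks (every target value t must lie in the range 0 to n_classes - 1).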
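One way to locate the corrupted label is to scan every label file before training and report values that would violate the assertion. The following is a minimal sketch, not the original author's method: the function name `find_out_of_range_labels`, the file names, and the sample values are all hypothetical, and the `ignore_index` default of -100 is assumed to match the default used by PyTorch's NLL and cross-entropy losses.

```python
from typing import Dict, Iterable, List


def find_out_of_range_labels(
    labels_by_file: Dict[str, Iterable[int]],
    n_classes: int,
    ignore_index: int = -100,
) -> Dict[str, List[int]]:
    """Return {file name: sorted invalid label values} for every bad file.

    A value is valid when 0 <= value < n_classes, or when it equals
    ignore_index (the value the loss is configured to skip).
    """
    bad = {}
    for name, values in labels_by_file.items():
        invalid = sorted(
            {int(v) for v in values
             if v != ignore_index and not (0 <= v < n_classes)}
        )
        if invalid:
            bad[name] = invalid
    return bad


# Hypothetical data: flattened label values per mask file, 3 classes.
masks = {
    "img_001.png": [0, 1, 2, 1, 0],
    "img_002.png": [0, 3, 2, 255],  # 3 and 255 are out of range
    "img_003.png": [2, 2, -100],    # -100 is skipped as ignore_index
}
print(find_out_of_range_labels(masks, n_classes=3))
# → {'img_002.png': [3, 255]}
```

For real segmentation masks loaded as arrays, passing only the unique values of each mask (for example the result of `numpy.unique(mask)`) keeps the scan cheap. Any file reported here either needs its label fixed or the configured class count needs to be raised to cover it.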
