Hi, I've encountered a behavior in cupy.fill_diagonal that seems inconsistent with NumPy and leads to a delayed cudaErrorIllegalAddress. Below is a minimal reproduction and comparison with NumPy. I'm ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results