I have a freeze problem and I haven't been able to find any clue as to why it happens:
I am using Ubuntu 22.04.1 and kernel version 5.15.0-56-generic.
I have an NVIDIA Quadro RTX 4000 with driver 525.60.13, which came with "sudo apt install cuda" (this installed CUDA 12).
When the freeze happens, if a terminal is open I can still type, and I can move my mouse around. I can run "ls", but I cannot get any output from "top", and after that I cannot cancel it with CTRL-C.
When it freezes I can still ssh into the machine, but the same behavior applies there.
At first I thought it was the application I was working on that caused this. However, I left the machine on for a while and it froze by itself while the application was idling. I checked the memory usage logs, and when it froze it was using a nominal amount of memory and a little less CPU than average.
I checked syslog and kern.log but found nothing interesting in them, except a mysterious "^@^@^@^@" pattern at the end after the freeze.
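For what it's worth, "^@" is how tools such as less and vim display a NUL byte (0x00), so that pattern is a run of zero bytes in the log file. Below is a minimal Python sketch for locating such runs. The function name find_nul_runs and the log path in the comment are my own choices for illustration.

```python
import re


def find_nul_runs(data: bytes) -> list[tuple[int, int]]:
    """Return (offset, length) for every run of NUL bytes in data."""
    return [(m.start(), len(m.group())) for m in re.finditer(rb"\x00+", data)]


# Example use (assumed path; reading it may need root or adm group membership):
#   with open("/var/log/syslog", "rb") as f:
#       print(find_nul_runs(f.read()))
```

The offsets can then be compared against the last readable log line before each run to see what was written just before the freeze.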
There are a lot of "ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13" entries in both syslog and kern.log.
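To put a number on "a lot", the entries can be counted and grouped by their ep_index and comp_code values. This is only a rough sketch: the regular expression is built from the message text quoted above, and the function name count_trb_errors and the path in the comment are assumptions of mine.

```python
import re
from collections import Counter

# Built from the message quoted above; captures ep_index and comp_code.
TRB_ERROR = re.compile(
    r"Transfer event TRB DMA ptr not part of current TD "
    r"ep_index (\d+) comp_code (\d+)"
)


def count_trb_errors(lines):
    """Count matching log lines, grouped by (ep_index, comp_code)."""
    counts = Counter()
    for line in lines:
        match = TRB_ERROR.search(line)
        if match:
            counts[(int(match.group(1)), int(match.group(2)))] += 1
    return counts


# Example use (assumed path):
#   with open("/var/log/kern.log", errors="replace") as f:
#       print(count_trb_errors(f))
```

Opening the file with errors="replace" keeps the loop from failing on undecodable bytes in the log.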
No interesting logs from my application anywhere
Both my SSD and HDD have plenty of empty space
My swap size is 2GB and swappiness at 60
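The swap figures above can be read from /proc/meminfo and /proc/sys/vm/swappiness, which is handy for logging them over time until the next freeze. Here is a minimal sketch; the function names parse_meminfo and swap_summary are hypothetical, and the code assumes the usual "Field: value kB" layout of /proc/meminfo.

```python
def parse_meminfo(text: str) -> dict[str, int]:
    """Parse /proc/meminfo-style text into {field: value}; sizes are in kB."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        fields = rest.split()
        if sep and fields:
            info[key.strip()] = int(fields[0])
    return info


def swap_summary(meminfo_text: str, swappiness_text: str) -> dict[str, int]:
    """Collect swap size, free swap, and the swappiness setting."""
    info = parse_meminfo(meminfo_text)
    return {
        "swap_total_kb": info["SwapTotal"],
        "swap_free_kb": info["SwapFree"],
        "swappiness": int(swappiness_text),
    }


# Example use on a live system:
#   with open("/proc/meminfo") as m, open("/proc/sys/vm/swappiness") as s:
#       print(swap_summary(m.read(), s.read()))
```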
Besides, this same freeze also occurred without my application running.
I was suspicious of the GPU drivers, but I switched to the nouveau driver and it still kept happening.