Our group's server (running Ubuntu 20.04.5 LTS) is currently stuck in a "BUG: soft lockup" endless set of error message (2 out of 88 CPUs are unhappy).
However, the issue of the bug/soft lockup problem isn't what I'm asking about - I want to know if there's any way to escape the error messages/restart the server without physical access. I can't escape the error messages to do anything (during the first few times the error message appeared, I was able to do control C to get back to my bash session, but now it doesn't work either). I can't ssh into the server from a different terminal window (it just hangs), nor can I access it via KVM (just a black window, and it says the status is 'Down').
We do not have physical access to our server - it is kept in a secure building, and if, for example the power goes out and our server turns off, we have to pester the staff there via email to get it turned back on. None of them are responding to me today and I would dearly like to start troubleshooting this issue so that we can actually use our computing resources.
Is there something I can do in order to at least temporarily get out of the endless error messages saying "BUG: soft lockup - CPU#X stuck for 22/23s" so that I can restart the server? (FYI, I have zero CS background; I am merely (and frighteningly) the most computer-literate member of our research group, so, uh, be aware of that.) Thanks.