Score:10

Server

How can I make Linux reboot instead of remounting the filesystem as read-only?

user541686

3/23/23, 6:07 AM

Linux systems sometimes remount the root file system as read-only, e.g. if there's an I/O error.

I have a machine that becomes useless when this happens, and I end up rebooting it manually.

Is there a way to make Linux just automatically reboot when this happens? A read-only mount is useless to me.

2522

0 + 0

filesystems

corruption

reboot

Simon Richter

3/23/23, 3:35 PM

I'd also investigate the source of these I/O errors. The last time an ext2 filesystem went readonly for me was in 1994, and the cause could be traced to a broken CPU fan.

15

Reply

Austin Hemmelgarn

3/23/23, 9:27 PM

You have an [XY problem](https://meta.stackexchange.com/questions/66377/what-is-the-xy-problem) here. The correct solution is not to make the system reboot on an IO error (the accepted answer explains how to do that, _but_ that’s actually rather risky for multiple reasons), it’s to _fix the root cause of the IO errors_, because then the filesystem will not randomly get mounted read-only. If it’s only intermittent and the storage device is good, you probably have suspect RAM or a flaky PSU, both of which can cause much bigger issues than a simple filesystem error.

12

Reply

user541686

3/24/23, 12:35 AM

@AustinHemmelgarn: I don't have an XY problem here. You're just making a lot of assumptions that don't hold true in the case(s) I'm asking about.

1

Reply

user541686

3/24/23, 12:39 AM

@SimonRichter: I indeed have tried looking into the cause, but thanks for the reminder, others should probably do that before rebooting.

0

Reply

user541686

3/24/23, 12:45 AM

I just realized somehow I posted this on ServerFault rather than Unix.SE as I had intended to! Glad it's still on-topic I guess, but feel free to migrate if needed.

0

Reply

Shadur

3/24/23, 3:02 PM

Rebooting rather than sort out the reason for the R/O remount has a high likelihood of making the problem *worse* - especially if it fails to mount the system on reboot and you're now stuck with an entirely unresponsive system.

7

Reply

Austin Hemmelgarn

3/24/23, 5:34 PM

@user541686 You have random IO errors. That _will_ cause other problems eventually (and trust me, they will be much more of a pain to fix than just rebooting the system), hence my assertion that this is an XY problem. The fact that you do not recognize the X as a problem does not make it any less of an XY problem.

7

Reply

user541686

3/24/23, 6:34 PM

@AustinHemmelgarn: I'm well aware of what's going on in my situation and why I resorted to this solution. Unfortunately you're not. The fact that you don't recognize you're still making unfounded assumptions about my situation doesn't make you more correct, but admittedly I can't stop you from lecturing.

0

Reply

user541686

3/24/23, 6:38 PM

@Shadur: I fully understand all that, believe it or not. Nobody is saying this solution should be used in every situation. I'm just telling you I have **a** situation where this solution makes sense. If you can't imagine why, that's fine. Just have faith in me that I'm not stupid and that I'm only asking this because there's information I have that you don't.

1

Reply

Mark

3/24/23, 9:13 PM

@user541686, if there's relevant information, provide it. Don't just say "trust me".

4

Reply

user541686

3/25/23, 2:31 AM

@Mark: No, I won't provide irrelevant info. It's quite literally nobody's business what situation I'm dealing with that gave rise to this question. If you would rather believe it's out of my stupidity, feel free to continue believing that; don't feel obligated to "trust me". It's not like I can force you.

3

Reply

Andrew Henle

3/25/23, 12:23 PM

@user541686 You're papering over I/O errors on the root filesystem with a reboot, on the ***HOPE*** that your system will return to operational status. You're coming across as someone who thinks they know everything but in reality is just smart enough to be dangerous. You may think you know why you're getting IO errors, but what happens **when** you get one that's not like you think? You get a dead system that you can't access. "I know what's going on!" doesn't provide any limits as to what **can** go on - the universe doesn't care about what you think you know.

0

Reply

marcelm

3/25/23, 4:20 PM

@Mark (and others) _"... if there's relevant information, provide it. Don't just say trust me."_ - I don't think it's worth barking up the XY tree here. First of all, the question as it stands (panic/reboot instead of remounting ro) is a perfectly valid and answerable question. Secondly, the OP seems well aware that I/O errors are, ahem, not ideal, and has now explicitly declared that area off-topic. Sadly, sometimes there's just nothing you can do to fix the root cause _right now_, and a workaround is needed. With that in mind, I don't think we're in a place to demand OP provide more context.

3

Reply

Score:23

Server

shodanshok

3/23/23, 7:53 AM

I deduce you are using ext3 or ext4 as the file system. If so, you can mount it with the errors=panic option and configure watchdog to reboot your system in case a panic happen.

While more complex than roelvanmeer's answer (which I upvoted), it has the added bonus of working for all panic-level kernel crash.

As suggested by NikitaKipriyanov, setting the panic=5 kernel boot option can be a simpler alternative to watchdog (which has more configuration options but it is slightly more complex as result).

0 + 0

Nikita Kipriyanov

3/23/23, 9:51 AM

Alternative to watchdog might be adding something like `kernel.panic = 5` into the `/etc/sysctl.d/panic-reboot.conf`.

2

Reply

user541686

3/23/23, 10:07 AM

Thank you! I'll give this a shot. Hopefully it won't [fail to reboot](https://forums.debian.net//viewtopic.php?f=5&t=102033)!

0

Reply

shodanshok

3/23/23, 10:45 AM

@NikitaKipriyanov good suggestion, I'll edit my answer. Thanks.

0

Reply

joshudson

3/23/23, 3:54 PM

warning: probable reboot loop

6

Reply

user541686

3/24/23, 12:38 AM

@joshudson: Yeah I'm planning to watch out for that, that's definitely an important warning for anyone trying this.

0

Reply

Andrew Henle

3/25/23, 12:25 PM

@joshudson If it reboots at all. Relying on a system that knows its root filesystem might be corrupt and/or its root disk broken to reboot is based on wishful thinking and unicorns.

0

Reply

joshudson

3/25/23, 4:13 PM

@AndrewHenle: I've brought a lot of systems up with a trashed root filesystem. Usually I can' take over the boot process and get fsck to run because the damage rarely hits `/sbin` or files that haven't changed in awhile.

2

Reply

Andrew Henle

3/25/23, 4:24 PM

@joshudson You hope... ;-) My thoughts here are based on the idea that trying to soldier on when your root filesystem device is tossing IO errors is a misguided effort in the first place and throwing in a reboot only makes significant issues more likely - "My root device is going bad, so let's do something that ***really*** depends on the root device being fully functional and having proper access to the bulk of the filesystem!"

0

Reply

Score:14

Server

roelvanmeer

3/23/23, 7:42 AM

Maybe not a very pretty solution, but my first thought would be to run a command from cron every minute:

test -w / || reboot

0 + 0

user541686

3/23/23, 10:07 AM

+1 thanks, this'll be a great fallback if the other solution fails!

0

Reply

pabouk - Ukraine stay strong

3/23/23, 6:19 PM

I think it is not guaranteed that `test -w` checks if the filesystem is read-write. Though GNU `test` and `test` built into `bash` seems to do that. --- Here you can see what should POSIX-compliant `test` do: https://pubs.opengroup.org/onlinepubs/9699919799/utilities/test.html#tag_20_128_05 As I understand it `test` is only required to check the access rights of the file.

0

Reply

joshudson

3/23/23, 7:15 PM

In which case `tee -a /root/.bash_history < /dev/null || reboot` will work.

1

Reply

shodanshok

3/23/23, 8:38 PM

@joshudson Even simpler: `touch /writecheck || reboot`

0

Reply

FeRD

3/23/23, 10:07 PM

@shodanshok That's great if you don't mind a file called `/writecheck` lying around at the root of your filesystem, since it'll be created when the filesystem _isn't_ read-only. The other proposed methods were attempting to avoid creating any spurious empty files. (Though if pabouk is right — which I'm 50/50 on, personally — actually-creating a file may be unavoidable, in order to fully determine the filesystem's read-only state.)

0

Reply

pabouk - Ukraine stay strong

3/24/23, 10:49 AM

Another question about the problem of testing write access: [How to non-invasively test for write access to a file?](https://unix.stackexchange.com/q/159557/19702)

0

Reply

rackandboneman

3/24/23, 7:23 PM

@shodanshok that could lead to unexpected reboots - or reboot loops - from error conditions unrelated to filesystem errors, eg temporary upsets of the libc installation, OOM conditions, anything that could make touch fail....

2

Reply

shodanshok

3/24/23, 8:07 PM

@rackandboneman sure - but *any* script with `|| reboot` is subject to these issues. Moreover, if `touch` fails on your system due to libc issues, you probably have worse problem then a reboot loop. Anyway, as stated in my answer, `watchdog` is the way to go for more advanced needings.

3

Reply

Elon Musk

I sit in a Tesla and translated this thread with Ai:

EN: How can I make Linux reboot instead of remounting the filesystem as read-only?

TH: ฉันจะทำให้ Linux รีบูตแทนการเมานต์ระบบไฟล์ใหม่เป็นแบบอ่านอย่างเดียวได้อย่างไร

RO: Cum pot face să repornesc Linux în loc să remontez sistemul de fișiere ca doar pentru citire?

RU: Как я могу перезагрузить Linux вместо перемонтирования файловой системы только для чтения?

VI: Làm cách nào tôi có thể khởi động lại Linux thay vì sắp xếp lại hệ thống tệp ở dạng chỉ đọc?

How can I make Linux reboot instead of remounting the filesystem as read-only?

Post an answer