
BeeGFS cluster: one node dead, need to move on without it


As the title suggests, a cluster consisting of four machines has one dead machine in the rack.

The cluster is set up with a buddy mirror system for redundancy, so the data should still be intact. The dead machine is the secondary in its mirror group, so how does one start the cluster and ignore any warnings and errors that would stem from the machine that is unreachable?

As it currently stands, all beegfs-meta servers are running and all remaining beegfs-storage services are running, but none of the beegfs-client services will start:

Jul31 17:42:31 *mount(44691) [Remoting (stat storage targets)] >> Error target (storage): 401; Msg: Communication error
Jul31 17:42:31 *mount(44691) [Mount sanity check] >> Retrieval of storage server free space info failed. Are the storage servers running and registered at the management daemon? Did you remove a storage target directory on a server? (Error: Communication error)
Jul31 17:42:31 *mount(44691) [App (stop components)] >> Stopping components...

Ideally, I would like to disable the dead node rather than remove it, as it will eventually come back after a hardware fix.
