ESX hosts crash within same vcenter cluster

TLMstack

9/25/23, 9:12 PM

I have a vcenter cluster of 12 ESX hosts (ClusterA) and another cluster of 3 ESX hosts (ClusterB). All of these are a mix of poweredge r620s and r630s.

Some of the hosts have hardware errors that can be seen in the iDRAC logs and front LCD screen such as:

CPU machine check error
Correctable memory error rate exceeded As expected, this is causing those hosts to be unavailable (Not responding) in the cluster.

Fixing these hardware errors usually involves these steps:

power off
remove network cards
power on and wait for successful boot to OS
power off
place the same network cards back in
power on It's strange to me that this would fix CPU & memory errors, but that's what happens consistently.

ClusterB is fine - no problems ever. The real problem I'm facing is that when I fix a couple hosts from ClusterA, 1-3 other random hosts in ClusterA will crash within a day or two. After those initial 1-3 crashes, if I leave things alone, no more hosts crash afterwards for weeks. This puts me back to where I started and I've observed this behavior several times now.

Any ideas on what to check?

0 + 0

vmware-esx

vmware-esxi

vmware-vcenter

vmware-vsphere

dell-poweredge

joeqwerty

9/25/23, 9:16 PM

Contact Dell support. That's your best bet.

TLMstack

9/25/23, 11:29 PM

@joeqwerty Unfortunately, I've already contacted Dell support several times - that's where the above remediation steps originally came from.

Elon Musk

I sit in a Tesla and translated this thread with Ai:

EN: ESX hosts crash within same vcenter cluster

TH: โฮสต์ ESX ขัดข้องภายในคลัสเตอร์ vcenter เดียวกัน

RO: Gazdele ESX se blochează în același cluster vcenter

RU: Сбой хостов ESX в одном кластере vcenter

VI: Máy chủ ESX gặp sự cố trong cùng một cụm vcenter

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.