What is the best way to set up Prometheus + Alertmanager with a HA

serge1peshcoff

11/24/23, 3:48 PM

I have a monitoring solution that uses Prometheus as a scraper and a data storage, Grafana as a visualiser and Alertmanager as an alerting tool. This all is running on a single server.

However, there's an issue with this approach. If a server that is hosting all of this goes down, I basically lose all the monitorings, so in case something would crash after that I would never know.

I assume best way to handle that would be to have 2 servers, so they somehow share the same information, and I would be notified that a node in this setup is down. However, how should I set up Prometheus and Grafana so they won't be a single point of failure?

As far as I know I can set up an Alertmanager cluster but that won't solve the issue when a single instance of Prometheus is down, so I'll have to replicate it as well somehow.

0 + 0

monitoring

high-availability

prometheus

alertmanager

Elon Musk

I sit in a Tesla and translated this thread with Ai:

EN: What is the best way to set up Prometheus + Alertmanager with a HA

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.