Score:-3

Landing place for all Prometheus alerts and warnings


I would like to know whether there is a tool that keeps all triggered Prometheus alerts in a single place, such as a dashboard, from which I can also link to a ticketing tool (e.g. Jira). With 50,000+ Ubuntu servers, this would give me a single place to see which alerts are firing on which node and to create a ticket, so that I can perform further checks accordingly. For example:

1-> Site 1 has high ICMP loss
2-> Node on site 3 has lost ICMP connectivity
3-> Node xx on site 5 has the mongo service in a fatal state

Score:0

Yup, Telegraf, Influx and Grafana.

In short, Telegraf (a small agent written in Go) can be configured to watch your critical services and send metrics to InfluxDB. InfluxDB OSS is an open source time series database you can run in your own infrastructure to store that data. You can then use Grafana to provide a dashboard showing the status of all the hosts.
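To give a feel for what travels between the agent and the database, here is a minimal Python sketch of the InfluxDB line protocol, the text format in which metrics are written to InfluxDB. Telegraf does this for you, so this is only an illustration. The function names, the `ping` measurement, and the tag and field names are made up for this example, and the escaping is simplified.

```python
def escape(value: str, chars: str) -> str:
    """Backslash-escape every character from `chars` found in `value`."""
    for ch in chars:
        value = value.replace(ch, "\\" + ch)
    return value


def format_field(value) -> str:
    """Render one field value in line protocol syntax."""
    # bool must be tested before int, because bool is a subclass of int
    if isinstance(value, bool):
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"  # integers carry an "i" suffix
    if isinstance(value, float):
        return repr(value)
    # strings are double-quoted; escape backslashes first, then quotes
    return '"' + escape(str(value), '\\"') + '"'


def to_line_protocol(measurement: str, tags: dict, fields: dict, timestamp_ns: int) -> str:
    """Build one line: measurement,tag=value field=value timestamp."""
    head = escape(measurement, ", ")
    for key in sorted(tags):  # sorted tag keys give a stable output
        head += "," + escape(key, ",= ") + "=" + escape(tags[key], ",= ")
    body = ",".join(
        escape(name, ",= ") + "=" + format_field(value)
        for name, value in fields.items()
    )
    return f"{head} {body} {timestamp_ns}"


if __name__ == "__main__":
    # Hypothetical sample: packet loss seen from one node (names are illustrative)
    print(to_line_protocol(
        "ping",
        {"site": "site 5", "host": "node-xx"},
        {"percent_packet_loss": 12.5, "packets_received": 7},
        1700000000000000000,
    ))
```

Each line carries a measurement name, tags that identify the host and site, the measured fields, and a nanosecond timestamp. A Grafana dashboard can then filter and group by those tags.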

There are also cloud services from InfluxData which can alert you via Slack, PagerDuty, SMS or email when an adverse event occurs.

Another option is Nagios, which is a well-regarded network monitoring system.

babybash
Thanks for the update, @popey. The same can be done with Prometheus, right? If so, what is the main difference between Telegraf and Prometheus?
pl flag
I don't know Prometheus, sorry.