Hi all!

I have a nice setup with some containers (rootless Podman) and bare-metal services (anything I can install on bare metal usually goes on bare metal).

I used Monit in the past to keep an eye on my services and automatically restart anything that goes down for whatever reason. I stopped using it because it doesn’t scale well on mobile browsers and it’s frankly clumsy to configure.

I could go back to Monit, I guess, but I am wondering if there is anything better out there to try.

A few requirements (not necessarily mandatory, but preferable):

  • Open source (ideally true open source, not just commercial solutions with dumbed-down free versions)
  • Not limited to, or focused on, containers (no Watchtower and similar)
  • For containers, it is enough to support “works” or “restart”
  • For containers, if it goes beyond the minimum of “works” and “restart”, it must support Podman
  • Must support bare metal services (status, start, stop)
  • Must send email or other kinds of notifications (IM notifications are OK, but email is preferred)
  • Should additionally monitor external machines (e.g. other servers on the LAN) or generic IP addresses
  • Should detect if a web service is alive but blocked
  • No need for fancy GUIs or a web GUI (it’s a plus, but not required)
  • No need for data reporting, graphs and similar amenities. They are a plus, but 100% not required.
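
For the “alive but blocked” requirement, one common approach is an HTTP probe with a timeout instead of a plain process check: a hung service still accepts the TCP connection but never answers. A minimal Python sketch using only the standard library; the function name `probe_http`, the status labels and the 5-second default are made up for illustration and do not come from any tool mentioned in this thread:

```python
import socket
import urllib.error
import urllib.request


def probe_http(url: str, timeout: float = 5.0) -> str:
    """Classify a web service as 'ok', 'unhealthy', 'blocked' or 'down'.

    Hypothetical helper: the labels are illustrative, not a standard.
    """
    try:
        # urlopen only returns normally for successful (non-4xx/5xx) responses
        with urllib.request.urlopen(url, timeout=timeout):
            return "ok"
    except urllib.error.HTTPError:
        # The server answered, but with a 4xx/5xx status
        return "unhealthy"
    except (TimeoutError, socket.timeout):
        # Connection was accepted but no response arrived in time
        return "blocked"
    except urllib.error.URLError as exc:
        # Connect-phase timeouts arrive wrapped in URLError
        if isinstance(exc.reason, (TimeoutError, socket.timeout)):
            return "blocked"
        return "down"
    except OSError:
        # Connection reset or similar low-level failure
        return "down"
```

Something like `probe_http("http://127.0.0.1:8080/")` (hypothetical URL) could then run from a timer, with a restart triggered only on "blocked" or "down".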

What do you guys use?

  • LifeBandit666
    2 days ago

    I think I’m a step behind you. I use Uptime Kuma for monitoring and it has worked really well. I just have it running on a Pi separate from my main machine.

    I worked out how to get it sending me emails when things are down and up, and now my email inbox is a fucking hot mess of notifications.

    So I’ve just this weekend integrated it into Home Assistant and set it to notify me when things are down for 5 minutes or more.

    My next step was going to be finding some way of integrating Portainer into Home Assistant so I can restart stopped containers, and maybe Proxmox so I can reboot VMs from HA. Not sure it’s possible yet though.

    Ultimately I want to have HA send me a notification with actionable buttons with “reboot container” and “reboot VM” which, when pressed, will sort the issue out.

    However, this will not help when one of my drives goes down. They’re HDDs plugged in via USB 3, which isn’t great, and my server is behind the coat rack, so sometimes the kids just throw their coats on and one falls onto my server, which then heats up and goes silly.

    • marsara9@lemmy.world
      2 days ago

      Can you share the Home Assistant automation / setup that you have for Uptime Kuma notifications? I’m in the same boat as you. I just got a webhook set up, but I’m getting flooded with notifications, especially after services update.

      I just want to be notified when a particular service has been down for, say, 5 minutes, and all I care about is knowing the node name. I don’t necessarily care to get notified if the service comes back up.

      • Darkassassin07@lemmy.ca
        1 day ago

        This is what the ‘retries’ setting in each monitor is for. A monitor will only be considered down if it has failed its heartbeat check <retries> number of times in a row.
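
To make the “N failures in a row” idea concrete, here is a minimal Python sketch of a consecutive-failure counter. It only illustrates the general technique; the class name `ConsecutiveFailureGate` and its methods are hypothetical and this is not Uptime Kuma’s actual code:

```python
class ConsecutiveFailureGate:
    """Report 'down' only after `threshold` failed checks in a row."""

    def __init__(self, threshold: int) -> None:
        if threshold < 1:
            raise ValueError("threshold must be at least 1")
        self.threshold = threshold
        self.failures = 0

    def record(self, check_ok: bool) -> bool:
        """Record one heartbeat result; return True if considered down."""
        if check_ok:
            # Any successful heartbeat resets the streak
            self.failures = 0
        else:
            self.failures += 1
        return self.failures >= self.threshold
```

With a threshold of 3, a single blip during a service update never reaches the “down” state, which is what stops the notification flood.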

      • LifeBandit666
        2 days ago

        I did it all in Node-RED, so unfortunately I can’t share the automation, but I can point you at this HACS integration: https://github.com/meichthys/uptime_kuma

        Set that up and all your nodes will be visible in HA. Then it’s just a case of “if node X is off for X minutes” - “notify”.
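
The “off for X minutes, then notify” rule can be sketched outside of Home Assistant too. This is a rough Python illustration of the logic only, assuming you poll the node state yourself and pass in a timestamp in seconds; the class name `DownForTrigger` and its methods are hypothetical, and it fires once per outage with no “back up” message:

```python
from typing import Optional


class DownForTrigger:
    """Fire once when a node has been down for at least `hold_seconds`."""

    def __init__(self, hold_seconds: float) -> None:
        self.hold_seconds = hold_seconds
        self.down_since: Optional[float] = None
        self.notified = False

    def observe(self, is_up: bool, now: float) -> bool:
        """Feed one observation; return True when a notification is due."""
        if is_up:
            # Recovery resets the timer silently (no 'back up' notification)
            self.down_since = None
            self.notified = False
            return False
        if self.down_since is None:
            self.down_since = now
        if not self.notified and now - self.down_since >= self.hold_seconds:
            self.notified = True
            return True
        return False
```

For a 5-minute hold, `DownForTrigger(300)` would be fed the node state on every poll, and a notification sent only when `observe` returns True.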