So I have been working with the new version of Nagios for this Migration that we are doing at work. I have been impressed with what has been happening with the Nagios development. Now I haven’t done a whole lot with Nagios since 2.4 so some of the features that I mention may have been implemented before but I am going to try and check them from he what’s new section of the Docs.
One of the first features that I have noticed is that flap detection actually works. Now I may have not just set it up right before, but I never got it working before. This time around, I haven’t done anything special and I am getting flap notifications. It is nice for when a box is bouncing up and down.
Using Hostgroups as host group members, is one feature I have really used. Since we have duplicates or more of every type of server, It is nice to have them all in one hostgroup. And then instead of name each individual host for the more general host groups, I can just add the hostgroup for each computer type. That saves a lot of work and makes it easy to add new hosts.
This is one that could have been there before but being able to add hosts to hostgroups in the host definition is great. Then I can look at a host and tell it what groups it is part of rather than editing the group definition as well.
The Parallelization of service checks is a great addition to Nagios 3. I know you could do this before with a setting in the config file, but it should have just worked out of the box that way.
As I am reading through the what’s new, I see you can use multiple templates for a host or service. I might have to play with this one.
Performace gains: I have noticed that Nagios is a lot more responsive in the interface. Also, when I schedule a check in the interface it gets done a lot quicker.
The predictive server dependency checks looks like it is a nice feature. It will basically start checking dependencies the moment it gets a soft failure. It will also check all the dependencies up the chain, so you get the alert quicker.
A lot of work has been done on Host Checks. They now run in parallel which speeds up Nagios a lot. They had the predictability mentioned above. They can use active host checks much more effectively. Lot of other little changes have been added too.
Timerperiods have gotten something they needed. The ability to rotate around a set period of time rather than just days of the week. This means you can actually put a pager rotation into timeperiods and schedule admins around it. I haven’t set this up yet but it is one my list to get done after the migration calms down a bit.
Splunk got integrated into the nagios interface. We don’t use splunk, but I am considering it.
There are a bunch of other features that were added to Nagios 3, but those were some that I thought important. It is nice that it doesn’t require a big change for the configs in upgrading to the latest version. There are a few things that were deprecated and a few things that were added, but nothing in terms of major functionality.
I am also working on being able to have a failover server. I am going to follow the documentation for this. I will let you know how it goes once I get it down.