My device that was modeled in Spectrum went down, but no alarm was received. What could be the reason for this?

Document ID : KB000112626
Last Modified Date : 31/08/2018
Question:
We rebooted a device that is monitored in CA Spectrum, but we did not receive an alarm in Spectrum for the time that the device was down. What could be the reason for this?
Environment:
Spectrum 10.1
Spectrum 10.2
Spectrum 10.3
Answer:
There could be a few different reasons why an alarm was not raised on the device if it went down or was rebooted.  Some of the reasons include:

1. SpectroSERVER performance problems  --  SNMP communications are backlogged due to SpectroSERVER performance issues.  If you suspect a performance issue please open a case with support so we can analyze with the Perfcollector9 script.

2. Device is not being polled  --  for example, it is a non-polling device (such as a proxy) or it was in maintenance mode at the time.

3. Device had alarms suppressed because the device caused a Trap Storm.  To determine if this is the case:

Query for Trap Storm event (0x10253):

A. Open a bash shell (bash -login)

B. Navigate to the $SPECROOT/mysql/bin directory
cd $SPECROOT/mysql/bin

C. Launch the MySQL prompt
./mysql -uroot -proot ddmdb (Windows)
./mysql --defaults-file=../my-spectrum.cnf -uroot -proot ddmdb (Linux/Solaris)

D. Run the following MySQL query, setting the two timestamps so that they bracket the time the device went down.
SELECT hex(model_h), count(*) as c from ddmdb.event where utime > UNIX_TIMESTAMP('2018-08-31 00:00:01') and utime < UNIX_TIMESTAMP('2018-08-31 23:59:59') and type=66131 group by hex(model_h) order by c desc;

If any devices are listed, check each model handle to determine whether it matches the device that did not alarm.
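The value type=66131 in the query is the decimal form of the Trap Storm event code 0x10253. The following Python sketch illustrates that conversion and shows one way to rebuild the same query for a different time window. The helper names build_trap_storm_query and same_model_handle are hypothetical and are not part of Spectrum. The comparison helper assumes that the model handle is written as 0x... elsewhere, while MySQL HEX() prints it without the prefix.

```python
from datetime import datetime

# Trap Storm event code from step 3; MySQL stores it as a decimal value.
TRAP_STORM_EVENT = 0x10253


def build_trap_storm_query(start: datetime, end: datetime) -> str:
    """Return the step D query for the given window (hypothetical helper)."""
    fmt = "%Y-%m-%d %H:%M:%S"
    return (
        "SELECT hex(model_h), count(*) as c from ddmdb.event "
        f"where utime > UNIX_TIMESTAMP('{start.strftime(fmt)}') "
        f"and utime < UNIX_TIMESTAMP('{end.strftime(fmt)}') "
        f"and type={TRAP_STORM_EVENT} "
        "group by hex(model_h) order by c desc;"
    )


def same_model_handle(spectrum_handle: str, hex_column: str) -> bool:
    """Compare a handle written as 0x... with the HEX() column output."""
    return int(spectrum_handle, 16) == int(hex_column, 16)


query = build_trap_storm_query(
    datetime(2018, 8, 31, 0, 0, 1),
    datetime(2018, 8, 31, 23, 59, 59),
)
print(query)
```

The printed statement can be pasted at the MySQL prompt opened in step C. The model handle passed to same_model_handle would be the one belonging to the device that did not alarm.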

4. Polling Time is longer than reboot time  - For example, if polling is set to 300 seconds (default) and the device reboots in 180 seconds, the whole outage can fall between two polls, so Spectrum may never see the device down. In this case a reboot alarm should still be created, because Spectrum compares SysUpTime (0x10245) and snmpEngineBoots (0x230c52) with the values from the previous poll, and a SysUpTime that is lower than the previous value indicates a restart.
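The timing argument in reason 4 can be illustrated with a small Python sketch. It is a simplified model and not Spectrum's actual polling logic: polls are assumed to occur at fixed multiples of the polling interval, and both function names are hypothetical. Only the SysUpTime comparison is modeled.

```python
def poll_sees_outage(down_at: int, outage: int, poll_interval: int = 300) -> bool:
    """True if at least one poll falls inside the outage window.

    Assumes polls at t = 0, poll_interval, 2 * poll_interval, ... (seconds).
    """
    # First poll at or after the moment the device went down (ceiling division).
    first_poll = -(-down_at // poll_interval) * poll_interval
    return first_poll < down_at + outage


def uptime_indicates_reboot(prev_uptime: int, curr_uptime: int) -> bool:
    """True if SysUpTime went backwards between two polls.

    Values are SNMP TimeTicks (hundredths of a second). Counter wrap after
    roughly 497 days also lowers the value; this sketch ignores that case.
    """
    return curr_uptime < prev_uptime


# Device goes down 10 s after a poll and is back 180 s later: no poll sees it.
print(poll_sees_outage(down_at=10, outage=180))
# Device goes down 200 s after a poll: the poll at t = 300 s hits the outage.
print(poll_sees_outage(down_at=200, outage=180))
# One day of uptime at the previous poll, 120 s at the current poll.
print(uptime_indicates_reboot(prev_uptime=8_640_000, curr_uptime=12_000))
```

In the first case the outage is invisible to polling, which is why the uptime comparison at the next successful poll is the only remaining signal that the device restarted.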