Your management card may send you the following alerts:
“System: Warmstart” (or, in AOS v5.1.5 or higher, “System: Network Interface restarted”) and “System: Coldstart” (or, in AOS v5.1.5 or higher, “System: Network Interface Coldstarted”).
These alerts are sent when the Network Management Card restarts. The alerts don’t necessarily indicate a problem, and, when they do, they only affect the Network Management Card’s interface. Your UPS load is unaffected.
A “System: Coldstart” alert means that the Network Management Card (NMC) has just been powered; this may happen if the device powering the Network Management Card suffers an interruption of power.
A “System: Warmstart” alert means that the Network Management Card (NMC) has restarted without losing power. This may happen for multiple reasons:
- The default gateway is wrong or the network traffic is too heavy and the gateway can not be reached.
- After a new AOS or Application firmware upgrade has been uploaded to the NMC.
- Modification of some NMC settings.
- The Reset button on the front panel of the NMC is pressed.
- Web Interface Reboot request
- Network settings have changed – At least one of the TCP/IP settings changed.
- A request to restart the current SNMP agent service was received.
- An internal request to load and execute a new SNMP agent service was received.
- A request to clear the NMC’s network settings and restart the SNMP agent service was received.
- Smart-UPS Output Voltage Change
- Remote Monitoring Service (RMS) communication has been lost (NMC2 only)
- An internal firmware error was detected by the NMC and to clear the error, the NMC firmware explicitly reboots itself as a failsafe.
- An undetected firmware error occurred and the hardware watchdog reboots the NMC to clear the error.
What you can do:
You should download all available event logs for your product: event.txt, data.txt, and config.ini for NMC1 and NMC2, as well as debug.txt and dump.txt for NMC2 only.
- Review the event.txt file to see if any of the causes listed above could be why your Network Management Card has restarted or coldstarted.
- Is this affecting more than one Network Management Card in your environment? This may point to a network traffic issue, causing the Management Card to reboot due to the watchdog mechanism outlined above.
- Note the frequency of the events in question. Can you pinpoint it to a certain time/certain set of events before and after? If the restarts are always at the same intervals, this may relate to a network traffic issue.
- Depending on what you find, try rebooting your card’s interface or resetting the card to defaults (after backing up your configuration and obtaining the aforementioned log files). See if the issue persists.
The following Network Management Cards may generate these alerts:
- Web/SNMP Card – AP9606
Which is embedded in, among others: APC Environmental Monitoring Unit 1 (AP9312TH)
Which are embedded in, among others: Metered/Switched Rack PDUs (APC AP78XX, AP79XX), Rack Automatic Transfer Switches (APC AP77XX), Environmental Monitoring Units (APC AP9320, AP9340, NetBotz 200)
Which are embedded in, among others: APC 2G Metered/Switched Rack PDUs (AP84XX, AP86XX, AP88XX, AP89XX), and some audio/video network management enabled products.
Here is the link to the original support document from APC:
Why does my management card warm start and cold start
Leave a Reply