OS: CentOS 5.5 64-bit
Software: Postgresql
Hardware: Sun X4200; dual-core AMD Opteron 1Ghz (x4); 8GB RAM; LSILogic raid controller + 2x146GB 10k drives.
Running net-snmp and using Traverse to monitor.
Seeing constant 2,000+ system interrupts per second. Traverse flags this as "critical" (default config) .... Is this number truly something to be concerned about?
looking at high number results for interrupts:
[~]# cat /proc/interrupts
CPU0 CPU1 CPU2 CPU3
14: 136 54655160 2332995 722234 IO-APIC-edge ide0
66: 618 329180300 20802132 172490 IO-APIC-level ohci_hcd:usb2
74: 4949 16107320 2295957 846017 IO-APIC-level ioc0
82: 22 662837259 233 129090405 IO-APIC-level eth0
90: 723 505860358 0 18967685 IO-APIC-level eth2
NMI: 187529 250006 100435 166795
LOC: 2140313519 2140313343 2140313287 2140313203
ERR: 0
MIS: 0
An additional question about the above output: Why do ide0 and usb2 show constant accrual of interrupts, even though there is no USB device connected, and the IDE device (CDROM) is not in use? This question is for my own curiosity.
LOC
interrupts running at 1000 Hz are normal with those kernels — there is no dynamic tick support in that kernel version, therefore the timer interrupt is running constantly. Other interrupts are probably normal too, if there is high network and disk load on the system.The most suspicious is the
ohci_hcd:usb2
interrupt — maybe some USB device is (or was) misbehaving (or just heavily used, then it's normal).I see them too, so I'm thinking not:
http://www.teaparty.net/munin/net/teaparty.net-irqstats.html
(Hardware was completely replaced last June, hence the sudden rise)
This serverfault article and this offsite article that it references are also thought-provoking.