I'm investigating some (SCOM 2012) alerts on our Exchange 2013 (SP1) environment:
- Malware filtering is taking too long (90th percentile)
- Mailbox Transport Submission is not keeping up with the work...
- Queue Alert: Internal Aggregate Delivery Queue (Normal Priority) exceeds threshold
- The total number of messages in shadow queues exceeds 1500
All of these monitors are based on performance counters. When I check those counters (found via the crimson channel event logs under ../ActiveMonitoring/MonitorDefinition), all affected counters have already dropped below their threshold values, but the associated health sets stay unhealthy.
For example, Get-ServerHealth -Identity '<Server>' -HealthSet '<HealthSet>' still reports Total.Shadow.Queue.Length.Above.Threshold.Monitor as Unhealthy, while perfmon shows values far below 1500 for that particular server.
How can I force these monitors to be re-evaluated?