Thanks for your reply, Josh. Here are my responses.
The fact that all 5 died about the same time is strange. Are all of them taken on & offline as a group?
No, they all run independently.
If they are all taken on & offline as a group, and there is a constant leak, that might make sense.
Do all servers get the same traffic? Or are they in a load balanced configuration?
Each has its own configuration; they are not load balanced.
What kind of monitoring system do you use? What is its poll rate?
The system monitors response times and up/down state, and sends alerts. The poll rate is every 60 seconds. Monitoring works perfectly and correctly identified the issue across all servers at the time they became unavailable.
Very strange one, isn't it! I thought it was a prime candidate for discussion on Thwack!
Any further thoughts appreciated. My main focus is the huge memory allocation by the serv-u.exe process on all servers, which ultimately caused them all to crash.
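
In case it helps narrow things down, here is a rough sketch of how the memory use of serv-u.exe could be sampled on each server to see whether it climbs steadily (a leak) or jumps all at once. It is only a sketch under assumptions: it relies on the standard Windows tasklist command with CSV output, the function names parse_tasklist_csv and sample_memory are made up for illustration, and the memory column is parsed by stripping non-digits because its formatting varies by locale.

```python
import csv
import io
import re
import subprocess


def parse_tasklist_csv(output):
    """Parse `tasklist /FO CSV /NH` output into (image_name, pid, mem_kb) tuples.

    Hypothetical helper: lines that do not look like a process row
    (blank lines, "INFO: No tasks..." messages) are skipped.
    """
    rows = []
    for fields in csv.reader(io.StringIO(output)):
        if len(fields) < 5:
            continue
        # The memory column looks like "1,845,220 K"; separators depend on locale.
        digits = re.sub(r"\D", "", fields[4])
        if not fields[1].isdigit() or not digits:
            continue
        rows.append((fields[0], int(fields[1]), int(digits)))
    return rows


def sample_memory(image_name="serv-u.exe"):
    """Run tasklist (Windows only) and return memory samples for one image name."""
    result = subprocess.run(
        ["tasklist", "/FI", f"IMAGENAME eq {image_name}", "/FO", "CSV", "/NH"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_tasklist_csv(result.stdout)
```

Calling sample_memory() from a scheduled task at the same 60 second interval as the monitoring poll, and writing the results to a log with a timestamp, would give a growth curve per server that could be compared across all five.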