...
- The core issue seems to be related to VMWare and IT need to provide a solution.
- This incident suggests that a previously logged technical debt issue (POL1-529), which has been considered medium/low priority, could be prioritized for development:
- fixing this issue could generally help with temporary DNS resolution errors, however the DNS issues were secondary in this incident and fixing this issue wouldn't have prevented the overall outage
- while VMWare disk corruption and network dns failures are external events and out of our the control of SWD, a further investigation for potentially improving potential improvement in processing resiliency is described in POL1-607.