Mesos on Azure fails catastrophically

Expected Behavior

None

Actual Behavior

None

Steps to Reproduce

None

Environment

None

Description

We've now seen this few times on dev env. the latest incident was on 21/03 when the whole dev env went down. It took more than 7 hours to recover.

We need to figure out why this happens and how to prevent it happening in prod.

Assignee

Antero Karppinen

Reporter

Sami Siren

More details from

None

Priority

High

Recurrence

None

User Agent

None

URL

None

Components

None

Story Points

None

Labels

None
Configure