Facilitates a quick response, but only AFTER an incident occurs.
Prevents incidents and reduces their duration and impact.
Is my application (or service) running?
How efficiently is my application (or service) running?
Passively consume data and metrics about your system.
Actively explore and understand your environment.
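
The two questions above ("Is it running?" and "How efficiently is it running?") can be contrasted in a small sketch. This is only an illustration under stated assumptions: `is_running`, `latency_profile`, and `fake_service` are hypothetical names, and `fake_service` stands in for a real request to your application.

```python
import statistics
import time
from typing import Callable


def is_running(probe: Callable[[], None]) -> bool:
    """Up/down check: True if the probe completes without raising."""
    try:
        probe()
        return True
    except Exception:
        return False


def latency_profile(probe: Callable[[], None], samples: int = 20) -> dict:
    """Time repeated probe calls and summarize latency in milliseconds."""
    timings_ms = []
    for _ in range(samples):
        start = time.perf_counter()
        probe()
        timings_ms.append((time.perf_counter() - start) * 1000.0)
    return {
        "median_ms": statistics.median(timings_ms),
        "max_ms": max(timings_ms),
        "samples": len(timings_ms),
    }


def fake_service() -> None:
    # Hypothetical stand-in for a real call to the application.
    time.sleep(0.001)


if __name__ == "__main__":
    print("running:", is_running(fake_service))
    print("latency:", latency_profile(fake_service, samples=5))
```

The first function only answers yes or no. The second returns a distribution you can explore further, for example by comparing the median against the maximum to spot outliers.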