This article describes how to monitor and troubleshoot Maestrano's infrastructure with New Relic.
1 - Overview
Server monitoring collects metrics about the servers running in the Maestrano environment.
Drilling down into a specific server shows details such as CPU and memory usage, disk and network I/O, and load average.
Nex! installs the New Relic server agent on all racks.
2 - Server alert policies
New Relic provides a set of default server alerts based on CPU and memory usage. Depending on the applications running on the servers, you may want to tune these alerts.
Go to New Relic Dashboard > Servers > Alerts > Server Policies.
Select the policy group you want to edit, or create a new one. These are the recommended settings:
- CPU: send an alert when usage stays above 95 % for 5 minutes
- Disk I/O: send an alert when utilisation stays above 75 % for 10 minutes
- Memory: send an alert when usage stays above 100 % for 10 minutes (this includes swap)
- Fullest disk: send an alert when usage stays above 90 % for 30 minutes
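To make the "threshold for a duration" semantics of the settings above concrete, here is a minimal Python sketch. It is not New Relic's implementation: the `Policy` class, the metric names, and the assumption of one sample per minute are all hypothetical and only illustrate the rule.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    """One alert rule (hypothetical structure, for illustration only)."""
    metric: str
    threshold_pct: float  # alert when the value stays above this percentage
    duration_min: int     # ...for this many consecutive minutes


# The recommended settings from the list above.
POLICIES = [
    Policy("cpu", 95.0, 5),
    Policy("disk_io", 75.0, 10),
    Policy("memory", 100.0, 10),  # includes swap, so it can exceed 100 %
    Policy("fullest_disk", 90.0, 30),
]


def should_alert(samples, policy):
    """Return True if the most recent `duration_min` samples all exceed the threshold.

    `samples` is assumed to hold one percentage reading per minute, oldest first.
    """
    window = samples[-policy.duration_min:]
    if len(window) < policy.duration_min:
        return False  # not enough history yet to cover the full duration
    return all(value > policy.threshold_pct for value in window)
```

For example, `should_alert([96, 97, 98, 99, 96], POLICIES[0])` is true, while a single dip to 94 % inside the last five minutes resets the condition. This is why longer durations make an alert less sensitive to short spikes.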
It is highly recommended to send alerts to the Slack channel #alerts.
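If a custom script ever needs to post a similar message to #alerts, the sketch below shows one possible shape, assuming a Slack incoming webhook has been created for that channel. This is not how New Relic itself delivers its notifications. The function names, the message wording, and the webhook URL are hypothetical placeholders.

```python
import json
import urllib.request


def build_alert_message(server, metric, value_pct, threshold_pct, minutes):
    """Build a Slack incoming-webhook payload (hypothetical message format)."""
    text = (
        f":warning: {server}: {metric} at {value_pct:.0f} % "
        f"(above {threshold_pct:.0f} % for {minutes} min)"
    )
    return {"text": text}


def post_to_slack(webhook_url, message):
    """POST the payload as JSON to the webhook and return the HTTP status code."""
    request = urllib.request.Request(
        webhook_url,
        data=json.dumps(message).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status
```

A call would look like `post_to_slack(WEBHOOK_URL, build_alert_message("rack-1", "CPU", 97.2, 95, 5))`, where `WEBHOOK_URL` is the webhook address created for #alerts and `rack-1` is a made-up server name.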