Assimilation System Management Suite
The Assimilation Suite discovers systems, services, network connections, configuration, dependencies, and IP and MAC addresses, and stores them in a continually updated graph-based configuration management database (CMDB). The discovered configuration is compared and scored against best practices, and services and servers are monitored, all with near-zero configuration and in a way that scales to hundreds of thousands of servers. It also provides visualization tools, APIs for sending alerts to humans and other systems, and a variety of canned reports (queries) to aid in securing and managing systems, hooking into ChatOps, and creating plans for triaging your security issues.
linux open-source commercial C shell python config CMDB service-discovery monitoring visualization security hardening
Graphite
Graphite is a visualization tool for statistics collected from various sources. You can create customizable reports and dashboards, and it is one of the preferred backends for StatsD.
linux open-source monitoring python visualization
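As a rough illustration of how statistics reach Graphite, the sketch below formats a metric in Graphite's plaintext protocol (one `path value timestamp` line per metric) and a counter in the StatsD line format (`bucket:value|type`). The function names, the metric paths, and the `localhost:2003` default are assumptions made for this example; adjust them to your own setup.

```python
import socket
import time


def graphite_line(path, value, timestamp=None):
    # Graphite plaintext protocol: "<metric path> <value> <unix timestamp>" plus newline.
    if timestamp is None:
        timestamp = int(time.time())
    return f"{path} {value} {int(timestamp)}\n"


def statsd_line(bucket, value, metric_type="c"):
    # StatsD line format: "<bucket>:<value>|<type>", where "c" marks a counter.
    return f"{bucket}:{value}|{metric_type}"


def send_to_graphite(line, host="localhost", port=2003):
    # Assumes a Carbon plaintext listener on host:port (2003 is the usual default).
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(line.encode("ascii"))
```

Calling `send_to_graphite(graphite_line("servers.web01.load", 0.5))` would push a single data point, provided a Carbon listener is actually reachable at the assumed address.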
Instrumental
Instrumental is a real-time, high-scale application and server monitoring tool with intelligent alerts, a powerful query language, and free development accounts.
linux windows osx free metrics commercial monitoring visualization php ruby python net java nodejs
Supervisord Monitor
Supervisord Monitor is a multi-server monitoring tool for Supervisor: watch, start, and stop all your supervisord instances and services from one place.
linux windows osx open-source monitoring provisioning php process-mgmt
Supervisor
Supervisor can manage application processes. It can start and stop a group of processes, restart processes if any of them terminate (keep-alive), start multiple instances of the same process (process pooling), automatically manage PID files, and much more! Very useful if you need to manage multiple processes for your application.
linux osx open-source process-mgmt python
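Supervisor's start and stop operations are also exposed over its XML-RPC interface. The sketch below is one possible way to start every process that Supervisor reports as stopped or exited; it assumes the `[inet_http_server]` section is enabled, and the helper name `start_stopped_processes` and the URL in the comment are illustrative rather than part of Supervisor itself.

```python
from xmlrpc.client import ServerProxy  # used in the usage comment below


def start_stopped_processes(server):
    # `server` is an XML-RPC proxy exposing Supervisor's `supervisor` namespace.
    started = []
    for info in server.supervisor.getAllProcessInfo():
        if info["statename"] in ("STOPPED", "EXITED"):
            # Supervisor addresses a process as "group:name".
            full_name = f"{info['group']}:{info['name']}"
            server.supervisor.startProcess(full_name)
            started.append(full_name)
    return started


# Usage, assuming Supervisor's HTTP server listens on localhost:9001:
#   server = ServerProxy("http://localhost:9001/RPC2")
#   print(start_stopped_processes(server))
```

Passing the proxy in as an argument keeps the helper easy to exercise against a stand-in object before pointing it at a live supervisord.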
Vaurien
Vaurien is a Chaos Monkey for TCP: it randomly disconnects or stalls TCP connections in your application to test its fault tolerance and high availability, helping make sure you handle these unexpected events correctly.
linux windows osx open-source security python
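To make the idea of randomly disconnecting or stalling a connection concrete, here is a minimal in-process sketch of that kind of fault injection. It is not Vaurien's implementation or API (Vaurien sits between client and server as a proxy); the `FlakyChannel` class and all of its parameters are hypothetical names chosen for this example.

```python
import random
import time


class FlakyChannel:
    """Wrap a send callable and randomly drop or stall calls (illustrative only)."""

    def __init__(self, send, drop_ratio=0.1, stall_ratio=0.1,
                 stall_seconds=1.0, rng=None, sleep=time.sleep):
        self._send = send
        self.drop_ratio = drop_ratio
        self.stall_ratio = stall_ratio
        self.stall_seconds = stall_seconds
        self.rng = rng or random.Random()
        self.sleep = sleep

    def send(self, data):
        roll = self.rng.random()  # uniform in [0.0, 1.0)
        if roll < self.drop_ratio:
            # Simulate the peer disconnecting mid-conversation.
            raise ConnectionResetError("injected disconnect")
        if roll < self.drop_ratio + self.stall_ratio:
            # Simulate a stalled connection before the data goes through.
            self.sleep(self.stall_seconds)
        return self._send(data)
```

Wrapping a client's send path this way in a test lets you check that retries and timeouts behave sensibly; passing a seeded `random.Random` makes a failing run reproducible.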