AmonOne

A modern, self-hosted server monitoring system with system monitoring, process monitoring and notifications

     

linux windows osx open-source monitoring python

Anthracite

Tracks and manages all events that can have business and/or operational impact (deploys, manual changes, outages, press releases, etc) and provides metrics and reports

     

linux windows osx open-source monitoring python

Brooklyn

A Java based deployment tool, focused on both deployment as well as post-deploy lifecycle management. It is aware of network topologies, integrates with many cloud and PaaS APIs, and is designed to handle the entire roll-out

     

linux windows osx open-source provisioning orchestration java

Capistrano

The Ruby equivalent of Fabric, Capistrano is a Ruby framework to configure and run deployment or system administration tasks on local or remote machines. It extends the Rake DSL to define tasks, and can group tasks by Roles.

     

linux osx open-source provisioning orchestration ruby

check_mk

Check_MK is an extension to the Nagios monitoring system that allows creating rule-based configuration using Python and offloading work from the Nagios core to make it scale better, allowing more systems to be monitored from a single Nagios server. It comes with a set of system checks, a mod_python and JavaScript based web user interface, and a module that allows fast access to the Nagios core. On top of Nagios it also adds additional features

     

linux solaris bsd open-source monitoring python

Errbit

An open source, self-hosted error catching tool, which can collect and manage errors from other applications. It is compatible with the Airbrake API and can be used as a drop-in replacement.

     

linux windows osx open-source monitoring ruby

Fabric

Fabric is a python framework to configure deployment or system administration tasks, and then run those tasks on local or remote systems. It uses SSH heavily for interacting with remote systems, and uses Roles to group tasks

     

linux osx open-source orchestration python

FastForward

Flexible operating-system event and metric forwarding agent. Receives input and forwards output using plugins.

     

linux windows osx open-source metrics monitoring ruby

FnordMetric

A HTML5/Javascript API for collecting and visualizing time series data

     

linux windows osx open-source monitoring ruby

Glances

A cross-platform, curses-based monitoring tool written in Python

     

linux windows osx open-source monitoring python

Icinga

Icinga is an enterprise-grade monitoring system. It was forked from Nagios and has added many often new features and patches, including a HTML5 UI, REST interface, dynamic tables, etc. It is also backwards compatible with Nagios, so all the existing plugins can be reused.

     

linux windows osx open-source monitoring python

jmxtrans

Plugs Java VM statistics via JMX to various backend monitoring tools

     

linux windows osx open-source monitoring java

Logstash

A modern tool for managing events and logs, with an ElasticSearch backend which allows a high degree of searching and filtering. It includes a shipper in remote systems, indexer in the aggregator machine, and a Web UI.

     

linux windows osx open-source monitoring java

Logster

Reads log files and generates metrics in Graphite, Ganglia or Amazon CloudWatch

     

linux windows osx open-source monitoring python

MCollective

Marionette Collective (MCollective) is an orchestration framework built on top of Puppet. It can interact with a large number of servers (using a pub-sub middleware) and can execute jobs in parallel

     

linux windows osx free orchestration ruby

Metrics

Captures JVM and application-level metrics for Java

     

linux windows osx open-source monitoring java

Mina

A Ruby Rake-like DSL for fast deployment and automation, similar to Capistrano but faster and lesser overhead, since it creates only a single SSH connection for a deploy

     

linux osx open-source orchestration ruby

Monigusto

A collection of Chef recipes to install various monitoring tools (such as sensu, statsd, etc). Makes it a breeze to get a monitoring infrastructure up and running

     

linux windows osx open-source monitoring ruby

Orc

A model driven orchestration tool for deployment of application clusters

     

linux windows osx open-source provisioning orchestration ruby

Puppet Dashboard

A web interface for Puppet. It can view and analyze Puppet reports, assign Puppet classes and parameters to nodes, view inventory data and backed-up file contents, etc.

     

linux windows osx open-source monitoring ruby

Rundeck

Turns your operational procedures into self-service portal. Define common operational tasks in the Rundeck web portal, and safely hand it over to the rest of the team so that they can perform these tasks without you being around.

     

linux windows osx open-source orchestration java

SaltStack

Salt is a powerful and different approach to infrastructure management, by focusing on high speed communications between large number of systems, and can perform orchestration, remote code execution and much more.

     

linux windows osx open-source commercial config-mgmt provisioning orchestration python

Scales

Metrics for Python, tracks python state and statistics. Inspired by the Metrics library for Java

     

linux windows osx open-source monitoring python

Sentry

Automate error tracking to find and fix issues before your users even experience a crash. Sentry is open-source, real-time, and loved by 100,000 developers around the world.

     

osx windows linux open-source errors nodejs ruby python php java monitoring

Supervisor

Supervisor can manage application processes. It can start and stop a group of processes, restart processes if any of them terminate (keep-alive), start multiple instances of the same process (process pooling), automatically manage PID files, and much more! Very useful if you need to manage multiple processes for your application.

     

linux osx open-source process-mgmt python

Vaurien

A Chaos Monkey for TCP, it randomly disconnects or stalls TCP connections in your application to test application tolerance and high availability. Makes sure you correctly handle these unexpected events

     

linux windows osx open-source security python