Ansible

A versatile orchestration engine that can automate systems and apps. Rather than requiring a custom scripting language or code, it describes automation in simple YAML playbooks and runs them over SSH. It is also agentless, so you can start using it right away and get things done.

     

linux open-source provisioning config-mgmt orchestration python

Assimilation System Management Suite

The Assimilation Suite discovers systems, services, network connections, configuration and dependencies, and IP and MAC addresses. All of this goes into a continually updated graph-based configuration management database (CMDB). The discovered configuration is then compared and scored against best practices, and services and servers are monitored, all with near-zero configuration and in a way that scales to hundreds of thousands of servers. It also provides visualization tools, APIs for sending alerts to humans and other systems, and a variety of canned reports (queries) to aid in securing and managing systems, hooking into ChatOps, and creating plans for triaging your security issues.

     

linux open-source commercial C shell python config CMDB service-discovery monitoring visualization security hardening

Bugsnag

Monitor everything you're building.

     

windows linux osx open-source ruby java python php nodejs errors

Graphite

Graphite is a visualization tool for statistics collected from various sources. You can create customizable reports and dashboards, and it is one of the preferred backends for StatsD.

     

linux open-source monitoring python visualization
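
As an illustration of how one of those sources might hand a statistic to Graphite, here is a minimal sketch that renders a sample in Graphite's plaintext format (metric path, value, Unix timestamp, newline). The metric name, host and helper names are hypothetical, and the port shown is only the usual default for the plaintext listener; check your own deployment before relying on it.

```python
import socket
import time


def format_graphite_line(path, value, timestamp=None):
    """Render one sample as '<path> <value> <timestamp>\\n'."""
    if timestamp is None:
        timestamp = int(time.time())  # default to "now" in whole seconds
    return f"{path} {value} {int(timestamp)}\n"


def send_to_graphite(line, host="localhost", port=2003):
    """Send one plaintext line over TCP (host and port are assumptions)."""
    with socket.create_connection((host, port), timeout=5) as conn:
        conn.sendall(line.encode("ascii"))


# Hypothetical metric name; nothing is sent over the network here.
line = format_graphite_line("servers.web01.cpu.load", 0.42, 1700000000)
print(line, end="")
```

Keeping the formatting separate from the socket call makes the line easy to check without a running Graphite server.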

Herd

A BitTorrent-based file distribution system derived from Twitter’s Murder. It uses BitTorrent to push artifacts and code quickly across thousands of servers.

     

linux windows osx open-source packaging python

InfluxDB

InfluxDB is a time series, metrics, and analytics database. It’s written in Go and has no external dependencies. InfluxDB is targeted at use cases for DevOps, metrics, sensor data, and real-time analytics.

     

linux open-source monitoring go metrics

Poni

A system management and provisioning tool with centralized configuration management.

     

linux open-source config-mgmt provisioning orchestration python

Prometheus

Prometheus is an open-source systems monitoring and alerting toolkit built at SoundCloud.

     

linux open-source go metrics monitoring

Pulp

A platform for managing software package repositories.

     

linux open-source packaging python

Raygun

Raygun keeps a watchful eye on your web and mobile applications and tells you when problems arise.

     

windows linux osx open-source python nodejs java ruby errors

Rollbar

Detect, diagnose and defeat errors.

     

linux windows osx nodejs open-source ruby php java python errors

SaltStack

Salt takes a powerful and different approach to infrastructure management: it focuses on high-speed communication between large numbers of systems, and it can perform orchestration, remote code execution and much more.

     

linux windows osx open-source commercial config-mgmt provisioning orchestration python

Sentry

Automate error tracking to find and fix issues before your users even experience a crash. Sentry is open-source, real-time, and loved by 100,000 developers around the world.

     

osx windows linux open-source errors nodejs ruby python php java monitoring

ShutIt

ShutIt is an automation platform. It makes it easy to create simple, modular automation scripts that handle differing output, install software, log in and out of servers, capture output, etc. ShutItFiles are an extension that makes it even easier for people to develop automation scripts.

     

linux open-source config-mgmt provisioning python

StackStorm

A platform for event-driven automation. StackStorm allows you to integrate and automate across services and tools. It ties together your existing infrastructure and application environment so you can more easily automate that environment, with a particular focus on taking actions in response to events.

     

automation cd provisioning cloud orchestration config-mgmt python linux open-source apache2
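
The idea of "taking actions in response to events" can be sketched in a few lines. This is a generic illustration of event-driven rule matching, not StackStorm's own rule format or API; the `Rule` class, the `dispatch` function, the event fields and the actions are all hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Event = Dict[str, Any]


@dataclass
class Rule:
    name: str
    matches: Callable[[Event], bool]  # criteria deciding whether the rule fires
    action: Callable[[Event], str]    # what to do when it fires


def dispatch(event: Event, rules: List[Rule]) -> List[str]:
    """Run the action of every rule whose criteria match the event."""
    return [rule.action(event) for rule in rules if rule.matches(event)]


# Hypothetical rules; the actions only describe what they would do.
rules = [
    Rule(
        name="restart-on-crash",
        matches=lambda e: e.get("type") == "service.down",
        action=lambda e: f"restart {e['service']}",
    ),
    Rule(
        name="page-on-disk-full",
        matches=lambda e: e.get("type") == "disk.usage" and e.get("percent", 0) >= 90,
        action=lambda e: f"page on-call about {e['host']}",
    ),
]

print(dispatch({"type": "service.down", "service": "nginx"}, rules))
```

In a real deployment the events would come from monitoring or chat integrations and the actions would call out to other tools; here both are plain Python values so the flow stays visible.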

Vaurien

A Chaos Monkey for TCP: it randomly disconnects or stalls TCP connections in your application to test its fault tolerance and high availability, so you can make sure you handle these unexpected events correctly.

     

linux windows osx open-source security python
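
To make "randomly disconnects or stalls" concrete, here is a minimal sketch of how a chaos proxy could choose a fate for each connection. It does not use Vaurien's API or configuration; the function name, the ratios and the behavior labels are assumptions chosen for illustration.

```python
import random


def pick_behavior(rng, disconnect_ratio=0.1, stall_ratio=0.1):
    """Choose what to do with one connection: 'disconnect', 'stall' or 'pass'."""
    roll = rng.random()  # uniform in [0.0, 1.0)
    if roll < disconnect_ratio:
        return "disconnect"
    if roll < disconnect_ratio + stall_ratio:
        return "stall"
    return "pass"


# A seeded generator keeps a test run reproducible.
rng = random.Random(42)
counts = {"disconnect": 0, "stall": 0, "pass": 0}
for _ in range(1000):
    counts[pick_behavior(rng)] += 1
print(counts)
```

Passing the random generator in as an argument lets a test suite replay the same sequence of faults when investigating a failure.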