A curated list of Site Reliability and Production Engineering Tools
-
Updated
Sep 1, 2025
A curated list of Site Reliability and Production Engineering Tools
A Simple Monitoring Dashboard for Docker Swarm Cluster
Dashboard for Docker Swarm Cluster
📊 Analyze and monitor Microsoft Intune Management Extension logs on Windows for real-time insights and error detection.
Advanced stealth web data collection framework for security
Utility to test and wipe hard disks and SSDs
Identify unused resources at Google Cloud Platform through Prometheus' metrics
A collection of scripts that extend EventSentry's functionality.
Network-Based Intrusion Detection System - dev/deploy-ment
Command line client for interacting with checkson.io
Wazuh integration to send alerts to Keep (open-source alert management and AIOps platform)
My Artificial Intelligence Log Sentinel for Postfix and beyond...
🖥️ Monitor RAM and CPU usage in Proxmox for hosts, LXC, and QEMU/KVM VMs with clear visuals and detailed metrics for better resource management.
🌐 Explore VandCloud, a cross-platform app to browse, test, and monitor APIs and services with real-time status updates.
Real-time log file monitoring with pattern highlighting and desktop notifications. Cross-platform Rust CLI tool with regex matching, file rotation support, and desktop notifications.
🔧 Curated Python scripts for MySQL administration: simplify tasks like security, backups, and performance tuning. Ready-to-use solutions for DBAs/devs to automate workflows and enforce best practices. Open-source & contributions welcome!
Real-time API Health Monitoring Dashboard | Python Flask + Postman Integration | Monitor multiple APIs with live status updates, performance metrics & automated testing
A utility for executing Sensu Go checks on systems that cannot run the Sensu Go Agent
Add a description, image, and links to the monitoring-tools topic page so that developers can more easily learn about it.
To associate your repository with the monitoring-tools topic, visit your repo's landing page and select "manage topics."