VPS Monitoring with Uptime Kuma and Grafana
Set up VPS monitoring with Uptime Kuma and Grafana. Track uptime, CPU, memory, and disk usage with alerts so you never miss downtime.
You can't fix what you can't see. This guide covers setting up comprehensive monitoring for your VPS—from simple uptime checks to full metrics dashboards.
Why This Matters
Without monitoring, you only learn about problems when users complain—or when everything is already broken. Good monitoring means:
- Catch issues before users notice - CPU spike alerts before crashes
- Understand performance trends - Know when to scale
- Debug faster - Historical data shows what changed
- Sleep better - Automated alerts mean you're not constantly checking
- Prove uptime - Show clients their SLA is met
Prerequisites
- A VPS with Docker installed (we recommend Hostinger VPS for reliable performance metrics)
- Basic Docker Compose knowledge
- A domain name (optional, but recommended)
Quick Start: Choose Your Stack
| Tool | Best For | Complexity |
|---|---|---|
| Uptime Kuma | Uptime monitoring, status pages | Easy |
| Grafana + Prometheus | Full metrics, dashboards | Medium |
| Netdata | Real-time system monitoring | Easy |
| Full stack | Production environments | Advanced |
Part 1: Uptime Kuma (Simple & Effective)
Uptime Kuma is a self-hosted monitoring tool that's beautiful and easy to use.
Step 1: Deploy Uptime Kuma
# docker-compose.yml
services:
  uptime-kuma:
    image: louislam/uptime-kuma:1
    container_name: uptime-kuma
    volumes:
      - uptime-kuma-data:/app/data
    ports:
      - "3001:3001"
    restart: unless-stopped

volumes:
  uptime-kuma-data:
docker compose up -d
Access at http://your-server:3001
Step 2: Configure Monitors
After setup, add monitors for:
HTTP(S) Monitoring:
- Your website URLs
- API endpoints
- Admin panels
TCP Port Monitoring:
- Database ports (internal)
- Redis, cache services
Docker Container Monitoring:
- Monitor container health directly (requires mounting the Docker socket; see the snippet after this list)
DNS Monitoring:
- Ensure DNS is resolving correctly
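To use the Docker container monitor type, Uptime Kuma needs access to the Docker socket. A minimal sketch, assuming the default socket path, added to the uptime-kuma service from Step 1:

    volumes:
      - uptime-kuma-data:/app/data
      - /var/run/docker.sock:/var/run/docker.sock

After restarting the container, register the socket as a Docker host in Uptime Kuma's settings, then pick it when creating a Docker monitor.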
Step 3: Set Up Notifications
Uptime Kuma supports 90+ notification services:
- Telegram - Instant mobile alerts
- Discord/Slack - Team channels
- Email - Traditional but reliable
- Pushover - Premium push notifications
- Webhook - Custom integrations
Configure at: Settings → Notifications
Step 4: Create a Status Page
- Go to Status Pages
- Create new page
- Add your monitors
- Share the public URL with users
Part 2: Full Metrics Stack (Prometheus + Grafana)
For comprehensive metrics collection and visualization.
Step 1: Create the Stack
# docker-compose.monitoring.yml
services:
  prometheus:
    image: prom/prometheus:latest
    container_name: prometheus
    volumes:
      - ./prometheus/prometheus.yml:/etc/prometheus/prometheus.yml
      - prometheus_data:/prometheus
    command:
      - '--config.file=/etc/prometheus/prometheus.yml'
      - '--storage.tsdb.path=/prometheus'
      - '--storage.tsdb.retention.time=30d'
    ports:
      - "9090:9090"
    restart: unless-stopped

  grafana:
    image: grafana/grafana:latest
    container_name: grafana
    volumes:
      - grafana_data:/var/lib/grafana
    environment:
      - GF_SECURITY_ADMIN_USER=admin
      - GF_SECURITY_ADMIN_PASSWORD=${GRAFANA_PASSWORD}
      - GF_USERS_ALLOW_SIGN_UP=false
    ports:
      - "3000:3000"
    restart: unless-stopped

  node-exporter:
    image: prom/node-exporter:latest
    container_name: node-exporter
    volumes:
      - /proc:/host/proc:ro
      - /sys:/host/sys:ro
      - /:/rootfs:ro
    command:
      - '--path.procfs=/host/proc'
      - '--path.sysfs=/host/sys'
      - '--path.rootfs=/rootfs'
      - '--collector.filesystem.mount-points-exclude=^/(sys|proc|dev|host|etc)($$|/)'
    ports:
      - "9100:9100"
    restart: unless-stopped

  cadvisor:
    image: gcr.io/cadvisor/cadvisor:latest
    container_name: cadvisor
    volumes:
      - /:/rootfs:ro
      - /var/run:/var/run:ro
      - /sys:/sys:ro
      - /var/lib/docker/:/var/lib/docker:ro
    ports:
      - "8080:8080"
    restart: unless-stopped

volumes:
  prometheus_data:
  grafana_data:
Step 2: Configure Prometheus
# prometheus/prometheus.yml
global:
  scrape_interval: 15s
  evaluation_interval: 15s

alerting:
  alertmanagers:
    - static_configs:
        - targets: []

scrape_configs:
  - job_name: 'prometheus'
    static_configs:
      - targets: ['localhost:9090']

  - job_name: 'node-exporter'
    static_configs:
      - targets: ['node-exporter:9100']

  - job_name: 'cadvisor'
    static_configs:
      - targets: ['cadvisor:8080']

  # Add your applications
  - job_name: 'myapp'
    static_configs:
      - targets: ['myapp:3000']
    metrics_path: '/metrics'
Step 3: Start the Stack
mkdir -p prometheus
# Create prometheus.yml as above
docker compose -f docker-compose.monitoring.yml up -d
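Once the containers are up, a quick way to confirm the exporters are responding (run on the VPS itself; adjust the compose file name if yours differs):

docker compose -f docker-compose.monitoring.yml ps
curl -s http://localhost:9100/metrics | head
curl -s http://localhost:9090/-/healthy

You can also open http://your-server:9090/targets to check that every scrape target shows as UP.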
Step 4: Set Up Grafana Dashboards
Access Grafana at http://your-server:3000 and log in with admin and the password you set in GRAFANA_PASSWORD.
Add Prometheus as a data source:
- Configuration → Data Sources → Add
- Select Prometheus
- URL: http://prometheus:9090
- Save & Test
Import pre-built dashboards:
- Dashboards → Import
- Popular dashboard IDs:
- 1860 - Node Exporter Full
- 893 - Docker and System Monitoring
- 14282 - cAdvisor Dashboard
Step 5: Create Custom Alerts
In Grafana:
- Alerting → Alert Rules → New
- Create conditions based on metrics
- Set notification channels
Example alert: CPU > 80% for 5 minutes
100 - (avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80
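Two more example expressions you can adapt, assuming the default node-exporter metric names:
Memory > 85%:
(1 - (node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes)) * 100 > 85
Root disk > 80% used:
(1 - (node_filesystem_avail_bytes{mountpoint="/"} / node_filesystem_size_bytes{mountpoint="/"})) * 100 > 80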
Part 3: Netdata (Real-Time Monitoring)
For instant, zero-config monitoring:
services:
  netdata:
    image: netdata/netdata
    container_name: netdata
    ports:
      - "19999:19999"
    cap_add:
      - SYS_PTRACE
    security_opt:
      - apparmor:unconfined
    volumes:
      - netdataconfig:/etc/netdata
      - netdatalib:/var/lib/netdata
      - netdatacache:/var/cache/netdata
      - /etc/passwd:/host/etc/passwd:ro
      - /etc/group:/host/etc/group:ro
      - /proc:/host/proc:ro
      - /sys:/host/sys:ro
      - /etc/os-release:/host/etc/os-release:ro
    restart: unless-stopped

volumes:
  netdataconfig:
  netdatalib:
  netdatacache:
Access at http://your-server:19999 - instant beautiful dashboards!
Part 4: Application-Level Monitoring
Add Metrics to Your Apps
Node.js with prom-client:
const client = require('prom-client');
const express = require('express');

const app = express();

// Collect default metrics (CPU, memory, event loop lag, etc.)
client.collectDefaultMetrics();

// Custom metrics
const httpRequestsTotal = new client.Counter({
  name: 'http_requests_total',
  help: 'Total HTTP requests',
  labelNames: ['method', 'path', 'status']
});

// Count every request once the response has been sent
app.use((req, res, next) => {
  res.on('finish', () => {
    httpRequestsTotal.inc({
      method: req.method,
      path: req.route?.path || req.path,
      status: res.statusCode
    });
  });
  next();
});

// Expose a metrics endpoint for Prometheus to scrape
app.get('/metrics', async (req, res) => {
  res.set('Content-Type', client.register.contentType);
  res.end(await client.register.metrics());
});
Python with prometheus-client:
from flask import Flask, Response, request
from prometheus_client import Counter, Histogram, generate_latest

app = Flask(__name__)

REQUEST_COUNT = Counter('requests_total', 'Total requests', ['method', 'endpoint'])
REQUEST_LATENCY = Histogram('request_latency_seconds', 'Request latency')

# Count each request after the response is built
@app.after_request
def record_request(response):
    REQUEST_COUNT.labels(method=request.method, endpoint=request.path).inc()
    return response

@app.route('/metrics')
def metrics():
    return Response(generate_latest(), mimetype='text/plain')
Part 5: Log Monitoring with Loki
Add centralized logging:
services:
  loki:
    image: grafana/loki:latest
    ports:
      - "3100:3100"
    command: -config.file=/etc/loki/local-config.yaml
    volumes:
      - loki_data:/loki
    restart: unless-stopped

  promtail:
    image: grafana/promtail:latest
    volumes:
      - /var/log:/var/log:ro
      - /var/lib/docker/containers:/var/lib/docker/containers:ro  # needed for the docker job below
      - ./promtail-config.yml:/etc/promtail/config.yml
    command: -config.file=/etc/promtail/config.yml
    restart: unless-stopped

volumes:
  loki_data:
Promtail config:
# promtail-config.yml
server:
  http_listen_port: 9080

positions:
  filename: /tmp/positions.yaml

clients:
  - url: http://loki:3100/loki/api/v1/push

scrape_configs:
  - job_name: system
    static_configs:
      - targets:
          - localhost
        labels:
          job: varlogs
          __path__: /var/log/*log

  - job_name: docker
    static_configs:
      - targets:
          - localhost
        labels:
          job: docker
          __path__: /var/lib/docker/containers/*/*log
Add Loki as a Grafana data source and query logs alongside metrics!
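Once Loki is connected, you can query logs with LogQL in Grafana's Explore view. For example, to pull error lines from the system logs scraped above (the job label matches the one defined in the Promtail config):

{job="varlogs"} |= "error"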
Recommended Monitoring Setup
For most VPS deployments:
# docker-compose.monitoring.yml - Complete recommended stack
services:
  uptime-kuma:
    image: louislam/uptime-kuma:1
    volumes:
      - uptime-kuma-data:/app/data
    ports:
      - "3001:3001"
    restart: unless-stopped

  grafana:
    image: grafana/grafana:latest
    volumes:
      - grafana_data:/var/lib/grafana
    environment:
      - GF_SECURITY_ADMIN_PASSWORD=${GRAFANA_PASSWORD}
    ports:
      - "3000:3000"
    restart: unless-stopped

  prometheus:
    image: prom/prometheus:latest
    volumes:
      - ./prometheus.yml:/etc/prometheus/prometheus.yml
      - prometheus_data:/prometheus
    command:
      - '--config.file=/etc/prometheus/prometheus.yml'
      - '--storage.tsdb.retention.time=15d'
    restart: unless-stopped

  node-exporter:
    image: prom/node-exporter:latest
    volumes:
      - /proc:/host/proc:ro
      - /sys:/host/sys:ro
      - /:/rootfs:ro
    command:
      - '--path.procfs=/host/proc'
      - '--path.sysfs=/host/sys'
      - '--path.rootfs=/rootfs'
    restart: unless-stopped

volumes:
  uptime-kuma-data:
  grafana_data:
  prometheus_data:
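The Prometheus service above expects a prometheus.yml in the same directory. A minimal config for this stack (there is no cAdvisor here, so only Prometheus itself and node-exporter are scraped) might look like:

# prometheus.yml
global:
  scrape_interval: 15s
scrape_configs:
  - job_name: 'prometheus'
    static_configs:
      - targets: ['localhost:9090']
  - job_name: 'node-exporter'
    static_configs:
      - targets: ['node-exporter:9100']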
Best Practices
- Monitor from outside - External checks catch network issues
- Set reasonable thresholds - Avoid alert fatigue
- Layer your monitoring - Uptime + metrics + logs
- Retain data appropriately - 15-30 days for metrics, longer for aggregates
- Document runbooks - What to do when alert X fires
- Test your alerts - Ensure they actually reach you
- Monitor the monitors - Use external service to watch your monitoring
- Secure your dashboards - Metrics can reveal sensitive info
Common Mistakes to Avoid
❌ Too many alerts - Alert fatigue means ignoring real issues
❌ No external monitoring - If your server is down, so is your monitoring
❌ Exposing metrics publicly - Use authentication, internal networks, or localhost-only port bindings (see the snippet after this list)
❌ Not setting retention - Disk fills up with old data
❌ Monitoring without acting - Dashboards don't fix problems
❌ Single notification channel - Email is down? No alerts
❌ No baseline - You need to know what "normal" looks like
❌ Over-monitoring - Start simple, add complexity as needed
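One low-effort way to keep dashboards and exporters off the public internet is to bind their published ports to localhost and reach them through an SSH tunnel or an authenticated reverse proxy. A sketch for the Grafana service (the same pattern works for Prometheus, cAdvisor, and node-exporter):

    ports:
      - "127.0.0.1:3000:3000"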
Key Metrics to Watch
System Metrics
- CPU usage - Alert at 80%+ sustained
- Memory usage - Alert at 85%+
- Disk usage - Alert at 80%+ (disks fill fast)
- Disk I/O - High wait indicates storage bottleneck
- Network throughput - Baseline and anomaly detection
Application Metrics
- Response time - p50, p95, p99 latencies (example queries after this list)
- Error rate - Percentage of 5xx responses
- Request rate - Traffic patterns
- Active connections - Database pool usage
- Queue depth - Background job backlogs
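As a starting point for the first two items, here are example PromQL queries. The error-rate query uses the http_requests_total counter from Part 4; the latency query assumes you also export a request-duration histogram, shown here as a hypothetical http_request_duration_seconds:
5xx error rate (%):
sum(rate(http_requests_total{status=~"5.."}[5m])) / sum(rate(http_requests_total[5m])) * 100
p95 latency:
histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[5m])) by (le))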
Business Metrics
- Signups/purchases - Drop indicates problems
- Active users - Engagement health
- Revenue - The ultimate metric
FAQ
How many resources does monitoring use?
Minimal. Uptime Kuma: ~100MB RAM. Full Prometheus/Grafana stack: ~500MB-1GB. Worth it for the visibility.
Should I use cloud monitoring or self-hosted?
Self-hosted for cost control and data ownership. Cloud services (Datadog, New Relic) if you have the budget and want a managed solution. Hostinger VPS has enough resources for self-hosted monitoring.
How do I monitor from outside my network?
Use external services like:
- Uptime Robot (free tier)
- Pingdom
- Better Uptime
- StatusCake
These catch issues your self-hosted monitoring can't see.
What alert thresholds should I set?
Start conservative, adjust based on experience:
- CPU: 80% for 5 minutes
- Memory: 85%
- Disk: 80%
- Response time: 2x your baseline p95
How long should I retain metrics?
- High-resolution (15s): 7-15 days
- Aggregated (5m): 30-90 days
- Further aggregated: 1-2 years
Balance detail vs storage costs.
Can I monitor multiple servers?
Yes! Prometheus + Node Exporter scale well. Just add new targets to your scrape config and you can monitor 100+ servers from one dashboard.
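For example, the scrape config for several servers could look like this (hostnames are placeholders; each remote server runs its own node-exporter on port 9100):

  - job_name: 'node-exporter'
    static_configs:
      - targets:
          - 'node-exporter:9100'
          - 'vps-2.example.com:9100'
          - 'vps-3.example.com:9100'

Remember to firewall or tunnel port 9100 on the remote servers so their metrics aren't publicly reachable.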
Your monitoring stack is ready! Combine this with our backup guide and security guide for a production-ready VPS.
Ready to get started?
Get the best VPS hosting deal today. Hostinger offers 4GB RAM VPS starting at just $4.99/mo.
Get Hostinger VPS — $4.99/mo // up to 75% off + free domain included
fordnox
Expert VPS reviews and hosting guides. We test every provider we recommend.
// last updated: February 6, 2026. Disclosure: This article may contain affiliate links.