03 — Operations

Day-to-day operational runbooks for every service in the stack. Start here when you need to restart a container, check logs, rotate a config, or respond to a “service X is down” report.

For alert-driven operations see README. For secret rotation see README.

Monitoring Stack (vps-i1)

DocumentDescription
monitoring-stack-operations.mdDocker Compose stack on vps-i1 — start/stop/reload
monitoring-prometheus-grafana.mdPrometheus + Grafana deployment and config reference
grafana-operations.mdGrafana dashboard management, datasources, image renderer
monitoring-exporters-operations.mdCustom Python exporters (queue, pg-stats, backup, cost, vercel, credential)

Automation Services

DocumentDescription
n8n-operations.mdn8n queue-mode stack on bms-4 — workflows, workers, API
n8n-cloud-operations.mdn8n Cloud instance operations
n8n-postgresql-operations.mdn8n PostgreSQL backend operations
n8n-github-automation.mdn8n GitHub automation workflows
n8n-vercel-webhook-setup.mdVercel webhook integration with n8n
audit-engine-operations.mdFastAPI audit-engine on vps-h1 — deploy, schedule, runs

Communication & Messaging

DocumentDescription
waha-operations.mdWAHA WhatsApp gateway operations
waha-incident-router.mdCloudflare Worker — WAHA webhook → Supabase incident threads
waha-shadow-cutover.mdWAHA cutover playbook
discord-notifications.mdDiscord webhook notification setup
telegram-claude-bot-operations.mdTelegram Claude bot operations
telegram-inspection-bot-operations.mdTelegram vehicle inspection bot

Data & Storage

DocumentDescription
supabase-operations.mdSupabase project operations — migrations, RLS, roles
supabase-slow-query-monitoring.mdpg_stat_statements slow-query monitoring
media-storage.mdMedia/file storage (Wasabi S3) operations

GPS & Fleet

DocumentDescription
traccar-operations.mdTraccar GPS server operations
traccar-api-testing.mdTraccar API test procedures
teltonika-device-setup.mdTeltonika GPS device provisioning
nexcon-sim-operations.mdNexcon SIM card operations

Infrastructure Services

DocumentDescription
vps-i1-operations.mdIONOS VPS day-to-day operations
traefik-operations.mdTraefik reverse proxy (bms-4, vps-h1)
pdf-service-operations.mdGotenberg/pdf-service HTML-to-PDF API
report-scheduler-operations.mdFleet inspection report scheduler
openclaw-operations.mdOpenClaw operations
cloud-services-operations.mdSaaS / cloud service operations overview

AI Agents

DocumentDescription
claude-agent-setup.mdClaude Code agent setup on VPS nodes
claude-proxy-router-operations.mdClaude proxy (vps-h1 :9999) operations
ai-batch-gpu-operations.mdGPU batch processing operations

Developer Portal

DocumentDescription
portal.mdInternal ops portal (Next.js on Vercel)

Cross-references

  • README — observability for all these services
  • README — what to do when something breaks
  • README — automation layer perspective