Site Reliability Engineer
Abuja Electricity distribution company
Feb 2020 - Dec 2023 (3 years 11 months)
• Deployed and managed Kubernetes clusters with full observability using Prometheus, Grafana, and Loki.
• Designed and managed Kubernetes clusters with observability (Prometheus, Grafana, Loki).
• Deployed and maintained production-grade applications with CI/CD automation and infrastructure monitoring.
• Installed and configured Prometheus, Loki, and Grafana dashboards for real-time infrastructure and application monitoring.
• Implemented and managed MySQL clusters across multiple data centers.
• Automated provisioning and configuration workflows using Infrastructure as Code principles.
• Conducted root cause analysis and authored postmortems following critical incidents.
• Provided support for connectivity issues to various W