Senior Site Reliability Engineer
Current- Centralize all monitoring into a single cluster; setup mimir, loki, alertmanager, grafana-agent, grafana
- Setup alerting pipeline into OpsGenie, documentation & onboarding of users to receive alerts
- Centralize our helm charts, added unittest support, built & maintained many charts
- Setup ArgoCD in an effort to move away from GitLab Actions towards GitOps
- Setup Dex, integrated into all Operations apps, defined standards for RBAC moving forward
- Refactored our… Show more