Senior Site Reliability Engineer
Current- Wrote, troubleshot, & maintained sophisticated Azure infrastructure using prebuilt, internally-maintained Terraform Modules. Modules included Azure Kubernetes, Service Bus, Log Analytics, Azure Container Registry, Cost.
- Utilized Terraform-bootstrapped ArgoCD instance within AKS cluster to deploy Helm chart for application
- Configured Horizontal Pod Autoscalers based on production traffic patterns to minimize cost and efficiently utilize resources in-cluster
- Spearheaded a 6-month project to migrate complex Argo Workflows & Argo Events Batch Processing system from one data center to another, allowing automated billing of client batch requests
- Reverse engineered existing architecture of nested Helm charts for replication & created detailed documentation using MermaidJS for architecture and ongoing support
- Transitioned ownership of the system to another team while training & mentoring team leads to become SMEs