Software Engineering Manager
Current

I am an Engineering Manager at Snap, where I lead the CloudOps group, which spans three engineering domains: Reliability Engineering, Observability Platform, & FinOps. I also lead the partnerships with GCP & AWS for these domains to improve cloud reliability and optimization, leadership alignment, cross-functional development, and beta feature pilots. These partner relationships represent billions of dollars in investments. I create and execute my team’s roadmap to exceed Snap’s performance and cost goals.

The Observability team I lead runs a Tier 0 platform that ingests, optimizes, and vends Snap’s operational telemetry for application clients and backend systems. We provide critical metrics to thousands of operators and run one of the world’s largest OSS metrics platforms.

The Reliability Engineering team I lead builds tooling, measurements, best-practice automation, and solutions that standardize resiliency and reduce toil for Snap’s multi-cloud, global-scale infrastructure. In addition, I lead Snap’s weekly critical Service Operations Program (SOP), which addresses ambiguous and chronic reliability issues. I developed this forum to drive system-wide resiliency across all Tier 0 & 1 services.

The FinOps team I lead manages a custom cost pipeline that accurately forecasts and attributes infrastructure and vendor costs. It is the source of truth for business decisions. We build tools that identify and prevent cost regressions and empower teams to make strategic cost decisions.