Vice President, Site Reliability Engineering
- I lead a large team of brilliant engineers responsible for the architectural scalability of the company, including multiple complicated machine learning pipelines. We are responsible for the design, implementation, and.
- Self-managed Kubernetes, including custom controller work and upstream contributions
- A featureful and performant CI/CD pipeline, providing a reliable GitOps interface and fast easy rollbacks
- Monitoring via Prometheus
- Stats via Graphite
- High volume, reliable log transport and aggregation via Kafka