Senior Customer Reliability Engineer Ii
Current- Led remediation efforts for numerous production incidents, delivering expert guidance and ensuring swift resolution.
- Designed and implemented robust CI/CD pipelines using CircleCI, significantly enhancing deployment efficiency and reliability.
- Developed comprehensive toolkit reports utilizing Dramatiq and Bootstrap, improving reporting capabilities and user experience.
- Proactively led multiple technical projects, including the development of in-cluster monitoring and alerting solutions, and enhanced Grafana dashboards for improved system monitoring.
- Implemented synthetic monitoring solutions for frontend availability, significantly improving proactive issue detection and resolution.
- Spearheaded initiatives across various platforms, applications, and observability tools, with a focus on incident response and pipeline development in the AI/ML domain.