Devops/Sre
Current- Responsible for setting up IT infrastructure, its monitoring, and maintenance on AWS Cloud using EKS Managed Kubernetes
- Managing Infrastructure using Terraform, Jenkin (CI ) and ArgoCd ( Continous Delivery)
- Ensure the availability and reliability of distributed systems.
- Implement and troubleshoot using observability tools like Datadog, Prometheus, Grafana
- Drive availability and reliability by defining and implementing SLI, SLO, error budget, Observability, Disaster… Show more
- Drive availability and reliability by defining and implementing SLI, SLO, error budget, Observability, Disaster recovery, and backup to detect and mitigate issue