Platform Engineer
CurrentResponsibilities:- Design, implement, and maintain multi-cloud Kubernetes infrastructure (GKE, AKS, Rancher RKE, and Anthos for hybrid cloud management).- Collaborate with development and operations teams to define infrastructure requirements and implement Infrastructure as Code (IaC) using Terraform.- Configure and manage infrastructure automation tools, leveraging Ansible for configuration management.- Use GitLab and GitHub for source code management, ensuring version control and proper collaboration.- Optimize cluster costs using solutions provided by public clouds or with Kubecost, monitoring resource utilization and implementing cost-saving measures.- Ensure cluster security through the implementation of OPA with Gatekeeper, Cert-Manager, external-secrets, and HashiCorp Vault for secret management.- Monitor and maintain observability of Kubernetes clusters using Prometheus, Thanos, Grafana, - Alertmanager, and ELK stack, integrating with various tools like Kafka, Logstash, Fluent Bit, Vector, Filebeat, Elasticsearch, etc.- Drive continuous integration and deployment (CI/CD) pipelines using GitHub Actions, Flux v2, and ArgoCD for efficient application deployment and updates.- Manage container registries, including Azure Container Registry, Artifact Registry (GCP), and JFrog, ensuring secure and efficient image storage and distribution.