Software Engineer
Current- Built a containerized service using Python (FastAPI) for network usage measurement, deployed with Docker and Terraform on AWS ECR/ECS, integrating auto-scaling and failover mechanisms via Route 53 for resilience.
- Established a comprehensive monitoring stack with Prometheus, Datadog, Grafana, CloudWatch, and PagerDuty, boosting metrics tracking efficiency and incident resolution speeds.
- Designed proxy probe-based systems for real-time network metrics collection, enhancing Cisco OpenDNS server monitoring and enabling effective failover and capacity planning.
- Developed a React-based UI for visualizing network routes, empowering customers with intuitive troubleshooting and performance optimization tools.