Alex Caldwell Email and Phone Number
I have over 10 years of experience working in the DevOps space. Recently, I worked as Engineering Manager at PayPal where I lead a cross-functional team of four, focusing on building and maintaining production facing core services such as DNS and Puppet. I’m deeply passionate about building cohesive teams that strive to deliver best-in-class products and services. Prior to being promoted to manager, I worked as a DevOps Engineer on a production operations team, deploying software, installing and configuring observability tools, and continuously improving the reliability and resiliency of our applications while being part of a 24/ 7 on-call rotation. I took ownership of our observability system and updated our version control setup to eliminate code loss. I recently moved to Western North Carolina and I’m passionate about music and photography. I also like meeting new people over a cup of coffee. Feel free to reach out at beatmassa@gmail.com or DM me on Twitter @beatmassa.Competencies: Google Cloud Platform, CI/CD, Infrastructure as Code, AWS, Puppet, Terraform, Site Reliability Engineering, Observability, Vulnerability Remediation, Program Management, Incident Response, Root Cause Analysis.
Self Employed
View-
Currently Seeking New Engineering Manager Opportunity | Site Reliability Engineering, Cloud InfraSelf Employed Mar 2024 - PresentSatellite Beach, Florida, Us -
Engineering Manager | Site Reliability Engineering, Cloud Infrastructure, Vulnerability RemediationPaypal May 2019 - Mar 2024San Jose, Ca, UsManaged the systemic reliability and performance of the company's website and online platform, leading an engineering team to implement and refine operational procedures, manage incidents, and ensure the security and scalability of systems.● Orchestrated the successful transition of on-site DNS infrastructure to Google Cloud, achieving a 60% improvement in system uptime and a 100% increase in scalability, paving the way for future growth.● Introduced LogicMonitor, enhancing virtualized infrastructure monitoring with 50% increased reliability and 80% less configuration work, and created comprehensive escalation rules and notification chains using PagerDuty and Slack.● Advanced engineering standards by conducting regular code reviews, fostering test-driven development, and continuously refining workflows, contributing to a 20% increase in team efficiency and a 35% reduction in technical debt.● Ensured 100% Payment Card Industry (PCI) compliance by implementing stringent procedures and utilizing Splunk reports to prioritize adjustments and collaborate with the Security Ops team, setting a record for the highest number of secure systems in a Business Unit.● Conducted root cause analysis (RCA) on recurring service outages, identified key issues, and implemented resolutions, reducing downtime by 50% and enhancing overall system uptime to 99.9%.● Ensured the successful and timely delivery of projects by leveraging substantial expertise in Agile/Scrum sprint planning, increasing work velocity and throughput by 50%.● Migrated legacy systems to a modern cloud infrastructure and exited the datacenter, resulting in a 50% increase in data processing speed and reducing rental costs by $1,000,000 annually -
Devops Engineer | Production Support, Observability, Site Reliability Engineering, Incident ResponsePaypal Jan 2016 - May 2019San Jose, Ca, UsLed the transformation of operational processes, introducing containerization and automated monitoring, which reduced incident response times by 55% and increased overall software delivery reliability by 30%.● Implemented monitoring solutions and alert systems that reduced incident response times by 60%, ensuring high availability and performance of critical applications.● Delivered 24/7 support for critical production issues through active participation in on-call rotations, significantly enhancing system reliability and contributing to our goal of 99.95% uptime and meeting SLA targets.● Reduced toil and human caused errors by 120% by developing and maintaining infrastructure as code with Terraform, Jenkins, Puppet, Bash, and Ansible, optimizing infrastructure management for heightened efficiency.● Coordinated and performed production releases in collaboration with development teams and stakeholders, ensuring smooth deployments with 0% downtime.● Conducted thorough root cause analyses of production issues, reducing recurring incidents by 45% and improving system resilience with targeted SRE interventions.● Integrated Git into an inherited observability system, developing a streamlined process and conducting training sessions, which led to 0% code loss, enhanced code reviews, and minimized errors from version control issues. -
Devops Engineer | Site Reliability Engineering, Observability, Metrics, Incident ResponseTata Communications Jul 2013 - Jun 2015Mumbai, Maharashtra, InAs a DevOps Engineer in a NOC, I installed and configured Nagios observability tools and created a knowledge base to enhance the efficiency and effectiveness of support engineers.● Imagined and executed a bespoke Nagios live video stream observability system, meticulously overseeing the health of 1,400 video feeds across the global content delivery network, resulting in heightened operational efficiency and reliability.● Defined daily responsibilities encompassing the validation of critical services like Wowza and Varnish-based proxy cache servers, taking servers in and out of production via nodes, and optimizing load-balancing servers to support client-utilized services.● Handpicked to lead all team-based and customer-facing documentation, communicating technical information with rare clarity and precision. -
Web DeveloperLas Olas Technologies, Inc. Oct 2012 - Jun 2013Miami, Fl, UsServed as part of a web development team tasked with handling a large-scale website upgrade.● Engineered robust backup and sync scripts in Bash, ensuring seamless synchronization of staging environments and bolstering their reliability.● Spearheaded the installation and maintenance of third-party scripts and modules, crucial for meeting the site's functional demands, resulting in optimized performance and enhanced user experience.
Alex Caldwell Skills
Alex Caldwell Education Details
-
State University Of New York At OswegoInformation Science
Frequently Asked Questions about Alex Caldwell
What company does Alex Caldwell work for?
Alex Caldwell works for Self Employed
What is Alex Caldwell's role at the current company?
Alex Caldwell's current role is Site Reliability & DevOps Engineering Manager, Cloud Infrastructure, Agile Methodologies, Program Management, Cross-functional Team Leadership.
What schools did Alex Caldwell attend?
Alex Caldwell attended State University Of New York At Oswego.
What skills is Alex Caldwell known for?
Alex Caldwell has skills like Leadership, Writing, Project Management, Entrepreneurship, Business Development, Education, Tutoring, Teacher, Act Prep, Sat.
Free Chrome Extension
Find emails, phones & company data instantly
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial