With over 9 years of experience in site reliability engineering, business continuity, and disaster recovery, I am a seasoned leader who can ensure the smooth and secure operation of complex and high-volume online platforms. I currently manage the site reliability engineering team at Pangea Money Transfer, a global fintech company that provides fast and low-cost money transfers across 30 countries.In my role, I oversee the development and implementation of effective processes and tools for holiday readiness, disaster recovery, and incident management. I lead an incident response team, establish formal policies and procedures, and drive high uptime and site reliability through preventative tooling and service risk assessment. I have a strong expertise in monitoring, alarming, runbooks, ticketing, severity definitions, and post-mortems. I am also a certified incident responder who can handle complex and critical situations with agility and professionalism. I am passionate about optimizing production operations and ensuring business continuity through ongoing risk assessments and threat analysis. My goal is to deliver reliable, secure, and scalable solutions that meet the needs and expectations of our customers and stakeholders.
Listed skills include Troubleshooting, Linux, Active Directory, Unix, and 34 others.