Principal Software Engineer
Current
I work in the areas of RAS (Reliability, Availability, Serviceability) and Fault Management. I design and develop software that collects error telemetry from various parts of a running computer system and analyzes that data in real time to determine which piece of hardware or software is broken.