Data Platform - Site Reliability Engineer (Sre)
CurrentDeployment, configuration and maintenance of the distributed data systems that comprise Wikimedia data platform. The stack includes Hadoop, Kafka, Spark, Cassandra, Presto, Druid, Airflow, Superset, DataHub, Turnilo.Monitoring of systems and services, optimization of performance and resource utilizationCookbook/runbook implementation for common maintenance.