Data Engineer
Current- Developed and optimized architecture to track Daily Active User metrics across Chartboost monetization and mediation by leveraging Kafka and Apache BEAM for over 50 million users
- Enhanced Data Governance utility using Python, google dataplex and bigquery to implement GDPR deletions and anonymization
- Architected solutions to consolidate cross-product data to streamline company revenue whileoptimizing BigQuery slot usage by ~66%
- Automated PR review process using shell scripts and Github actions and integrated the same as CI/CD ( ~80% processes)
- Scaling, orchestration and real-time monitoring of stream/batch (~12.00k/s) data pipelines (>100TB) utilizing Airflow, Datadog and Dataflow