Data Analyst
CurrentData Engineering Team - Designed a scalable big data ETL pipeline for ingesting various datasets generated by banking divisions of Goldman Sachs, with strict requirements of data validation as required by Compliance Surveillance. - Implementing the ingestion pipeline using Python and Apache Spark with various modules like Enrichment from reference data and.