Data Engineer
Current- Designed and implemented scalable datapipelines using Python, Apache Spark, and AWSservices (Redshift, S3, Lambda)
- Developed automated data quality checksreducing data errors by 75% and ensuring 99.9%data accuracy
- Created real-time data streaming solutions usingApache Kafka and Airflow for mission-criticalapplications
- Optimized existing data warehouse architecture,reducing query response time by 60%
- Collaborated with cross-functional teams togather requirements and deliver data solutions forBI and ML projects
- Worked with different stakeholders.Implemented data governance protocols anddocumentation standards ensuring GDPRcompliance