Machine Learning Engineer
Current- Developed advanced credit risk and fraud detection models using Spark MLlib, decision trees, gradient boosting, and ensemble methods, reducing default rates by 15% and false positives by 25%. Optimized data retrieval.
- Engineered a high-performance ETL pipeline with Airflow, AWS SFTP, S3, and PySpark, improving data ingestion speed by 20%. Deployed using AWS CodePipeline for continuous integration and delivery, ensuring efficient and.
- Conducted large-scale A/B tests on financial product offerings and constructed time-series forecasting models using PCA, ARIMA, and LSTM on Spark clusters, yielding a 12% increase in customer engagement and a 15%.
- Designed and deployed a star-schema data warehouse on AWS Redshift for customer feedback analytics, employing AWS DMS for CDC with SCD Type-2. Deployed with AWS CloudFormation for efficient processing.
- Integrated XGBoost, TensorFlow, and Spark ML pipelines for loan default risk prediction, executing distributed training on Snowflake. Utilized Power BI for actionable insights and integrated MLflow for model tracking.