• Data Engineer with 4+ years of experience across the data pipeline, from acquiring and validating large datasets (structured and unstructured) to building data models, developing reports, and utilizing visualization tools for impactful insights.• Orchestrated complex data pipelines using DAGs (Directed Acyclic Graphs) in Airflow to automate data ingestion, transformation, and loading (ETL) processes.• Excellent knowledge of AWS and Azure cloud services, including EC2, S3, RDS, Lambda, Glue, Athena, AWS Pipeline, Redshift, Azure DevOps, Azure Data Lake, Azure Data Factory, Azure Databricks with expertise in infrastructure management, storage, data warehousing, serverless computing, and automated deployment.• Experience in building Spark applications (Python/PySpark) for large-scale data processing and improved processing speedcompared to traditional methods.• Ability to maintain the entire data pipeline infrastructure (Kafka, Snowflake, MongoDB) to ensure high availability and real-time data processing for fraud detection.