Expertise in big data technologies like Spark, Hadoop, Spark SQL, Azure Data Factory, Databricks.Proficient in Data Warehousing and Data Lake concepts, with hands-on experience in Azure Data Lake Storage and HDFS.Skilled in Git for version control, Intellij for development, and PowerShell for scripting.Thorough understanding and application of DevOps principles, CI/CD pipelines, releases, branching, and merging strategies.Analyzes user requirements, designs, and develops ETL processes for Data Warehouses.Develops, tests, schedules, and orchestrates ETL processes using ETL pipelines and PySpark code.Analyzes data using PySpark and SQL to ensure data quality and identify discrepancies.Manages operational activities which includes monitoring production runs, debugs failures, and rectifie errors across environments.Implements data governance practices, ensures data quality, integrity, security, and privacy standards.Collaborates with data scientists and analysts to understand data requirements and ensure data system performance.Expertise in mathematics, statistical analysis, and building/evaluating predictive models for supervised and unsupervised learning.