# Cloud Data Engineering: Hands-on experience with AWS Glue, Athena, Aurora, Azure Data Lakehouse, Datawarehouse, and Delta Lake, Databricks. # Big Data Expertise: Proficient in Spark Core, Spark SQL, PySpark, Structured Streaming, Scala, HDFS, Hive, SQOOP, and Impala. # Testing Skills: Experience in Spark testing using Dataframebasesuite, Funsuite, HOLDENKARAU tools, and Python code quality checks using Pylint. # Scripting and Automation: Skilled in shell scripting, Python scripting, Git, Jenkins, and CI/CD pipelines. # Machine Learning Applications: Worked on sentiment analysis, statistical testing, and accuracy prediction using Random Forest classifiers. # ML Tools and Libraries: Proficient in Python, NumPy, Pandas, Matplotlib, Scipy, Scikit-learn, and Seaborn. # Azure Expertise: Hands-on experience with Azure Databricks, Delta Lake, and other Azure data engineering tools. # Aws Expertise: Hands-on experience with AWS Glue, AWS Lambda, AWS S3, Aurora DB, Cloudwatch, IAM
Listed skills include Python, Data Structures, Java, Apache Spark, and 13 others.