Big Data Engineer with 6+ years of professional IT experience in data modeling, ingestion, processing, ETL, storage, data-driven quantitative analysis, data integration, and resource utilization in the Big Data ecosystem. Experience in project development, implementation, deployment, and maintenance with Hadoop- and Spark-related technologies on Cloudera, Hortonworks, Amazon EMR, and Azure HDInsight. Experienced in data architecture, including data ingestion pipeline design, Hadoop information architecture, data modeling, data mining, machine learning, and advanced data processing. Strong experience in business and data analysis, data profiling, data migration, data integration, data governance, metadata management, master data management, and configuration management. Working knowledge of Azure cloud components (HDInsight, Databricks, Data Lake, Blob Storage, Data Factory, Storage Explorer, SQL DB, SQL DWH, Cosmos DB). Experienced in developing pipelines and datasets in Azure Data Factory for ETL processes that source data from Azure SQL, Blob Storage, and Azure SQL Data Warehouse.
Experience working with AWS services (EMR, EC2, RDS, EBS, S3, Lambda, Glue, Elasticsearch, Kinesis, SQS, DynamoDB, Redshift, API Gateway, Athena, ECS).

TECHNICAL ACUMEN:
Big Data Ecosystem: HDFS, MapReduce, YARN, Spark, Kafka, Airflow, Hive, Impala, StreamSets, Sqoop, HBase, Flume, Pig, Ambari, Oozie, ZooKeeper, NiFi, Sentry, Ranger.
Hadoop Distributions and Cloud: Apache Hadoop, Cloudera CDP, Hortonworks HDP, AWS (EMR, EC2, EBS, RDS, S3, Athena, Glue, Elasticsearch, Lambda, DynamoDB, Redshift, ECS, QuickSight), Azure (HDInsight, Databricks, Data Lake, Blob Storage, Data Factory (ADF), SQL DB, SQL DWH, Cosmos DB, Azure AD).
Programming Languages: Python, Scala, Java, Shell Scripting, Pig Latin, HiveQL.
NoSQL Databases: MongoDB 3.x, HBase, Apache Cassandra, Redis.
Databases: Snowflake, AWS RDS, Teradata, Oracle, MySQL, Microsoft SQL Server, PostgreSQL.
Version Control: Git, SVN, Bitbucket.
ETL/BI: Snowflake, Informatica, SSIS, SSRS, SSAS, Tableau, Matplotlib, Power BI.
Operating Systems: Linux (Ubuntu, CentOS, Red Hat), Windows.
Others: ARM Templates, Terraform, Docker, Kubernetes, Jenkins, Ansible, Splunk, Jira.