Data Engineer
Abbott Park, Illinois, US
- Extract Transform and Load data from Sources Systems to Azure Data Storage services using a combination of Azure Data Factory, T-SQL, Spark SQL, and Azure Data Lake Analytics. Data Ingestion to one or more Azure.
- Designed and Implemented cloud-based solutions in Azure by creating Azure SQL database, setting up Elastic pool jobs and designing tabular models in Azure analysis services. Have extensive knowledge in creating.
- Develop modern data solutions by analyzing, designing, and constructing them with the Azure PaaS service.
- Used various GCP components such as Dataflow with python SDK, DataProc, BigQuery, Composer (Airflow), Gsuite for impersonation of the service accounts, Cloud IAM, Cloud Pub/Sub, Cloud functions for handling functions.
- Worked on implementing Data Lake in Google Big Query, Google Cloud Storage, SQL Scripts to load data to BigQuery, and Composer for running the Talend and Query Scripts.
- Worked with the Spark improving the performance and optimization of the existing algorithms in Hadoop using Spark Context, Spark-SQL, Dataframe API, Spark Streaming, and MLlib, and worked explicitly on PySpark.