Venu G Email and Phone Number
• Having total of 16+ years and an ETL professional with 10+ years of experience in Data Engineering domain.• Having 5+ years of experience in Python, PySpark, Azure Databricks, Azure Data Factory, Azure Synapse, and Azure Data Lake Storage (Gen2).• Microsoft certified Azure Data Engineer Associate (DP-203) & Azure Administrator (AZ-104)• Experience in Hadoop Framework, Map/Reduce, HIVE and SQOOP. • SCRUM Master Certified professional.• Good communication, interpersonal skills, self-motivated and quick learning skills.• Primary Skill category: Azure Databricks, Azure Data Factory, Azure Data Lake Storage (Gen2), Azure SQL Server, Azure Synapse and Azure Devops.• Programing Languages and Scripting: Python, Pyspark, Shell Scripting• DataBase: MySQL, Microsoft SQL Server• Bigdata Skills: Hadoop, Spark and Power BI• Other Tools: Git, JIRA, and Agile.
-
Azure Data Engineer Lead/ManagerIdea Queues Llc Mar 2022 - Present• Collecting the business requirements and drafting in to the design format/architecture diagram.• Migrating the on-prem informatica processing to azure cloud.• Writing the SQL stored procedures in to pyspark code.• Writing the common utilities in python code.• Developing the databricks notebooks.• Applied various business logics using pyspark.• Implemented optimized file format Delta.• Applied various optimizations while implementing the transformation… Show more • Collecting the business requirements and drafting in to the design format/architecture diagram.• Migrating the on-prem informatica processing to azure cloud.• Writing the SQL stored procedures in to pyspark code.• Writing the common utilities in python code.• Developing the databricks notebooks.• Applied various business logics using pyspark.• Implemented optimized file format Delta.• Applied various optimizations while implementing the transformation logic.• Writing back the transformed data to the processed layer (ADLS).• Orchestrate all pyspark jobs using ADF.• Enable the proper alerting mechanism, and data quality checks in ADF.• Enabled the logging mechanism.• Deployed the code on various environments using azure devops (CI/CD). • Prepared Unit test cases • Involved in CI/CD pipeline process.• Prepared sprint planning and achieved the goals.• Managed a team of 7 people. Show less -
Azure Data EngineerForsmart Solutions And Innovations Pvt.Ltd Aug 2018 - Mar 2022• Extract the data from various source systems and land it in ADLS (landing layer).• Converting the raw data in a unified file format in staging layer.• Start reading the data from staging layer using databricks notebooks.• Applied various business logics using pyspark.• Thorough understanding of mapping document while applying transformation logic.• Implemented optimized file formats like Avro, ORC, Parquet and Delta.• Applied various optimizations while implementing… Show more • Extract the data from various source systems and land it in ADLS (landing layer).• Converting the raw data in a unified file format in staging layer.• Start reading the data from staging layer using databricks notebooks.• Applied various business logics using pyspark.• Thorough understanding of mapping document while applying transformation logic.• Implemented optimized file formats like Avro, ORC, Parquet and Delta.• Applied various optimizations while implementing the transformation logic.• Writing back the transformed data to the processed layer(ADLS).• Orchestrate all pyspark jobs using ADF.• Enable the proper alerting mechanism, and data quality checks in ADF.• Properly handled exceptions when we are dealing with huge amount of data.• Enabled the logging mechanism • Prepared Unit test cases • Prepared DoD (Definition of Done)• Prepared proper transition plan to Ops(Operations) team. Show less
-
Big Data DeveloperForsmart Solutions And Innovations Pvt.Ltd Mar 2012 - Mar 2018• Used the Cloudera as a distribution platform for Hadoop.• Created HIVE Tables on top of datasets which are in staging layer.• Used Various Hive performance optimization techniques.• Created Impala Tables as the target tables to load data into the integration layer.• Extensively used Cloudera Stream-sets to transfer Raw data files into HDFS foundation layer.• Used Parquet file format in all target tables in Impala for better performance.• Used Spark core and Spark SQL… Show more • Used the Cloudera as a distribution platform for Hadoop.• Created HIVE Tables on top of datasets which are in staging layer.• Used Various Hive performance optimization techniques.• Created Impala Tables as the target tables to load data into the integration layer.• Extensively used Cloudera Stream-sets to transfer Raw data files into HDFS foundation layer.• Used Parquet file format in all target tables in Impala for better performance.• Used Spark core and Spark SQL for data transformations.• Involved in Building data frames and RDD’s using Spark SQL and Spark Core Show less
-
Etl DeveloperTata Consultancy Services Aug 2003 - Feb 2012
Venu G Education Details
-
Master Of Computer Applications - Mca
Frequently Asked Questions about Venu G
What company does Venu G work for?
Venu G works for Idea Queues Llc
What is Venu G's role at the current company?
Venu G's current role is Azure Data Engineer Lead/Manager at IDEA QUEUES LLC.
What schools did Venu G attend?
Venu G attended Osmania University.
Not the Venu G you were looking for?
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial