Vishal G

Vishal G Email and Phone Number

Data Engineer/Data Analyst @ Pike Engineering
fort mill, south carolina, united states
Vishal G's Location
Irving, Texas, United States, United States
About Vishal G

I have 8+years experience in managing Databricks Lake House, migrating Hadoop and SQL databases to Azure Data Lake, Azure Data Lake Analytics, Azure SQL Database, Data Bricks, Azure SQL Data Warehouse, excel at managing and granting data access, as well as migrating on-premises databases to Azure Data Lake store using Azure Data Factory.I have experience in writing SQL Complex queries, data visualization tools, and ETL tools.Cloud Technologies and Services: Amazon AWS- EMR, EC2, ENS, RDS, S3, Athena, Glue, Elastic search, Lambda, SQS, DynamoDB, Redshift, Kinesis, Microsoft Azure- Databricks, Data Lake, Blob Storage, Azure Data Factory, SQL Database, SQL Data Warehouse, Google Cloud Platform.Big Data Ecosystems: Databricks Lakehouse, Apache Spark, HDFS, YARN, Map-reduce, Sqoop, Hive, Oozie, Pig, Spark, Zookeeper, Cloudera Manager, Kafka, Flume, NiFi, Connect, Airflow, Stream Sets, Kafka connectHadoop Distributions: Apache Hadoop 2. x, Cloudera CDP, Hortonworks HDPScripting language Python, PySpark, SparkSQL, SQL, Scala, R, shell scripting, HiveQL.NoSQL Database Cassandra, MongoDB, HbaseDatabase MySQL, Oracle, Teradata, MSSQL SERVER, PostgreSQL, DB2Version Control Git, SVNBI tools Tableau, PowerBI

Vishal G's Current Company Details
Pike Engineering

Pike Engineering

View
Data Engineer/Data Analyst
fort mill, south carolina, united states
Employees:
20
Vishal G Work Experience Details
  • Pike Engineering
    Senior Data Engineer
    Pike Engineering Apr 2020 - Present
    North Carolina, United States
    •Led a team of engineers in designing and implementing a scalable and high-performance Databricks architecture, ensuring optimal resource utilization and cost efficiency.•Led the design and implementation of Kafka-based data streaming pipelines, ensuring high throughput, fault tolerance, and low latency for critical business processes.•Orchestrated the migration from ZooKeeper-based Kafka clusters to ZooKeeper-less Kafka clusters, resulting in improved cluster stability and easier maintenance.•Collaborated with cross-functional teams to develop custom Kafka Connect connectors for integrating Kafka with various external systems, reducing data integration complexities.•Managed the Schema Registry to enforce schema compatibility, versioning, and data governance standards across the organization's Kafka topics.•Leveraged KSQL to build real-time stream processing applications, enabling the extraction of actionable insights from streaming data.•Configured and optimized Rest Proxy for secure and efficient RESTful access to Kafka topics, enhancing data accessibility for external applications.•Implemented Mirror Maker to replicate Kafka data between geographically distributed data centers, ensuring data redundancy and disaster recovery capabilities
  • Sonder Inc.
    Data Engineer
    Sonder Inc. Apr 2019 - Mar 2020
    San Francisco Bay Area
    • Utilized Spark SQL API in PySpark to extract and load data and perform SQL queries.• Worked on developing PySpark script to encrypting the raw data by using hashing algorithms concepts on client specified columns.• Extracted, transformed, and loaded data from various source systems into Azure Data Storage services using Azure Data Factory, T-SQL, Spark SQL, and U-SQL Azure Data Lake Analytics.• Processed data in Azure Databricks and ingested it into Azure services such as Azure Data Lake, Azure Storage, Azure SQL, and Azure Data Warehouse.• Created pipelines in Azure Data Factory using Linked Services, Datasets, and Pipelines to extract, transform, and load data from different sources like Azure SQL, Blob storage, Azure SQL Data Warehouse, and write-back tools.• Developed PySpark and Spark-SQL applications for data extraction, transformation, and aggregation from multiple file formats, providing insights into customer usage patterns.• Assumed responsibility for estimating cluster size, monitoring, and troubleshooting Spark Databricks clusters.• Proficient in optimizing the performance of Spark applications, including batch interval time, parallelism level, and memory tuning.• Responsible for Design, Development, and testing of the database and Developed Stored Procedures and Views.
  • Barclays
    Data Engineer
    Barclays Jul 2015 - Mar 2018
    Remote
    • Developed Spark applications using PySpark and Spark SQL to extract, transform, and aggregate data from multiple file formats, enabling analysis and transformation to uncover insights into customer usage patterns.• Led the architecture and design of a large-scale data warehousing system, incorporating best practices in data modeling and dimensional modeling.• Worked closely with business stakeholders to define data architecture strategies that aligned with organizational goals.• Ensured data governance and quality by implementing data profiling, data cleansing, and data lineage tracking.• Collaborated with AWS cloud architects to leverage DynamoDB and other AWS services for high-performance NoSQL database solutions.• Developed cost optimization strategies, leading to a 15% reduction in operational expenses.• Mentored junior data engineers and provided technical guidance on data modeling, architecture, and database optimization.• Possess a solid understanding of Spark Architecture, including Spark Core, Spark SQL, Data Frames, Spark Streaming, driver node, worker node, stages, executors, and tasks.• Wrote Spark jobs using Scala to interact with PostgreSQL databases through Spark SQL Context and accessed Hive tables using Hive Context.

Vishal G Education Details

Frequently Asked Questions about Vishal G

What company does Vishal G work for?

Vishal G works for Pike Engineering

What is Vishal G's role at the current company?

Vishal G's current role is Data Engineer/Data Analyst.

What schools did Vishal G attend?

Vishal G attended University Of New Orleans.

Who are Vishal G's colleagues?

Vishal G's colleagues are Aidan Monte, Kory Ziegler, Scott Brooks, Charles Barr, Chris Kilbarger, Zach Harrelson, Kyle Stropki.

Not the Vishal G you were looking for?

  • Vishal G

    Charlotte, Nc
  • Vishal G

    Senior Data Engineer At Urgently
    Dublin, Ca
  • Vishal G

    San Francisco, Ca
  • Vishal G

    Sr. Java Full Stack Developer | Specializing In Java, Spring Boot, Angular, And Aws | Crafting Scalable Microservices & Cloud-Based Solutions | Focused On Performance, Clean Code, And Agile Methodologies
    Dayton, Oh
  • Vishal G

    Full Stack Developer At The Bank Of New York Mellon Corporation Foundation
    Southgate, Mi

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.