AeroLeads people directory · profile

Santosh K Email & Phone Number

Senior GCP Data Engineer at Edward Jones | HDFS | Python | GCP | PySpark | PL/SQL | Scala | Hadoop | HTML | XML | SQL | Tableau | MySQL | Actively looking for new opportunities on C2C/C2H
Location: Charlotte, North Carolina, United States · 5 work roles · 1 school
LinkedIn matched
✓ Verified May 2026 · 3 data sources · Profile completeness 86%

Who is Santosh K? Overview

Quick answer

Santosh K is listed as a Senior GCP Data Engineer at Edward Jones, a company with 35,357 employees, and is based in Charlotte, North Carolina, United States. AeroLeads shows a matched LinkedIn profile for Santosh K.

Santosh K's work history includes Senior GCP Data Engineer at Edward Jones and GCP Data Engineer at Amigos Software Solutions Private Limited. Santosh K holds a Bachelor's degree in Computer Science from KL University.

Profile bio

About Santosh K

• IT professional with nearly a decade of experience as a Senior Data Engineer, specializing in designing data-intensive applications across a spectrum of technologies including the Hadoop Ecosystem, Big Data Analytics, Data Warehousing/Data Mart, Cloud Data Engineering, and Data Visualization.
• Possesses an in-depth understanding of Hadoop architecture and its integral components, encompassing YARN, HDFS, Name Node, Data Node, Job Tracker, Application Master, Resource Manager, Task Tracker, and the MapReduce programming paradigm.
• Demonstrates strong proficiency in developing enterprise-level solutions leveraging Hadoop, harnessing key components such as Apache Spark, Airflow, MapReduce, HDFS, Sqoop, PIG, Hive, HBase, Flume, NiFi, Kafka, Zookeeper, and YARN.
• Extensive hands-on experience in developing Spark applications utilizing tools such as Spark Core, Spark MLlib, Spark Streaming, and RDD transformations.
• Proficient in data cleansing and analysis using HiveQL, Pig Latin, and custom MapReduce programs in Python.
• Skilled in importing streaming data into HDFS using Flume sources, sinks, and interceptors.
• Experienced in utilizing Oozie as a workflow scheduler to manage Hadoop jobs with a Directed Acyclic Graph (DAG) structure.
• Familiar with common operators in Airflow, including Python Operator, Bash Operator, and Google Cloud Storage operators.
• Expertise in data import and export using Sqoop, facilitating seamless data movement between Hadoop Distributed File System (HDFS) and Relational Database Systems (RDBMS) such as Teradata.
• Skilled in working with various database platforms, including both NoSQL and RDBMS tools such as MySQL, Oracle, SQL Server, PostgreSQL, DB2, DynamoDB, MongoDB, HBase, Cassandra, and Cosmos DB.
• Extensive experience in designing, implementing, and optimizing data models in Apache Cassandra, leveraging denormalization, wide rows, and partition keys for scalability and fault tolerance.
• Proficient in designing and implementing secure data ingestion pipelines using Apache NiFi, leveraging robust security features such as user authentication, role-based access control (RBAC), and secure data transmission protocols (e.g., SSL/TLS).
• Familiar with scheduling and workflow orchestration tools like Automic, Control-M, and Tivoli.
• Strong understanding of Data Warehousing concepts with hands-on experience in implementing complete life cycle projects, including data modeling, OLTP & OLAP database system design using ER diagrams, ETL processing, and data marts.

Current workplace

Santosh K's current company

Edward Jones
Role: Senior GCP Data Engineer
Location: St. Louis, Missouri, United States
Employees: 35,357
5 roles

Santosh K work experience

Senior GCP Data Engineer

Current

St. Louis, Missouri, United States

  • Worked extensively with Google Cloud Platform (GCP) services, including BigQuery, Google Cloud Storage (GCS) buckets, Google Cloud Functions, Cloud Dataflow, Dataproc, and Stackdriver.
  • Built Power BI reports on Azure Analysis Services, optimizing performance for enhanced data analysis capabilities.
  • Leveraged Cloud Shell SDK within Google Cloud Platform (GCP) to configure services such as Dataproc, Storage, and BigQuery.
  • Engineered intricate mappings for loading data from diverse sources into the Data Warehouse, incorporating various transformations and stages such as Joiner, Transformer, Aggregator, Update Strategy, Rank, and Lookup.
  • Engineered streaming applications with PySpark to ingest data from Kafka and store it in NoSQL databases like HBase and Cassandra.
May 2023 - Present

GCP Data Engineer

Hyderabad, Telangana, India

  • Employed a range of tools and services within the Google Cloud Platform, such as BigQuery, Cloud Storage, Pub/Sub, Composer, Cloud Load Balancing, Cloud SQL, Dataproc, and Stackdriver.
  • Engaged in the complete data engineering life cycle, encompassing analysis, solution design, data pipeline engineering, testing, deployment, scheduling, and implementation.
  • Wrote Python scripts within Airflow to execute ETL operations on extracted data, leveraging Apache Spark for processing before transferring it to MongoDB.
  • Configured HDFS to effectively store streamed data from Spark, enabling continuous data ingestion from Kafka. Engineered a data pipeline to capture crucial data points from customer chatbot service interactions.
  • Facilitated the migration of on-premises ETL processes to the Google Cloud Platform (GCP) using cloud-native technologies like BigQuery, Cloud Dataproc, Google Cloud Storage, and Composer to empower the Data Science team.
  • Orchestrated AWS EC2 instances and initiated Spark jobs on Amazon Elastic MapReduce (EMR).
May 2019 - Jul 2022

Senior AWS Data Engineer

Hyderabad, Telangana, India

  • Designed and implemented an Enterprise Data Lake to accommodate diverse use cases, including analytics, processing, storage, and reporting of large and rapidly evolving data sets.
  • Established a robust security framework using AWS Lambda and DynamoDB to enable fine-grained access control for objects within AWS S3.
  • Configured and deployed Kerberos authentication principals to ensure secure network communication within the cluster, extensively testing HDFS, Hive, Pig, and MapReduce functionalities for new users.
  • Conducted comprehensive architecture and implementation evaluations of AWS services such as Amazon EMR, Redshift, and S3.
  • Developed machine learning algorithms in Python to make predictions by leveraging Kinesis Firehose and the S3 data lake.
  • Utilized AWS EMR to efficiently transform and migrate large volumes of data across various AWS data stores and databases, including Amazon S3 and Amazon DynamoDB.
Jun 2017 - Apr 2019

Azure Data Engineer

Hyderabad, Telangana, India

  • Used Azure Data Factory to integrate data from diverse sources, including on-premises databases like MySQL and Cassandra, as well as cloud storage solutions such as Blob storage and Azure SQL.
  • Managed and fine-tuned resources across the cluster using Azure Kubernetes Service, while monitoring Spark cluster performance through Log Analytics and Ambari Web UI.
  • Enhanced query performance by transitioning log storage from Cassandra to Azure SQL Data Warehouse.
  • Engineered robust data pipelines utilizing Apache Flink for both stream processing and batch processing of extensive datasets. These pipelines delivered high throughput and low latency processing capabilities.
  • Developed data ingestion pipelines on Azure HDInsight Spark cluster using Azure Data Factory and Spark SQL, while also interfacing with Cosmos DB using SQL API and Mongo API.
  • Leveraged Azure Logic Apps to orchestrate workflows for scheduling and automating batch jobs, integrating various services including ADF pipelines, HTTP requests, and email triggers.
Sep 2015 - Mar 2017

Data Engineer

Hyderabad, Telangana, India

  • Mastered the creation of Tableau dashboards for comprehensive reporting on analyzed data, ensuring effective visualization of insights.
  • Proficient in working with NoSQL databases, particularly HBase, for efficient data management and retrieval. Implemented a staged process for input records files, ensuring thorough cleaning and validation of data.
  • Automated the extraction of numerous flat and Excel files from diverse sources, including FTP and SFTP (Secure FTP), streamlining data acquisition processes.
  • Utilized Jenkins for continuous integration, seamlessly integrating code changes, while leveraging GitHub as a centralized repository for version control and collaborative development.
  • Executed various dataflow and control flow tasks, including loop and sequence containers, script tasks, SQL task execution, and package configuration, to streamline data processing workflows.
  • Developed SSIS packages to facilitate the export of data from SQL Server to Excel spreadsheets and vice versa, ensuring smooth data interchange between platforms.
May 2013 - Aug 2015

1 education record

Santosh K education

Bachelor's degree, Computer Science, KL University

FAQ

Frequently asked questions about Santosh K

What company does Santosh K work for?

Santosh K works for Edward Jones.

What is Santosh K's role at Edward Jones?

Santosh K is listed as a Senior GCP Data Engineer at Edward Jones.

Where is Santosh K based?

Santosh K is based in Charlotte, North Carolina, United States while working with Edward Jones.

What companies has Santosh K worked for?

Santosh K has worked for Edward Jones, Amigos Software Solutions Private Limited, Hudda Infotech, Maisa Solutions, Inc., and Ceequence Technologies Pvt Ltd.

Who are Santosh K's colleagues at Edward Jones?

Santosh K's colleagues at Edward Jones include Debbie Pritchard Stuart, Rasaan Moshesh, Skip Knapp, Jessica Voge, and Sufia Orbe.

How can I contact Santosh K?

You can use AeroLeads to view verified contact signals for Santosh K at Edward Jones, including work email, phone, and LinkedIn data when available.

What schools did Santosh K attend?

Santosh K holds a Bachelor's degree in Computer Science from KL University.
