Results-driven Senior Data Engineer with 5+ years of experience in designing and implementing scalable data-driven solutions across cloud platforms like AWS, GCP, and Azure. Skilled in building robust ETL processes, data pipelines, and integrating real-time data workflows using tools such as Databricks, Spark, and AWS Glue. Proficient in data warehousing, dimensional modeling, and performance tuning, ensuring high data quality and efficiency.Collaborative team player with a proven ability to work across cross-functional teams to deliver impactful data insights and machine learning model integration. Strong expertise in maintaining secure, stable systems, applying best practices for data architecture, and driving business success through innovative big data solutions.Key strengths include:=> Building ETL processes in Databricks, Spark, and AWS Glue for real-time processing=> Developing scalable data pipelines and leveraging cloud services (AWS, GCP, Azure)=> Optimizing SQL queries for high performance and efficiency=> Hands-on experience with data lakes, distributed processing, and dimensional modeling=> Leading cloud migration strategies and implementing secure data architecturesPassionate about leveraging emerging technologies to deliver high-impact results and drive business growth.
-
Sr Data EngineerArcadisUnited States -
Data EngineerArcadis May 2023 - PresentSan Antonio, Texas, United StatesI conducted thorough assessments of third-party data handling solutions on AWS, ensuring compliance with internal requirements and stakeholder needs. Designed and implemented scalable data pipelines using AWS Glue and Databricks, facilitating real-time data processing and ETL operations. Utilized AWS Lambda and Databricks notebooks for serverless processing tasks, automating workflows to reduce operational overhead. Additionally, I developed and maintained data lakes on Amazon S3, focusing on… Show more I conducted thorough assessments of third-party data handling solutions on AWS, ensuring compliance with internal requirements and stakeholder needs. Designed and implemented scalable data pipelines using AWS Glue and Databricks, facilitating real-time data processing and ETL operations. Utilized AWS Lambda and Databricks notebooks for serverless processing tasks, automating workflows to reduce operational overhead. Additionally, I developed and maintained data lakes on Amazon S3, focusing on data partitioning and storage optimization, while leveraging Apache Spark on Amazon EMR for large-scale processing and performance tuning. My work included building CI/CD pipelines with AWS CodePipeline and CodeBuild, performing data quality checks, and collaborating on data models using Databricks Delta Lake. Furthermore, Developed Spark applications for data analysis, employed data cleansing methods, and adhered to best practices in data engineering, enhancing data reliability and quality across the organization. Show less -
Graduate Teaching AssistantUniversity Of Alabama At Birmingham Aug 2022 - Dec 2022Birmingham, Alabama, United StatesSupported "Foundations of Data Science" course, providing tutorials and one-on-one assistance to improve student outcomes. Contributed to course content development, aligning materials with curriculum objectives.Mentored students in machine learning and problem-solving techniques and led presentations on real-world data science projects. -
Data EngineerWipro Apr 2019 - May 2022Hyderabad, Telangana, IndiaI worked extensively on Google Cloud Dataflow to integrate data from on-prem systems like MySQL and Cassandra with cloud services such as Google Cloud Storage and BigQuery, applying transformations and loading the data back into GCP services. Managed and scheduled resources across clusters using Google Kubernetes Engine (GKE) and monitored Spark clusters through Google Cloud Logging and Dataproc UI. I transitioned log storage from Cassandra to BigQuery, enhancing query performance. I developed… Show more I worked extensively on Google Cloud Dataflow to integrate data from on-prem systems like MySQL and Cassandra with cloud services such as Google Cloud Storage and BigQuery, applying transformations and loading the data back into GCP services. Managed and scheduled resources across clusters using Google Kubernetes Engine (GKE) and monitored Spark clusters through Google Cloud Logging and Dataproc UI. I transitioned log storage from Cassandra to BigQuery, enhancing query performance. I developed data ingestion pipelines on Google Cloud Dataproc using Spark and Dataflow. Additionally, I created dashboards and visualizations in Google Data Studio and Looker, enabling business users and upper management to gain valuable insights. I performed large dataset migrations using Databricks, administered clusters, configured pipelines, and loaded data from Google Cloud Storage to Databricks using Dataflow. Using Google Cloud Functions, I built workflows to schedule and automate batch jobs, and utilized Terraform for pipeline orchestration, Finally, I leveraged Spark Streaming and RDD transformations in Databricks to conduct streaming analytics by ingesting data in mini-batches. Show less -
Data AnalystNtt Data Sep 2018 - Mar 2019Bengaluru, Karnataka, IndiaExperienced in analyzing large datasets to uncover trends and patterns, effectively communicating insights through text, charts, and graphs. Skilled in data cleaning, aggregation, and performing missing value imputation to enhance data quality. Demonstrated ability to assess data management metrics and provide recommendations for enterprise-wide data improvements. Successfully led recruitment efforts and developed strategic alliances to optimize team capabilities. Proficient in handling… Show more Experienced in analyzing large datasets to uncover trends and patterns, effectively communicating insights through text, charts, and graphs. Skilled in data cleaning, aggregation, and performing missing value imputation to enhance data quality. Demonstrated ability to assess data management metrics and provide recommendations for enterprise-wide data improvements. Successfully led recruitment efforts and developed strategic alliances to optimize team capabilities. Proficient in handling large-scale datasets, such as customer credit attributes from TransUnion, to drive actionable business insights. Show less
Rakesh Kumar Education Details
-
Computer Science
Frequently Asked Questions about Rakesh Kumar
What company does Rakesh Kumar work for?
Rakesh Kumar works for Arcadis
What is Rakesh Kumar's role at the current company?
Rakesh Kumar's current role is Sr Data Engineer.
What schools did Rakesh Kumar attend?
Rakesh Kumar attended University Of Alabama At Birmingham, Sathyabama Institute Of Science & Technology, Chennai.
Who are Rakesh Kumar's colleagues?
Rakesh Kumar's colleagues are Marcílio Martins, Katherine Chua, Natalia Geier, Ricknair Capin, Mohannad Nasralah, Carlos Recalde, Pia Niesen.
Not the Rakesh Kumar you were looking for?
-
Rakesh Kumar
Crystal Lake, Il3gmail.com, amada.com, e-ci.com1 +184746XXXXX
-
Rakesh Kumar
Senior Director Of Product | Driving Strategy And Execution In Payments & Identity | Optimizing Enterprise Operations | Passionate About Customer-Centric SolutionsO'fallon, Mo -
Rakesh Kumar
Naperville, Il5yahoo.com, yahoo.com, sierraatlantic.com, hitachiconsulting.com, hitachiconsulting.com2 +151055XXXXX
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial