Arghya Saha

Arghya Saha Email and Phone Number

Solution Architect - Generative AI and Big Data | Senior Staff Engineer @ Seagate @ Seagate Technology
cupertino, california, united states
Arghya Saha's Location
Pune, Maharashtra, India, India
Arghya Saha's Contact Details

Arghya Saha work email

Arghya Saha personal email

n/a
About Arghya Saha

As a Data Engineering Lead and Data Platform Solution Architect, I have extensive hands-on experience in building robust and scalable data pipelines and platforms to support various applications like Apache Spark, Hive, Presto/Trino, Kafka, Airflow, Delta Lake, Iceberg on Kubernetes. I specialize in designing platform-agnostic solutions that can be easily lifted and shifted to any public cloud provider or on-premise environment.In my role as a Data Engineering Lead, I have extensive experience in managing teams of data engineers, providing technical leadership, and driving successful delivery of complex data solutions. I work closely with stakeholders to understand their needs, develop roadmaps, and ensure timely and high-quality delivery of data projects. I am a strong communicator and enjoy collaborating with cross-functional teams to achieve shared goals.My expertise extends to a wide range of technologies, including Kubernetes (EKS, Rancher), Docker, CRD, Helm, Ingress Controller, and Cluster Autoscaler. I am also proficient in Python, Core Java, and Scala, and have a deep understanding of AWS services such as EMR, EKS, EC2, S3, SNS, SQS, Lambda, and CloudWatch.In addition, I have strong skills in infrastructure and DevOps tools such as Terraform, Jenkins, and GitLab Runner. Overall, my goal is to help organizations leverage the full potential of their data by building robust and scalable data platforms that are tailored to their unique needs. I am passionate about staying up-to-date with the latest technologies and best practices in the industry and am always looking for new challenges and opportunities to learn and grow.

Arghya Saha's Current Company Details
Seagate Technology

Seagate Technology

View
Solution Architect - Generative AI and Big Data | Senior Staff Engineer @ Seagate
cupertino, california, united states
Website:
seagate.com
Employees:
15272
Arghya Saha Work Experience Details
  • Seagate Technology
    Senior Staff Engineer
    Seagate Technology Jun 2022 - Present
    Pune, Maharashtra, India
    Designing & Building Data Analytics and Gen AI Platform using LLM
  • Seagate Technology
    Staff Engineer
    Seagate Technology Sep 2021 - May 2022
    Pune, Maharashtra, India
    As a Data Engineer and Data Platform Solution Architect, I was responsible for designing and building a platform-agnostic data engineering platform. My key achievements in this role include:- Building a platform to run big data applications like Spark, Presto/Trino, Hive Metastore Service, Apache Airflow, and JupyterHub on Kubernetes, which can be easily lifted and shifted to any public cloud provider or on-premise environment.- Migrating Seagate Enterprise Data Platform from AWS to Seagate Lyve Cloud, ensuring seamless transition of data and applications.- Migrating thousands of production Spark jobs from AWS EMR to Spark Operator on Kubernetes, achieving better performance, cost efficiency, and resource utilization.- Optimizing Spark on Kubernetes performance, cost, and resource utilization by using NMVe, S3A Committers, and enabling graceful decommission.- Using new Spark 3 features like AQE and Dynamic Partition Pruning to boost performance of existing jobs and reduce processing time.- Extracting important Spark driver and executor JMX metrics to Prometheus and using those to optimize resource utilization, identify bottlenecks, and fine-tune the platform.- Conducting AWS cost optimization across EMR, EKS, S3, and EC2, achieving significant savings for the organization.- Building a CI/CD pipeline for Kubernetes application deployments and data pipelines, ensuring fast and reliable delivery of new features and updates.- Performing POC with innovative data products like Varada, Alluxio, Iguazio, Neuroblade, and evaluating their potential to enhance the platform and address specific use cases.- Segregating compute and storage, scaling them independently, and solving latency issues by implementing suitable and smart caching solutions.
  • Seagate Technology
    Senior Engineer
    Seagate Technology Jun 2019 - Aug 2021
    Pune, Maharashtra, India
    As a Data Engineer, I was responsible for designing and building platform-agnostic data engineering pipelines. My key achievements in this role include:- Designed and built data pipelines using the latest open source tools such as Airflow, Spark, Livy, Presto, and Kubernetes, to enable efficient data processing and analysis.- Developed a generic ETL framework using Spark and Presto to allow for seamless data integration across various sources.- Successfully migrated data pipelines from on-premises MapReduce to Spark on AWS EMR, resulting in improved scalability and performance.- Deployed a Presto cluster using AWS EC2 ASG with auto scaling and graceful shutdown features to ensure high availability and reduce operational costs.- Conducted Presto cluster stabilization and optimization activities to improve query performance, reduce latency, and ensure reliable operation.
  • Barclays
    Senior Data Engineer
    Barclays Feb 2017 - Jun 2019
    Pune, Maharashtra, India
    As a Data Engineer at Barclays, I achieved the following:- Designed and developed a highly scalable Slowly Changing Dimension (SCD) engine using Spark and Scala, which improved data quality and reduced processing time.- Developed a reconciliation tool using Spark to compare large volumes of data from different sources, which helped ensure data accuracy and consistency across the organization.- Contributed to various data warehouse and data mart projects utilizing Big Data technologies, demonstrating my ability to work with diverse teams and successfully deliver high-quality data solutions.
  • Cognizant
    Data Engineer
    Cognizant Jul 2014 - Feb 2017
    Pune, Maharashtra, India
    As a Data Engineer at Cognizant, I achieved the following:- Worked on various Big Data projects for Barclays, contributing to their successful implementation and completion- Involved in data migration and transformation projects, ensuring smooth transition of data between systems- Designed and developed multiple automation utilities as part of the innovation team, improving overall efficiency and productivity- Utilized skills in big data technologies and programming languages such as Hadoop, Spark, Scala, and Python to build scalable and effective solutions- Collaborated with cross-functional teams, including data scientists, business analysts, and software developers, to deliver high-quality results

Arghya Saha Skills

Selenium Webdriver Testng Junit Jenkins Git Sql Java Pl/sql Perl Automation Cgi/perl Test Automation Greeen Hat Tester Hadoop Hive Etl Testing Ab Initio Cucumber Maven Jira Agile Methodologies Selenium Shell Scripting

Arghya Saha Education Details

Frequently Asked Questions about Arghya Saha

What company does Arghya Saha work for?

Arghya Saha works for Seagate Technology

What is Arghya Saha's role at the current company?

Arghya Saha's current role is Solution Architect - Generative AI and Big Data | Senior Staff Engineer @ Seagate.

What is Arghya Saha's email address?

Arghya Saha's email address is ar****@****ant.com

What schools did Arghya Saha attend?

Arghya Saha attended St. Thomas' College Of Engineering & Technology 122, Jawahar Navodaya Vidyalaya (Jnv).

What skills is Arghya Saha known for?

Arghya Saha has skills like Selenium Webdriver, Testng, Junit, Jenkins, Git, Sql, Java, Pl/sql, Perl Automation, Cgi/perl, Test Automation, Greeen Hat Tester.

Who are Arghya Saha's colleagues?

Arghya Saha's colleagues are Dave Schell, Elizabeth Diaz-Santos, Kenny Moore, Yashwanth Kumar B G, Subongkot Longwilai, Rahul Goel, Dat Quach.

Not the Arghya Saha you were looking for?

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.