Arghya Saha Email and Phone Number
Arghya Saha work email
- Valid
Arghya Saha personal email
As a Data Engineering Lead and Data Platform Solution Architect, I have extensive hands-on experience in building robust and scalable data pipelines and platforms to support various applications like Apache Spark, Hive, Presto/Trino, Kafka, Airflow, Delta Lake, Iceberg on Kubernetes. I specialize in designing platform-agnostic solutions that can be easily lifted and shifted to any public cloud provider or on-premise environment.In my role as a Data Engineering Lead, I have extensive experience in managing teams of data engineers, providing technical leadership, and driving successful delivery of complex data solutions. I work closely with stakeholders to understand their needs, develop roadmaps, and ensure timely and high-quality delivery of data projects. I am a strong communicator and enjoy collaborating with cross-functional teams to achieve shared goals.My expertise extends to a wide range of technologies, including Kubernetes (EKS, Rancher), Docker, CRD, Helm, Ingress Controller, and Cluster Autoscaler. I am also proficient in Python, Core Java, and Scala, and have a deep understanding of AWS services such as EMR, EKS, EC2, S3, SNS, SQS, Lambda, and CloudWatch.In addition, I have strong skills in infrastructure and DevOps tools such as Terraform, Jenkins, and GitLab Runner. Overall, my goal is to help organizations leverage the full potential of their data by building robust and scalable data platforms that are tailored to their unique needs. I am passionate about staying up-to-date with the latest technologies and best practices in the industry and am always looking for new challenges and opportunities to learn and grow.
Seagate Technology
View- Website:
- seagate.com
- Employees:
- 15272
-
Senior Staff EngineerSeagate Technology Jun 2022 - PresentPune, Maharashtra, IndiaDesigning & Building Data Analytics and Gen AI Platform using LLM -
Staff EngineerSeagate Technology Sep 2021 - May 2022Pune, Maharashtra, IndiaAs a Data Engineer and Data Platform Solution Architect, I was responsible for designing and building a platform-agnostic data engineering platform. My key achievements in this role include:- Building a platform to run big data applications like Spark, Presto/Trino, Hive Metastore Service, Apache Airflow, and JupyterHub on Kubernetes, which can be easily lifted and shifted to any public cloud provider or on-premise environment.- Migrating Seagate Enterprise Data Platform from AWS to Seagate Lyve Cloud, ensuring seamless transition of data and applications.- Migrating thousands of production Spark jobs from AWS EMR to Spark Operator on Kubernetes, achieving better performance, cost efficiency, and resource utilization.- Optimizing Spark on Kubernetes performance, cost, and resource utilization by using NMVe, S3A Committers, and enabling graceful decommission.- Using new Spark 3 features like AQE and Dynamic Partition Pruning to boost performance of existing jobs and reduce processing time.- Extracting important Spark driver and executor JMX metrics to Prometheus and using those to optimize resource utilization, identify bottlenecks, and fine-tune the platform.- Conducting AWS cost optimization across EMR, EKS, S3, and EC2, achieving significant savings for the organization.- Building a CI/CD pipeline for Kubernetes application deployments and data pipelines, ensuring fast and reliable delivery of new features and updates.- Performing POC with innovative data products like Varada, Alluxio, Iguazio, Neuroblade, and evaluating their potential to enhance the platform and address specific use cases.- Segregating compute and storage, scaling them independently, and solving latency issues by implementing suitable and smart caching solutions. -
Senior EngineerSeagate Technology Jun 2019 - Aug 2021Pune, Maharashtra, IndiaAs a Data Engineer, I was responsible for designing and building platform-agnostic data engineering pipelines. My key achievements in this role include:- Designed and built data pipelines using the latest open source tools such as Airflow, Spark, Livy, Presto, and Kubernetes, to enable efficient data processing and analysis.- Developed a generic ETL framework using Spark and Presto to allow for seamless data integration across various sources.- Successfully migrated data pipelines from on-premises MapReduce to Spark on AWS EMR, resulting in improved scalability and performance.- Deployed a Presto cluster using AWS EC2 ASG with auto scaling and graceful shutdown features to ensure high availability and reduce operational costs.- Conducted Presto cluster stabilization and optimization activities to improve query performance, reduce latency, and ensure reliable operation. -
Senior Data EngineerBarclays Feb 2017 - Jun 2019Pune, Maharashtra, IndiaAs a Data Engineer at Barclays, I achieved the following:- Designed and developed a highly scalable Slowly Changing Dimension (SCD) engine using Spark and Scala, which improved data quality and reduced processing time.- Developed a reconciliation tool using Spark to compare large volumes of data from different sources, which helped ensure data accuracy and consistency across the organization.- Contributed to various data warehouse and data mart projects utilizing Big Data technologies, demonstrating my ability to work with diverse teams and successfully deliver high-quality data solutions. -
Data EngineerCognizant Jul 2014 - Feb 2017Pune, Maharashtra, IndiaAs a Data Engineer at Cognizant, I achieved the following:- Worked on various Big Data projects for Barclays, contributing to their successful implementation and completion- Involved in data migration and transformation projects, ensuring smooth transition of data between systems- Designed and developed multiple automation utilities as part of the innovation team, improving overall efficiency and productivity- Utilized skills in big data technologies and programming languages such as Hadoop, Spark, Scala, and Python to build scalable and effective solutions- Collaborated with cross-functional teams, including data scientists, business analysts, and software developers, to deliver high-quality results
Arghya Saha Skills
Arghya Saha Education Details
-
Computer Science -
Science
Frequently Asked Questions about Arghya Saha
What company does Arghya Saha work for?
Arghya Saha works for Seagate Technology
What is Arghya Saha's role at the current company?
Arghya Saha's current role is Solution Architect - Generative AI and Big Data | Senior Staff Engineer @ Seagate.
What is Arghya Saha's email address?
Arghya Saha's email address is ar****@****ant.com
What schools did Arghya Saha attend?
Arghya Saha attended St. Thomas' College Of Engineering & Technology 122, Jawahar Navodaya Vidyalaya (Jnv).
What skills is Arghya Saha known for?
Arghya Saha has skills like Selenium Webdriver, Testng, Junit, Jenkins, Git, Sql, Java, Pl/sql, Perl Automation, Cgi/perl, Test Automation, Greeen Hat Tester.
Who are Arghya Saha's colleagues?
Arghya Saha's colleagues are Dave Schell, Elizabeth Diaz-Santos, Kenny Moore, Yashwanth Kumar B G, Subongkot Longwilai, Rahul Goel, Dat Quach.
Not the Arghya Saha you were looking for?
-
Arghya Saha
Mumbai -
Arghya Saha
Bengaluru2gmail.com, abnormalsecurity.com -
Arghya Saha
Arghya Saha || Ceo And Founder At Ideatix || Backend Developer || Social Media Marketing SpecialistKarimganj
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial