Saroj Kumar Sahoo

Saroj Kumar Sahoo Email and Phone Number

Senior Data Engineer @ Atlassian
Kirkland, WA, US
Saroj Kumar Sahoo's Location
Greater Seattle Area, United States
About Saroj Kumar Sahoo

As a Senior Data Engineer at Amazon Web Services, I design and implement scalable, secure, and flexible cloud architectures that handle massive amounts of data. I have over 13 years of experience in big data processing, data warehousing, ETL, and data analytics, using various AWS services and technologies such as EMR, S3, Glue, Redshift, Spark, and Lambda.I enjoy solving challenging architectural and scalability problems and delivering high-quality data solutions that meet the needs of different data science teams and customers. I have led and mentored other data engineers, built data lakes and streaming data pipelines, and worked on proof of concepts for various AWS technologies. I am also an Oracle Certified Associate and have a strong background in SQL and Python. I am passionate about learning new skills and staying updated on the latest trends and innovations in the data and cloud domain.

Saroj Kumar Sahoo's Current Company Details
Atlassian

Atlassian

View
Senior Data Engineer
Kirkland, WA, US
Website:
atlassian.com
Employees:
17621
Saroj Kumar Sahoo Work Experience Details
  • Atlassian
    Senior Data Engineer
    Atlassian
    Kirkland, Wa, Us
  • Amazon Web Services (Aws)
    Senior Data Engineer
    Amazon Web Services (Aws) Sep 2021 - Present
    Seattle, Washington, United States
    • Architected, Designed and implemented scalable, secure cloud architecture based on Amazon Web Services. Leveraged AWS cloud services such as Glue, EMR ,Redshift,Lambda to build secure, highly scalable and flexible systems that handled expected and unexpected load bursts and can quickly evolve during development iterations.• Designed and developed a highly scalable data model and data warehouse using Redshift and snowflake resulting in a 40% improvement in data processing speed and a 25% reduction in storage costs.• Collaborated with stakeholders to understand their data requirements and developed customized data solutions, leading to a 30% increase in data accuracy and a 20% improvement in data accessibility.• Implemented data security policies and procedures, ensuring compliance with industry regulations and achieving a 100% data security audit score.• Developed generic Spark frameworks to assist different data science teams on onboarding different datasets from various sources resulting in 50% improvement in onboarding effort.• Leveraged AWS S3, Lambda and Glue to build serverless event driven data pipelines.• Developed end to end streaming data pipelines using Kinesis stream, S3 and Athena. • Developed python code for different tasks, dependencies, SLA watcher and time sensor for each job for workflow management and automation using Airflow tool.• Developed and maintained ETL pipelines using Apache Airflow to orchestrate complex data workflows, including extraction, transformation, and loading processes.• Deployed data pipelines with CICD process using Code pipelines, CDK. Environment: AWS , Snowflake, Redshift, GLue, EMR, ,Python, PySpark, Airflow, CDK, Tableau, QuickSight
  • Slalom
    Big Data Engineer
    Slalom Jan 2019 - Sep 2021
    Seattle, Washington
    • Lead architecture design and automate end to end data / ETL pipelines using AWS, Google Big query and Python to migrate data from GCP/MsSQL to Snowflake• Optimized ETL processes for loading data into Snowflake, resulting in a 50% reduction in data loading time and a 15% improvement in overall data quality.• Created Datawarehouse and tables for the Marketing use cases like Leads , opportunity to support data model in Snowflake improving the query efficiency by 70%.• Designed and automated existing sales and inventory processes using SQL reducing the reporting time. • Utilized Python to get data from API and performing exploratory data analysis .• Used distributed computing tools such as Google Big Query, Snowflake, Redshift for structured processing of data, data querying and integration.• Integrated Airflow with cloud services such as AWS S3, Redshift, and EMR to facilitate seamless data transfer and processing in a distributed environment.• Developed and maintained ETL pipelines using Apache Airflow to orchestrate complex data workflows, including extraction, transformation, and loading processes.• Worked on importing data from various sources and performed transformations using HIVE.• Ingested the customer data using Flume and analyzed the customer behavior by performing click stream analysis.• Collected and aggregated large amount of log data using Flume and stored the data in HDFS for further analysis.Environment: AWS,GCP, Snowflake, Redshift, Glue, Databricks, BigQuery, Snowflake, DBT, Python, Tableau, Airflow
  • Deloitte
    Senior Consultant
    Deloitte Jan 2017 - Dec 2018
    Greater Seattle Area
    • Design, implement and support an analytical data infrastructure on traditional data warhouse and AWS• Created data warehouse and data marts to build client specific reports and dashboards• Developed ETL workflows on AWS resources using EC2, S3,RDS, Redshift and so on.• Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using SQL and AWS big data technologies.• Worked on Poc on multiple AWS technologies which solves clients usecases.
  • Jp Morgan Chase
    Associate
    Jp Morgan Chase Apr 2015 - Sep 2016
    Bangaon Area, India
    • Develop and maintain data warehouse / data Mart to enable client to effectively acquire, integrate and distribute information regarding its customers, activities, products and so on. • Designed and developed mappings, mapplets, sessions and workflows to load data from source systems to target systems• Created target database using informatica powercenter. Extracted data from different sources like flat files, oracle and other databases.• Built reusable components including mapplets and sessions.• Created and scheduled workflows using control -M for daily processing of data. • Worked extensively on Informatica powercenter, oracle plsql and bigdata technologies for the serving the client’s DW needs.• Worked with Big data teams to access feasibilty to move the current Datawarehouse to Hadoop.• Worked with advanced file formats on big data technolgy for performance optimization
  • Tata Consultancy Services
    Ita
    Tata Consultancy Services Dec 2008 - Apr 2015
    Hyderabad Area, India
    Informatica developer,Tech lead, Database developer,Oracle,Netezza

Saroj Kumar Sahoo Education Details

Frequently Asked Questions about Saroj Kumar Sahoo

What company does Saroj Kumar Sahoo work for?

Saroj Kumar Sahoo works for Atlassian

What is Saroj Kumar Sahoo's role at the current company?

Saroj Kumar Sahoo's current role is Senior Data Engineer.

What schools did Saroj Kumar Sahoo attend?

Saroj Kumar Sahoo attended Biju Patnaik University Of Technology, Odisha, Kendriya Vidyalaya.

Who are Saroj Kumar Sahoo's colleagues?

Saroj Kumar Sahoo's colleagues are Mark Lennon Mba, Pmp, Katherine Nguyen, Şadiye Alıcı, Amol Kamble, Mariam Alromaithy, Ling Li, Belkacem Abbas.

Not the Saroj Kumar Sahoo you were looking for?

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.