Over 8+ years of extensive hands on Big Data Capacity with the help of Hadoop Eco Systems across internal and cloud-based platforms. Expertise in Cloud Computing and Hadoop architecture and its various components-Hadoop File System HDFS, Map Reduce, Spark, Name node, Data Node, Job Tracker and Secondary Name Node and also in different Google Cloud Platforms like BigQuery, Dataflow, Dataproc, Pub sub and Airflow and working with various Hadoop Distributions like Cloudera, Hortonworks and Amazon EMR to fully implement and leverage new Hadoop features. Design and Development of Ingestion Framework over Google Cloud and Hadoop cluster. Strong experience using HDFS, Map Reduce, Hive, Spark, Sqoop, Oozie and HBase. Solod experience on large scale data warehousing programs and E2E data integration solutions on snowflake cloud, AWS Red shift, Informatica Intelligent cloud services and informatica power center integrated with multiple relational data bases. Experience in developing Spark Applications using Spark RDD, Spark-SQL and Data frame APIs. Worked with real-time data processing and streaming techniques using Spark streaming and Kafka. Expertise in working with HIVE data warehouse infrastructure-creating tables, data distribution by implementing Partitioning and Bucketing, developing and tuning the HQL queries. Deep knowledge of troubleshooting and tuning Spark applications and Hive scripts to achieve optimal performance and also Migrating SQL database to Azure data Lake, Azure data lake Analytics, Azure SQL Database, Data Bricks and Azure SQL Data warehouse and Controlling and granting database access and Migrating On premise databases to Azure Data lake store using Azure Data factory. Experience in GCP, Big Query, GCS bucket, G-cloud function, cloud migration, cloud dataflow, Pub/sub cloud shell, GSUTIL, BQ command line utilities, Data Proc, Stack driver. Strong understanding of Java Virtual Machines and multi-threading process and in writing complex SQL queries, creating reports and dashboards. Proficient in using Unix based Command Line Interface. Strong experience with ETL and/or orchestration tools like Talend, Oozie, Airflow. Experience setting up AWS Data Platform - AWS CloudFormation, Development EndPoints, AWS Glue, EMR and Redshift, S3, and EC2 instances. Experienced in using Agile methodologies including extreme programming, SCRUM and Test-Driven Development.
-
Senior Data EngineerFirst RepublicCleveland, Oh, Us -
Senior Data EngineerFirst Republic Apr 2022 - PresentSan Francisco, Ca, Us -
Big Data EngineerBroadridge Sep 2019 - Mar 2022New York, New York, Us -
Data EngineerMerck Pharma Gmbh May 2017 - Aug 2019 -
Hadoop DeveloperBrio Technologies Nov 2015 - Sep 2017Hyderabad, Telangana, In -
Hadoop DeveloperGrape Software Limited Sep 2015 - Oct 2015
Frequently Asked Questions about Sai Sri
What company does Sai Sri work for?
Sai Sri works for First Republic
What is Sai Sri's role at the current company?
Sai Sri's current role is Senior Data Engineer.
Free Chrome Extension
Find emails, phones & company data instantly
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial