Sai R. Email & Phone Number
Who is Sai R.? Overview
A concise factual answer block for searchers comparing this professional profile.
Sai R. is listed as Senior GCP Data Engineer at CorEvitas, LLC, a company with 176 employees, based in Dallas-Fort Worth Metroplex, United States, United States. AeroLeads shows a matched LinkedIn profile for Sai R..
Sai R. previously worked as GCP Data Engineer at Blue Cross Blue Shield Association and Data Engineer at Conduent. Sai R. holds Master Of Science - Ms, Information Technology from University Of Denver.
Email format at CorEvitas, LLC
This section adds company-level context without repeating Sai R.'s masked contact details.
Review company-level records connected to Sai R. before choosing the right outreach path.
About Sai R.
• Over 6+ years of experience in designing and building scalable data pipelines to collect, parse, clean, and transform data from multiple source systems and generate high-quality data sets for advanced analytics, dashboards, alerts, and visualizations.• Experience with Big Data/ Hadoop Ecosystem: Spark, Hadoop, Hive, Airflow, Sqoop, Kafka, Oozie, Databricks.• In-depth understanding of Spark Architecture and performed several batch and real-time data stream operations using Spark (Core, SQL, Structured Streaming).• Hands-on experience in GCP, Big Query, and GCS bucket experienced in handling large datasets using Spark in-memory capabilities, Partitions, Broadcast variables, Accumulators, Effective and efficient joins.• Experience using different data engineering frameworks on Cloudera, AWS, and Azure.• Performed Hive operations on large datasets with proficiency in writing HiveQL queries using transactional and performance-efficient concepts: Partitioning, Bucketing, Windowing, etc.• Good understanding of storage technologies like HDFS (Hadoop Distributed File System), AWS S3, and Azure ADLS.• Good experience in using different file formats like Parquet, Avro, and ORC in different parts of the data lake.• Hands-on work experience in writing applications on NoSQL database -HBase.• Extensive experience working with Spark, Hive, Python, Azure, and AWS suites to create Data Pipelines.• Extensively worked on Spark performance tuning.• Comprehensive experience in designing and implementing Data Lake, Data Modelling, and Data Warehousing solutions using traditional and modern data platforms.• Understanding and experience with Extract, Transform, Load (ETL) methodologies, integrating with Big Data Systems like Hadoop.• A thorough cloud professional to implement any data-related transformations using AWS and Azure cloud offerings.• Experience in importing and exporting data using Sqoop to HDFS from Relational Database Systems.• Gained knowledge and expertise in PostgreSQL which is an open-source object-relational database system, used as the primary data store or data warehouse for many web, mobile, geospatial, and analytics applications• Experienced the integration of various data sources like RDBMS, Spreadsheets, and Text files.• Expertise in developing and scheduling jobs using Airflow, Azure Data Factory, Oozie, Crontab, and Elastic search to index, fetch, and filter log data.• Good experience in Data Modelling with star schema and snowflake schema• Created facts and dimensions tables according to the data model.
Sai R.'s current company
Company context helps verify the profile and gives searchers a useful next step.
Sai R. work experience
A career timeline built from the work history available for this profile.
Gcp Data Engineer
Current- Configure, monitor, and automate Google Cloud Services as well as be involved in deploying the services using Google compute engine and Google storage buckets.
- Worked in a Machine learning operations team that was building a machine learning platform to streamline the ML lifecycle including data capture, analysis, model training, evaluation, and model deployment.
- Involved in designing and building modern data solutions using Google Cloud to support data visualization.
- Experience in GCP DataProc, GCS, Cloud functions, BigQuery Utilizing the current state of production and determining the impact of novel implementation on existing business processes.
- Developed data transition programs from Teradata to Google BigQuery using Google function by creating functions in Python for certain events based on client requirements.
- Involved in building Data pipelines, end-to-end ETL, and ELT processes for Data ingestion and transformation in GCP using an App engine.
Data Engineer
- Designed and developed a Security Framework to provide fine-grained access to objects in AWS S3 using AWS Lambda, and DynamoDB.
- Set up and worked on Kerberos authentication principals to establish secure network communication on cluster and testing of HDFS, Hive, Pig, and MapReduce to access cluster for new users.
- Performed to-end Architecture & implementation assessment of various AWS services like Amazon EMR, Redshift, S3
- Developed spark applications in Python (PySpark) on a distributed environment to load a huge number of CSV files.
- Created Databricks notebooks using SQL, Python, and automated notebooks using jobs
- Implemented near real-time data pipeline using a framework based on Kafka, and Spark.
Data Engineer
- Analyzed large amounts of data sets to determine the optimal way to aggregate and report on them using Map Reduce programs.
- Responsible for data services and data movement infrastructures, worked with ETL concepts, building ETL solutions and Data modeling.
- Designed, developed, implemented, and maintained solutions for using Docker, Jenkins, and Git, for microservices and continuous deployment.
- Involved in loading data from rest endpoints to Kafka Producers and transferring the data to Kafka Brokers.
- Worked on Snowflake Schemas and Data Warehousing and processed batch and streaming data load pipeline using Snow Pipe and Matillion from data lake Confidential AWS S3 bucket.
- Involved in creating Hive tables, loading and analyzing data using Hive queries, and writing complex Hive queries to transform the data.
Data Engineer
- Excellent SQL Server administration skills including Database Creation, Tables, Indexes, and Clusters Creation.
- Evaluated the suitability of Hadoop and its ecosystem for the project and implemented/Validated various proof of concept (POC) applications to eventually adopt them to benefit from the Bigdata Hadoop initiative.
- Estimated the Name Node and Data Node software and hardware requirements and planned the cluster.
- Extracted the needed data from the server into HDFS and bulk-loaded the cleaned data into HBase.
- Designed, implemented, and deployed within the customer’s existing Hadoop, Cassandra cluster for a series of custom parallel algorithms for various customer-defined metrics and unsupervised learning models.
- Using the Spark framework Enhanced and optimized product Spark code to aggregate, group, and run data mining tasks.
Sai R. education
Frequently asked questions about Sai R.
Quick answers generated from the profile data available on this page.
What company does Sai R. work for?
Sai R. works for CorEvitas, LLC.
What is Sai R.'s role at CorEvitas, LLC?
Sai R. is listed as Senior GCP Data Engineer at CorEvitas, LLC.
Where is Sai R. based?
Sai R. is based in Dallas-Fort Worth Metroplex, United States, United States while working with CorEvitas, LLC.
What companies has Sai R. worked for?
Sai R. has worked for Corevitas, Llc, Blue Cross Blue Shield Association, Conduent, Talen Energy, and Ndiz Solutions.
How can I contact Sai R.?
You can use AeroLeads to view verified contact signals for Sai R. at CorEvitas, LLC, including work email, phone, and LinkedIn data when available.
What schools did Sai R. attend?
Sai R. holds Master Of Science - Ms, Information Technology from University Of Denver.
Search by job title, company, industry, location, and seniority. Export verified B2B contact data when you need it.
Start free trialCheck these profiles if this is not the Sai R. you were looking for.
View similar profiles