As a Certified Senior Data Engineer at PNC Bank, I bring over 9 years of experience in architecting and deploying advanced cloud-based data solutions. My passion lies in harnessing the power of cloud technologies to drive innovation and efficiency in data engineering.🌐 Technical Proficiencies:Cloud Platforms: AWS, GCP, AzureData Engineering: Apache NiFi, Airflow, Talend, Hadoop ecosystem, Apache Kafka, Apache FlinkContainerization: Kubernetes, DockerInfrastructure as Code: Terraform, CloudFormation, Deployment ManagerMonitoring & Logging: Grafana, Splunk, Prometheus, ELK StackData Storage & Processing: Databricks, Delta Lake, AWS Lake Formation, Azure Data Lake, Google BigQuery, Amazon Redshift SpectrumReal-time Data Ingestion & Processing: Azure Stream Analytics, Google Cloud Dataflow, Apache Kafka Streams, Apache Spark Streaming, Apache Flink🔧 Key Achievements:Leading the migration of data infrastructure from on-premises to AWS Cloud, utilizing AWS Server Migration Service, AWS Database Migration Service, Developing complex data engineering solutions using Databricks and Delta Lake for unified analytics and enhanced data lake management.Implementing data encryption with KMS and HSM to secure sensitive healthcare data.Employing serverless computing (AWS Lambda, Google Cloud Functions) for scalable, cost-effective data processing workflows.🚀 Current Projects:Setting up GCP and AWS infrastructure using Terraform foundation modules.Building robust CI/CD pipelines for Kubernetes clusters hosted Managing data infrastructure on AWS, GCP, and Azure, ensuring scalability and reliability while automating pipelines with Airflow, Jenkins, and CI/CD.Integrating machine learning models using TensorFlow and PyTorch to derive actionable insights from data.Enhancing data governance with Collibra and Alation.🌐 Diverse Experience:With a strong background in computer science and data engineering, I've worked on multiple cloud platforms and projects, continually learning new technologies and best practices.💡 Vision & Contribution:I believe in bringing diverse perspectives to the team, fostering innovation, and staying at the forefront of cloud technology. My goal is to contribute significantly to the success of our projects and the growth of our organization.Let's connect and explore opportunities for collaboration! 👥 #DataEngineering #CloudEngineering #AWS #GCP #Azure #InnovationInTec
-
Senior Etl DeveloperPncUnited States -
Senior Data EngineerPnc Nov 2022 - PresentPittsburgh, Pennsylvania, United StatesDesigned and implemented scalable data architectures using GCP services like BigQuery, GCS, and Dataproc.Developed ETL/ELT pipelines with Google Cloud Dataflow, Apache Beam, and Hadoop ecosystem tools.Optimized PySpark performance for batch and streaming ETL tasks.Managed real-time data processing with Google Cloud Pub/Sub and Dataflow.Utilized Google BigQuery for data warehousing and analytics.Automated pipelines using Terraform and Google Cloud Deployment Manager.Integrated machine learning models with Vertex AI for predictive analytics.Ensured data governance with Google Data Catalog and tools like Collibra and Alation.Mentored junior engineers and implemented GCP best practices. -
Senior Data EngineerPresbyterian Healthcare Services Jan 2021 - Oct 2022New Mexico, United StatesDeveloped complex data engineering solutions using Databricks and Delta Lake on AWS for unified analytics and data lake management.Implemented data encryption with AWS KMS and HSM to secure sensitive healthcare data.Employed AWS Lambda for scalable, cost-effective data processing workflows.Managed data lakes on AWS Lake Formation for efficient storage and retrieval.Used AWS Kinesis and AWS Data Pipeline for real-time data ingestion and processing in healthcare.Integrated data pipelines with EHR systems (Epic, Cerner) for seamless data flow and analytics.Leveraged edge computing with AWS Greengrass to process healthcare data closer to the source.Managed cloud-native data warehousing using Amazon Redshift and Redshift Spectrum for scalable analytics.Automated infrastructure deployment with AWS CloudFormation and Terraform.Designed scalable data pipelines using Apache Spark, Kafka, and Flink on AWS for real-time and batch processing.Architected data solutions with AWS Glue, Data Pipeline, and Dataflow.Managed Hadoop clusters with Cloudera and Hortonworks distributions on AWS for high availability.Developed real-time data processing pipelines with Apache Kafka and Kafka Streams on AWS.Implemented real-time analytics using Apache Flink and Spark Streaming on AWS.Built robust ETL processes with AWS Glue, Talend, and Apache NiFi for large data volumes.Ensured data security and compliance in healthcare using AWS IAM for encryption and access management.Designed fault-tolerant data pipeline architectures on AWS for high availability and reliability. -
Data EngineerCruise Apr 2019 - Dec 2020San Francisco Bay AreaDesigned and maintained ETL pipelines using Apache Nifi, Talend, and Informatica.Built scalable data pipelines with Apache Kafka, Apache Flink, and Python for real-time and batch processing.Utilized Apache Spark for large-scale data processing, machine learning, and analytics.Implemented data solutions on AWS, Azure, and Google Cloud using S3, Redshift, BigQuery, Azure Synapse, and Databricks.Managed cloud-based data warehouses including Snowflake, Google BigQuery, and Amazon Redshift.Handled real-time data ingestion and processing with Kafka Streams, AWS Kinesis, and Google Pub/Sub.Employed Google Dataflow and AWS Glue for efficient data processing and transformation.Designed and managed databases such as MySQL, PostgreSQL, SQL Server, MongoDB, and Cassandra.Automated data pipelines and workflows with Apache Airflow and Google Cloud Composer.Ensured data security and compliance with GDPR, CCPA, and HIPAA regulations.Implemented infrastructure as code using Terraform and AWS CloudFormation.Optimized data queries and storage solutions for performance and scalability using Python and Scala.Developed dashboards and reports using Power BI, Tableau, and Looker.Monitored ETL jobs and real-time data streams using Grafana and Prometheus.Collaborated with cross-functional teams to gather requirements and translate them into technical specifications. -
Data EngineerDignity Health Nov 2017 - Mar 2019California, United StatesDesigned and managed ETL processes using Apache NiFi and Apache Airflow for seamless data integration.Integrated real-time data from EHR and CRM systems with Apache Kafka for continuous data streaming.Developed scalable data warehouse solutions using Snowflake and Google BigQuery.Optimized data storage and retrieval with Apache Parquet for efficient analytical querying.Ensured data accuracy and quality with Great Expectations and implemented data governance with Collibra.Created and presented insights through data visualizations using Tableau and Power BI.Monitored and improved data pipelines and performance with New Relic for enhanced efficiency.Collaborated with data scientists to integrate machine learning models into pipelines using TensorFlow for advanced analytics. -
Etl DeveloperZensar Technologies Jan 2016 - Jul 2017Pune, Maharashtra, IndiaUtilized Informatica to build and optimize data integration pipelines.Integrated diverse data sources including relational databases, NoSQL databases, flat files, and APIs.Implemented data cleansing and transformation processes to ensure high data quality and consistency.Developed reports and dashboards using Power BI and Tableau for actionable insights.Deployed and managed ETL pipelines and data solutions on AWS and Google Cloud.Implemented data governance and security protocols to ensure compliance and data protection.Developed automated scripts using Python for ETL process validation, scheduling, and monitoring. -
Sql DeveloperHyperlink Infosystem Nov 2014 - Dec 2015Ahmedabad, Gujarat, IndiaExperienced SQL Developer skilled in all phases of the Software Development Lifecycle (SDLC), from design to deployment.Proficient in Oracle SQL and PL/SQL, creating complex database objects and managing data operations.Expert in SSIS for data integration and SSRS for developing various types of reports to support business needs.Managed SQL Server environments, ensuring high availability through backup, recovery, and clustering.Optimized database performance with advanced tuning techniques and effective data structuring.
Frequently Asked Questions about Varun Reddy
What company does Varun Reddy work for?
Varun Reddy works for Pnc
What is Varun Reddy's role at the current company?
Varun Reddy's current role is Senior ETL Developer.
Who are Varun Reddy's colleagues?
Varun Reddy's colleagues are Kavonn Paxton, Girish Kumar Kalludevanahalli, William Cornett, Samantha Hawthorne, Jennifer Mooney, Marissa Tomsich, Teresa Moore.
Not the Varun Reddy you were looking for?
-
Varun Reddy
Actively Seeking Opportunities For A Senior Java Full Stack Developer Position.United States -
varun reddy
Dallas, Tx1cbre.com -
-
Varun Reddy
United States -
Varun Reddy
Backend Systems | The Odp Corporation | Mscs @ Umass Amherst | Previously At Athenahealth & AccoliteUnited States
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial