Pradeep Kumar Reddy Kambham

Pradeep Kumar Reddy Kambham Email and Phone Number

Data Engineer @ IQVIA
Celina, TX, US
Pradeep Kumar Reddy Kambham's Location
The Colony, Texas, United States, United States
Pradeep Kumar Reddy Kambham's Contact Details

Pradeep Kumar Reddy Kambham work email

Pradeep Kumar Reddy Kambham personal email

n/a
About Pradeep Kumar Reddy Kambham

* 7 Plus years of experience in providing tech solutions to complex data problems of Banking and healthcare domainsprimarily using Hadoop and Spark Ecosystems.* Excellent hands-on experience in developing Hadoop Architecture in both Windows and Linux platforms.* Experience in design, development, and implementation of big data applications using Hadoop Ecosystem frameworks and tools like HDFS,MapReduce, Yarn, Pig, Hive, Sqoop, Spark, Flume, Impala, Oozie, Zookeeper, Airflow, etc.* Good working knowledge on working with AWS cloud services like EMR, S3, Redshift, EMR, Lambda, Glue, Data Pipeline, Athena for BigData development.* Strong object-oriented and functional programming (Scala) experience.* Experinced in running spark appications on docker containes.* Having a good experience with Python scripting language to use in Spark development life cycle.* Developed multiple programs using Scala and deployed them on the Yarn cluster, compared the performance of Spark, with Hive.* Worked on Spark RDDS to do the transformations on the data.* Experience in importing data from different sources like HDFS/HBase into Spark RDD.* Expertise in Spark and its optimization.* Solid understanding of the Hadoop distributed file system data handling in the HDFS which is coming from other sources.* Involved in analysis, design, coding for various components and bug fixing. Providing test support throughout the project lifecycle tillproduction.* Worked on data processing and transformations and actions in spark by using Python (PySpark).* Proficient in data cleansing, migration, and warehousing activities with exposure to ETL process designing.* Experience in developing snowpark(snowflake)-scala applications.

Pradeep Kumar Reddy Kambham's Current Company Details
IQVIA

Iqvia

View
Data Engineer
Celina, TX, US
Pradeep Kumar Reddy Kambham Work Experience Details
  • Iqvia
    Data Engineer
    Iqvia
    Celina, Tx, Us
  • Iqvia
    Data Engineer
    Iqvia Feb 2022 - Present
    Durham, North Carolina, Us
    * Developed spark-Scala components and successfully integrated with EDGE framework to handle around 6000+ spark jobs per day.* Developed Automatic Airflow DAG builder using Python to generate the Airflow DAGs dynamically.* Optimized the EDGE framework performance by creating materialized views in PostgreSQL.* Successfully integrated the spark-scala components with Azure Synapse.* Ingested huge volumes of clinical research data into Snowflake as per customer needs.* Developed a custom component that can generate complex XML and JSON files. Also, this will support parquet and CSV files.* Developed Python script to flatten heavily nested JSON files.* Configured custom GitLab runners and successfully implemented the CICD pipeline to deploy the builds and improved its performance by leveraging the GitLab cache.* Developed spark-Scala component to record the change data capture.* Successfully migrated and productized the legacy systems to the Big data platform.* Developed Snowpark(snowflake) application using Scala to generate complex xml&json files.* Engineered DBT models and optimized DBT macros for efficient XML and JSON file generation, significantly enhancing performance and throughput.* Led the seamless migration of a Native Spark Scala application onto the Snowflake environment.* Implemented Python User-Defined Functions (UDFs) within Snowflake.
  • Nbcuniversal
    Data Engineer
    Nbcuniversal Jun 2021 - Aug 2021
    New York City, Ny, Us
    * Architected the Airflow 2.0 implementation in the CDS Platform.* Migrated the Airflow DAGs to Airflow 2.0 and optimized the DAGs by using the latest upgrades in the DAGs.* Implemented the Automated CICD pipeline for DAG deployment.* Written Airflow DAG to clean up the Airflow logs from the docker containers to improve performance.* Developed terraform modules to provision the AWS resources.* Imported the legacy AWS infrastructure to terraform code and brought it under terraform control.* Created spark-scala data pipelines using Databricks to conduct exploratory data analysis and build automated report generation.
  • Capgemini
    Data Engineer
    Capgemini Jan 2020 - Dec 2020
    Paris, France, Fr
    • Quantexa• Spark• Gradle• Scala• Oracle• Entity Resolution• Airflow• S3• Elasticsearch• Docker• Kubernetes• GitLab CI/CD• AppDynamics• Azure Databricks• Azure Blob• Splunk• Grafana* Coordinated the design, development, and integration of the Quantexa project, consolidating internal bank data and external third-party data to calculate risk scores and facilitate entity resolution.* Developed robust ETL pipelines, extracting internal banking data from Oracle using Spark and Scala, and ingesting it into the S3 bucket, resulting in a streamlined data processing workflow.* Transformed raw data into Parquet format, implemented data validations, created Scala case class models, and executed custom Scala functions to cleanse address and name fields, ensuring data consistency.* Orchestrated end-to-end ETL processes and successfully deployed them into production using Airflow, optimizing operational efficiency.* Established a CI/CD pipeline, automating build processes and application deployment, contributing to a more agile and efficient development environment.* Utilized Spark and Scala to ingest processed data into Elasticsearch, enhancing data accessibility and search capabilities.* Deployed Spark Docker images to Kubernetes containers, demonstrating proficiency in containerized application deployment.* Developed Java applications using AWS SDK to seamlessly transfer files from S3 to various target locations, improving data distribution processes.* Integrated AppDynamics to monitor Microservices running on Kubernetes containers, ensuring optimal performance and proactively addressing any potential issues.* Contributed to the creation of elements and compounds integral to the entity resolution process, further enhancing the project's overall success.* Developed a hybrid data pipeline leveraging both an on-premise Hadoop cluster and Azure Databricks to process sensitive and non-sensitive data separately.
  • Capgemini
    Data Engineer
    Capgemini Oct 2018 - Jan 2020
    Paris, France, Fr
    Data Engineer, Capgemini Ltd: DBS Bank, Hyderabad, IndiaWorking with DBS bank (Client) on Digibank App, which helps customers move from traditional banking to paperless banking. In this project, we migrate a traditional warehouse system to a new era data mart built on a newer data platform stack. This transformation helps the bank bring business logic to the presentation layer, which can be used to visually track, analyze, and display key performance indicators (KPI).* Ingested massive volumes of data from various core banking source systems like Finacle, Kony, etc., using Spark ingestion framework into ADA platform's Storage in S3.* Developed data pipelines in PySpark to support the needs of Data Science and Business Analyst teams.* Involved in data mapping and metadata preparation for the new data models using an enterprise metadata Governance tool called collibra.* Written pyspark and Teradata TPT scripts to extract and migrate the Legacy data from the old platform (BIP) to the new platform's (ADA) S3 storage.* Provisioned the GCP Dataproc cluster using the Airflow DAG for running the spark jobs and deleted the cluster once the job is done.* Created Airflow DAGS for the spark jobs and successfully deployed them into the production environment.* Written data reconciliation scripts to identify discrepancies in the legacy data.* Used Presto on top of Hive tables to achieve significantly faster performance for OLAP queries.* Used alluxio file system for speeding up the spark reading and writing operations.* Involved in tuning the performance of the long-running spark jobs.* Successfully implemented the SCD type-2 logic in hive tables.* Involved in unit testing, deployment, and change request management activities.
  • Cognizant
    Programmer Analyst
    Cognizant Apr 2016 - Oct 2018
    Teaneck, New Jersey, Us

Pradeep Kumar Reddy Kambham Education Details

  • University Of North Texas
    University Of North Texas
    Data Science
  • Nbkr Institute Of Science And Technology
    Nbkr Institute Of Science And Technology
    Electronics And Communications Engineering

Frequently Asked Questions about Pradeep Kumar Reddy Kambham

What company does Pradeep Kumar Reddy Kambham work for?

Pradeep Kumar Reddy Kambham works for Iqvia

What is Pradeep Kumar Reddy Kambham's role at the current company?

Pradeep Kumar Reddy Kambham's current role is Data Engineer.

What is Pradeep Kumar Reddy Kambham's email address?

Pradeep Kumar Reddy Kambham's email address is pr****@****uni.com

What schools did Pradeep Kumar Reddy Kambham attend?

Pradeep Kumar Reddy Kambham attended University Of North Texas, Nbkr Institute Of Science And Technology.

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.