Kaushik A

Kaushik A Email and Phone Number

Sr. Data Engineer @ Johnson & Johnson
United States
Kaushik A's Location
St Louis, Missouri, United States, United States
About Kaushik A

• Results-driven Data Engineer with 10 years of experience in leveraging Palantir Foundry to design, develop, and deploy innovative applications that enhance operational efficiency. Adept at utilizing tools such as Palantir Workshop and Carbon Workspace to support dealer workflow application development.• My expertise in data analysis, modeling, and data model implementation for enterprise-level applications has been honed over the course of my more than ten years as a data engineer, analyst, and SQL developer. Data acquisition, preparation, modeling, and deployment are all areas of competence, I cover throughout the whole Data Science lifecycle.• Proficient in using Palantir Workshop and Carbon Workspace to streamline dealer workflow application development, ensuring user-friendly and efficient applications.• I have a strong background in statistical methods such as hypothesis testing, PCA, ANOVA and time-series analysis and I particularly thrive at data migration to the Snowflake cloud data warehouse. Furthermore, resource monitoring, RBAC restrictions and query performance tuning are among the many Snowflake principles in which I excel. My proficiency lies in preprocessing data using Python Data Science Packages, and I have worked with deep learning platforms such as TensorFlow, Keras, AWS ML, and Azure ML Studio, as well as text mining and natural language processing.• I have experience creating applications with machine learning and natural language processing that are ready for production. My experience includes real-time analysis with Kafka and Spark Streaming, high-performance computing using Spark/Hadoop, and data modeling for Data Mart/Data Warehouse development.• I have experience creating DBT models, fine-tuning models with Grid Search, and applying NLP strategies and toolkits. I also understand deep learning, convolutional neural networks, and artificial neural networks quite well. Statistical techniques, Scala application programming, RDBMS and NoSQL database management, and tool-based data visualization are among my other proficiencies.• I have expertise working in Agile environments and am a skilled user of version control and project management software.

Kaushik A's Current Company Details
Johnson & Johnson

Johnson & Johnson

View
Sr. Data Engineer
United States
Website:
jnj.com
Employees:
100890
Kaushik A Work Experience Details
  • Johnson & Johnson
    Sr. Data Engineer
    Johnson & Johnson
    United States
  • Wellcare Medicare
    Sr. Azure Data Engineer / Sr. Data Engineer
    Wellcare Medicare Mar 2023 - Present
    Tampa, Florida, United States
    • Expertise in building and optimizing end-to-end data solutions within Palantir Foundry, from data ingestion to analysis and visualization.• Developed end-to-end solutions for managing IoT data streams in manufacturing settings, ensuring high-quality data capture, processing, and analytics.• Integrated data from diverse IoT devices, sensors, and external systems into centralized databases, ensuring smooth data flow and accurate insights for decision-making.• Implemented best practices for securing IoT data and ensuring compliance with data privacy regulations using AWS tools and other security frameworks.• Experience in building multiple Data pipelines, end to end ETL and ELT process for Data ingestion and transformation in GCP and coordinate task among the team.• Develop and schedule ETL workflows using Azure Data Factory (ADF) and Spark-SQL in Azure Databricks.• Work with similar Microsoft on-prem data platforms, specifically SQL Server and related technologies such as SSIS, SSRS, and SSAS.• Experience in developing custom tools and applications within Foundry to meet specific business requirements, enhancing user engagement and workflow efficiency.• Designed Star and Snowflake Data Models for Enterprise Data Warehouse using ER Studio.• Skilled in using Big Data tools and technologies such as BigQuery, Databricks, Snowflake, Spark, and Kafka Streams for efficient data processing.• Experienced in building data pipelines for analyzing structured and unstructured data using tools such as HDFS, HIVE, HBase, Pig, Spark, Kafka, Scala, Control-M & StreamSets ETL.• Familiar with Palantir Foundry and various data warehouses, including SQL Azure and Confidential Redshift/RDS.
  • U.S. Bank
    Snowflake Data Engineer
    U.S. Bank Oct 2021 - Feb 2023
    Minneapolis, Minnesota, United States
    • Maintained proficiency in Oracle/PL-SQL and Informatica, ensuring data processing excellence.• Adapted to reporting tools such as OBIEE and Power BI, delivering actionable insights.• Experience in building Power BI reports on Azure Synapse Analysis services for better performance.• Performed data quality issue analysis using Snow SQL by building analytical warehouses on Snowflake.• Managed user access and metadata changes as a Snowflake DBA, ensuring security and integrity of the platform.• Demonstrated expertise in Google Cloud Platform (GCP, including Big Query, Compute Engine, and Google Cloud Storage (GCS).• Built data workflows with ETL, SSIS, BI, DW, AWS EMR, Data Lake Formation, Redshift, Hadoop, Spark, Spark, SQL, Scala, and Python.• Developed Project Specific Java API's for the new requirements with Effective usage of Data Structures, Algorithms and C Java, JMS. OOPS concepts.• Worked on ETL tool Informatica, Oracle Database and PL/SQL, Python, and Shell Scripts.• Developed and implemented Apache NIFI across various environments, written QA scripts in Python.
  • Ebay
    Azure Data Engineer
    Ebay Apr 2019 - Sep 2021
    San Jose, California, United States
    • Developed SQOOP queries to export HIVE tables to Azure SQL database scheduled using CRON tab.• Used Sqoop to load data for creation of RDD’s, Datasets and Data frames in Spark SQL.• Used Qlik Replicate to manage the Change Data Capture (CDC) and automate the data loading into HDFS.• Worked on end-to-end machine learning workflow, written python code for gathering the data from Azure Synapse, snowflake, data preprocessing, feature extraction, feature engineering, modeling, evaluating the model, deployment.• Cloud computing implementation experience using HDInsight, Azure Data Lake (COSMOS), Azure Data Factory, Azure Machine Learning & PowerShell scripting.• Documented logical, physical, relational and dimensional data models. We signed the Data Marts in dimensional Data Modelling using star and snowflake schemas.• Created pipelines, data flows and complex data transformations and manipulations using ADF and PySpark with Databricks.• Skilled in using collections in Python for manipulating and looping through different user defined objects.• Designed, developed extensive additions to existing Struts, Java, J2EE Web Application utilizing Service Oriented Architecture (SOA) techniques.• Design and develop data solutions in Azure Data Lake, Data Factory, and SQL Database environment.
  • Kpmg Us
    Data Engineer
    Kpmg Us Nov 2016 - Mar 2019
    Montvale, New Jersey, United States
    • Developed a 16-node cluster in designing the Data Lake with the Cloudera Distribution. • Implemented and configured High Availability Hadoop Cluster.• Developed Hive scripts to analyze data and PHI are categorized into different segments and promotions are offered to customer based on segments.• Developed UDFs in Java as and when necessary to use in and HIVE queries.• Experience with Cosmos DB migration from other databases such as SQL Server or MongoDB.• Utilized Azure SQL as external hive Meta store for HDInsight clusters so that metadata is persisted across multiple clusters.• Developed JSON Scripts for deploying the Pipeline in Azure Data Factory (ADF) that processes the data using the SQL Activity.• Developed ETL pipelines to extract data from various sources, including on-premises databases and cloud-based data sources, and loaded them into Azure for analysis and visualization in Tableau.
  • Sonata Software
    Data Analyst
    Sonata Software Dec 2013 - Sep 2016
    Hyderabad, Telangana, India
    • Designed data architecture for one project with cloud computing environment using Amazon Web Services AWS for hosting the databases.• Worked with OLAP cubes to generate drill through reports in SSRS.• Designed and implemented end-to-end data pipelines for large-scale dataset extraction, transformation, and loading utilizing tools like Apache Spark, Apache Kafka, and Apache Airflow.• Implemented data ingestion pipelines to ingest structured and unstructured data from various sources into Hadoop, using technologies such as Sqoop or Flume, ensuring data availability for downstream processing.• Developed and optimized complex SQL queries and python scripts to extract, transform, and load data from multiple sources into a centralized data warehouse using tools like SQL Server, MySQL, or PostgreSQL.• Experience in integrating Informatica with various databases and data warehouses, such as Oracle, SQL Server, Teradata and Hadoop and Proficient in writing complex SQL queries for optimal data storage and retrieval.• Designed and implemented automated ETL pipelines using AWS services like AWS Glue and AWS Lambda and migrated existing Informatica ETL processes into AWS cloud platform.

Kaushik A Education Details

Frequently Asked Questions about Kaushik A

What company does Kaushik A work for?

Kaushik A works for Johnson & Johnson

What is Kaushik A's role at the current company?

Kaushik A's current role is Sr. Data Engineer.

What schools did Kaushik A attend?

Kaushik A attended Osmania University, Hyderabad.

Who are Kaushik A's colleagues?

Kaushik A's colleagues are Ben Deacon Mrics, Leslie Fox, Cornell Hamilton, Mac Lai, Arabinda Hazra, İsa Dere, Hilary Profrock.

Not the Kaushik A you were looking for?

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.