• Results-driven Data Engineer with 10 years of experience in leveraging Palantir Foundry to design, develop, and deploy innovative applications that enhance operational efficiency. Adept at utilizing tools such as Palantir Workshop and Carbon Workspace to support dealer workflow application development.• My expertise in data analysis, modeling, and data model implementation for enterprise-level applications has been honed over the course of my more than ten years as a data engineer, analyst, and SQL developer. Data acquisition, preparation, modeling, and deployment are all areas of competence, I cover throughout the whole Data Science lifecycle.• Proficient in using Palantir Workshop and Carbon Workspace to streamline dealer workflow application development, ensuring user-friendly and efficient applications.• I have a strong background in statistical methods such as hypothesis testing, PCA, ANOVA and time-series analysis and I particularly thrive at data migration to the Snowflake cloud data warehouse. Furthermore, resource monitoring, RBAC restrictions and query performance tuning are among the many Snowflake principles in which I excel. My proficiency lies in preprocessing data using Python Data Science Packages, and I have worked with deep learning platforms such as TensorFlow, Keras, AWS ML, and Azure ML Studio, as well as text mining and natural language processing.• I have experience creating applications with machine learning and natural language processing that are ready for production. My experience includes real-time analysis with Kafka and Spark Streaming, high-performance computing using Spark/Hadoop, and data modeling for Data Mart/Data Warehouse development.• I have experience creating DBT models, fine-tuning models with Grid Search, and applying NLP strategies and toolkits. I also understand deep learning, convolutional neural networks, and artificial neural networks quite well. Statistical techniques, Scala application programming, RDBMS and NoSQL database management, and tool-based data visualization are among my other proficiencies.• I have expertise working in Agile environments and am a skilled user of version control and project management software.
-
Sr. Data EngineerJohnson & JohnsonUnited States -
Sr. Azure Data Engineer / Sr. Data EngineerWellcare Medicare Mar 2023 - PresentTampa, Florida, United States• Expertise in building and optimizing end-to-end data solutions within Palantir Foundry, from data ingestion to analysis and visualization.• Developed end-to-end solutions for managing IoT data streams in manufacturing settings, ensuring high-quality data capture, processing, and analytics.• Integrated data from diverse IoT devices, sensors, and external systems into centralized databases, ensuring smooth data flow and accurate insights for decision-making.• Implemented best practices for securing IoT data and ensuring compliance with data privacy regulations using AWS tools and other security frameworks.• Experience in building multiple Data pipelines, end to end ETL and ELT process for Data ingestion and transformation in GCP and coordinate task among the team.• Develop and schedule ETL workflows using Azure Data Factory (ADF) and Spark-SQL in Azure Databricks.• Work with similar Microsoft on-prem data platforms, specifically SQL Server and related technologies such as SSIS, SSRS, and SSAS.• Experience in developing custom tools and applications within Foundry to meet specific business requirements, enhancing user engagement and workflow efficiency.• Designed Star and Snowflake Data Models for Enterprise Data Warehouse using ER Studio.• Skilled in using Big Data tools and technologies such as BigQuery, Databricks, Snowflake, Spark, and Kafka Streams for efficient data processing.• Experienced in building data pipelines for analyzing structured and unstructured data using tools such as HDFS, HIVE, HBase, Pig, Spark, Kafka, Scala, Control-M & StreamSets ETL.• Familiar with Palantir Foundry and various data warehouses, including SQL Azure and Confidential Redshift/RDS. -
Snowflake Data EngineerU.S. Bank Oct 2021 - Feb 2023Minneapolis, Minnesota, United States• Maintained proficiency in Oracle/PL-SQL and Informatica, ensuring data processing excellence.• Adapted to reporting tools such as OBIEE and Power BI, delivering actionable insights.• Experience in building Power BI reports on Azure Synapse Analysis services for better performance.• Performed data quality issue analysis using Snow SQL by building analytical warehouses on Snowflake.• Managed user access and metadata changes as a Snowflake DBA, ensuring security and integrity of the platform.• Demonstrated expertise in Google Cloud Platform (GCP, including Big Query, Compute Engine, and Google Cloud Storage (GCS).• Built data workflows with ETL, SSIS, BI, DW, AWS EMR, Data Lake Formation, Redshift, Hadoop, Spark, Spark, SQL, Scala, and Python.• Developed Project Specific Java API's for the new requirements with Effective usage of Data Structures, Algorithms and C Java, JMS. OOPS concepts.• Worked on ETL tool Informatica, Oracle Database and PL/SQL, Python, and Shell Scripts.• Developed and implemented Apache NIFI across various environments, written QA scripts in Python. -
Azure Data EngineerEbay Apr 2019 - Sep 2021San Jose, California, United States• Developed SQOOP queries to export HIVE tables to Azure SQL database scheduled using CRON tab.• Used Sqoop to load data for creation of RDD’s, Datasets and Data frames in Spark SQL.• Used Qlik Replicate to manage the Change Data Capture (CDC) and automate the data loading into HDFS.• Worked on end-to-end machine learning workflow, written python code for gathering the data from Azure Synapse, snowflake, data preprocessing, feature extraction, feature engineering, modeling, evaluating the model, deployment.• Cloud computing implementation experience using HDInsight, Azure Data Lake (COSMOS), Azure Data Factory, Azure Machine Learning & PowerShell scripting.• Documented logical, physical, relational and dimensional data models. We signed the Data Marts in dimensional Data Modelling using star and snowflake schemas.• Created pipelines, data flows and complex data transformations and manipulations using ADF and PySpark with Databricks.• Skilled in using collections in Python for manipulating and looping through different user defined objects.• Designed, developed extensive additions to existing Struts, Java, J2EE Web Application utilizing Service Oriented Architecture (SOA) techniques.• Design and develop data solutions in Azure Data Lake, Data Factory, and SQL Database environment. -
Data EngineerKpmg Us Nov 2016 - Mar 2019Montvale, New Jersey, United States• Developed a 16-node cluster in designing the Data Lake with the Cloudera Distribution. • Implemented and configured High Availability Hadoop Cluster.• Developed Hive scripts to analyze data and PHI are categorized into different segments and promotions are offered to customer based on segments.• Developed UDFs in Java as and when necessary to use in and HIVE queries.• Experience with Cosmos DB migration from other databases such as SQL Server or MongoDB.• Utilized Azure SQL as external hive Meta store for HDInsight clusters so that metadata is persisted across multiple clusters.• Developed JSON Scripts for deploying the Pipeline in Azure Data Factory (ADF) that processes the data using the SQL Activity.• Developed ETL pipelines to extract data from various sources, including on-premises databases and cloud-based data sources, and loaded them into Azure for analysis and visualization in Tableau. -
Data AnalystSonata Software Dec 2013 - Sep 2016Hyderabad, Telangana, India• Designed data architecture for one project with cloud computing environment using Amazon Web Services AWS for hosting the databases.• Worked with OLAP cubes to generate drill through reports in SSRS.• Designed and implemented end-to-end data pipelines for large-scale dataset extraction, transformation, and loading utilizing tools like Apache Spark, Apache Kafka, and Apache Airflow.• Implemented data ingestion pipelines to ingest structured and unstructured data from various sources into Hadoop, using technologies such as Sqoop or Flume, ensuring data availability for downstream processing.• Developed and optimized complex SQL queries and python scripts to extract, transform, and load data from multiple sources into a centralized data warehouse using tools like SQL Server, MySQL, or PostgreSQL.• Experience in integrating Informatica with various databases and data warehouses, such as Oracle, SQL Server, Teradata and Hadoop and Proficient in writing complex SQL queries for optimal data storage and retrieval.• Designed and implemented automated ETL pipelines using AWS services like AWS Glue and AWS Lambda and migrated existing Informatica ETL processes into AWS cloud platform.
Kaushik A Education Details
-
Computer Science
Frequently Asked Questions about Kaushik A
What company does Kaushik A work for?
Kaushik A works for Johnson & Johnson
What is Kaushik A's role at the current company?
Kaushik A's current role is Sr. Data Engineer.
What schools did Kaushik A attend?
Kaushik A attended Osmania University, Hyderabad.
Who are Kaushik A's colleagues?
Kaushik A's colleagues are Ben Deacon Mrics, Leslie Fox, Cornell Hamilton, Mac Lai, Arabinda Hazra, İsa Dere, Hilary Profrock.
Not the Kaushik A you were looking for?
-
Kaushik Raam A G
United States2powerint.com, colorado.edu -
Brinda Kaushik B A
Master Of Science In Business Analytics @University Of Massachusetts Amherst | Ex-JuspayAmherst, Ma
Free Chrome Extension
Find emails, phones & company data instantly
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial