Kai-Cheng Wu

Kai-Cheng Wu Email and Phone Number

Data Engineer
Kai-Cheng Wu's Location
Milpitas, California, United States, United States
Kai-Cheng Wu's Contact Details

Kai-Cheng Wu work email

Kai-Cheng Wu personal email

About Kai-Cheng Wu

Data Engineering to process user and log data for data analytics. Performance engineering for storage, database, Hadoop, and user applications.Interested in database design and development using Redshift, Databricks, Hive, Oracle, PostgreSQL.Building data pipelines with Python, Airflow, and Spark including KPIs and Enterprise data based on Salesforce.Specialties: Design and implement large databases. Cross team development including offshore and off-site.

Kai-Cheng Wu's Current Company Details

Data Engineer
Kai-Cheng Wu Work Experience Details
  • Workday
    Data Engineer
    Workday Jul 2021 - Mar 2023
    Pleasanton, California, Us
    • Building Airflow data pipelines for ingestion and transformation with Python, EMR, Redshift, and S3 using APIs for Salesforce and Gainsight (Customer Success) data sources. Creating KPI semantic views for Tableau to use • Create dimensional modeling for upstream tables.• Research, propose, and implement conversion to Databricks from Redshift based on Delta Tables.
  • Insight Global
    Data Engineer
    Insight Global Nov 2020 - Jul 2021
    Atlanta, Georgia, Us
    Workforce Intelligence Recommendation: (Amazon) 11/2020 – 7/2021• Building pipelines for Amazon internal candidate and site location data, and external labor force data using Redshift, Glue, SageMaker, and Airflow. • Generating nearby site pursuits based on commute time regions using MapBox and ArcGIS isocrhone REST APIs for SageMaker to build ML models.• Publishing QuickSight with labor potential, competitor insight, and candidate intelligence based on SageMaker ML models and external Census data.
  • Contractor
    Data Engineer
    Contractor Feb 2020 - Nov 2020
    • Designed and built data pipelines for a headcount analytics platform. Combining headcount, recruitment and hierarchical data to post to Essbase from Teradata.• Airflow integration for the hardware activation data when triggering from email events to support Tableau and Tableau Prep down streams.
  • K2 Partnering Solutions @ Fb
    Data Engineer (Contractor)
    K2 Partnering Solutions @ Fb Jun 2019 - Jan 2020
    London, England, Gb
    Built Presto data pipelines from MySQL, Oracle, and in-memory data objects. Created KPI metrics backends and dashboards. Reconciled data between Oracle and Presto.
  • Intuit
    Data Engineer (Contractor)
    Intuit Jul 2018 - Jun 2019
    Mountain View, California, Us
    • Enhancing Hive queries to improve the eligibility pool of loan candidates by analyzing historical credit activities. Developing Lambda and DynamoDB codes for AWS migration.• Building a machine learning pipeline to analyze model performance by using Spark Python to extract data from a REST API and load flatten confidence scores into Hive; Developing Scala codes to detect and assign inter account transfer for revenue calculation.• Building metrics Hive backends based on KPI data models for Marketing and Finance Tableau dashboards – monitoring loan performance and revenue. Populate Hive tables with UDF generated by H2O propensity score matching models.
  • Dell Emc
    Data Engineer/Applications Solution Engineer
    Dell Emc Jun 2010 - Jun 2018
    Round Rock, Texas, Us
    Big Data Analytics• Successfully supported and secured a TPCx-BB top rank for the Hadoop benchmark publication for Dell 14G servers and networking in the Scale Factor 10000 category. Tasks included tuning Hive configurations, Spark Machine Learning Library with MKL (Intel Math Kernel Library), validated with Hive explain plans, and performed Hive on Tez characterization. Developed automated performance codes (TPCx-BB, IO characterization, and network throughput) with Python.• Validated and designed on-prem BDaaS (Big Data as a Service) for BlueData. Performance characterized BlueData docker multi-tenancy for Hadoop, Cassandra, and R-Studio. Compared NVIDIA Tesla V100 and Pascal P100 GPU vs CPU for MapD databases and TensorFlow. Developed Python codes to drive tests and collect results.IOT and Analytical Insight Modules• IOT data ingestion and ETL: Parsed and flattened MongoDB JSON files to load into Impala. Involved in development of predictive maintenance model for IOT devices.• Executed scale-out performance for the EMC Analytical Insight Module in the Isilon Data Lake environment.Hadoop and NoSQL• Automated scale-out performance and tuning Hadoop performance for Isilon OneFS/WebHDFS.• Produced competitive studies for Greenplum, Cassandra under different configurations including virtual and bare metal.• Automated and developed python scripts to drive workloads for TPC benchmark, Terasort and Cassandra tests; Generating ETL code scripts and datasets.Virtualization of Greenplum Appliance• Completed Oracle code conversion and data migration to virtualized Greenplum for EMC DWBI platform. Utilized open source and customized Perl scripts to convert Oracle stored packages and DDLs, extract data from Oracle, and load into Greenplum.• Presented at EMC World.Flash Drive/SSD Characterization for Data Warehouse, Hadoop and OLTP• Performance analyzed and characterized for DSS, Analytics, ETL and OLTP workloads.
  • Verisign
    Sr. Software Developer
    Verisign Sep 2009 - Jul 2010
    Reston, Virginia (Va), Us
    Migrated encrypted sensitive token data from RSA to VeriSign platform.
  • The Nielsen Company (Nielsen//Online) - Formerly Nielsen//Netratings
    Database Software Engineer Lead/Database Support
    The Nielsen Company (Nielsen//Online) - Formerly Nielsen//Netratings Jan 2003 - Jun 2009
    New York, Ny, Us
     Data Services, ETL, and Web Mining: Provide internet panelists activity data to the clients and inter-company divisions.- Process data under Netezza and Oracle.- Desing database structure and file layouts.- Extract and create files using Java, sh and SQL.- Transport and notify through TIBCO messaging.  Analytical Platform for Internet Panlist Behavior: Design and develop OLAP solutions under Netezza to supply panelists’ surfing activity data. Leveraging Cognos for additional analytical functions and export features.- Batch process matching url patterns to track internet activity of the panelists.- Data copying from Oracle and other data sources.Validate, develop and design reports for custom requirement.

Kai-Cheng Wu Skills

Databases Data Warehousing Oracle Sql Database Design Java Etl Agile Methodologies Web Services Web Applications Data Analysis Hadoop Linux Postgresql Hive Emr Aws Apache Spark Aws Lambda Amazon Dynamodb Greenplum Netezza Cognos

Kai-Cheng Wu Education Details

  • California State Polytechnic University-Pomona
    California State Polytechnic University-Pomona
    Computer Science

Frequently Asked Questions about Kai-Cheng Wu

What is Kai-Cheng Wu's role at the current company?

Kai-Cheng Wu's current role is Data Engineer.

What is Kai-Cheng Wu's email address?

Kai-Cheng Wu's email address is ka****@****hoo.com

What is Kai-Cheng Wu's direct phone number?

Kai-Cheng Wu's direct phone number is +150849*****

What schools did Kai-Cheng Wu attend?

Kai-Cheng Wu attended California State Polytechnic University-Pomona.

What are some of Kai-Cheng Wu's interests?

Kai-Cheng Wu has interest in Kids, Exercise, Electronics, Home Improvement, Reading, Home Decoration.

What skills is Kai-Cheng Wu known for?

Kai-Cheng Wu has skills like Databases, Data Warehousing, Oracle, Sql, Database Design, Java, Etl, Agile Methodologies, Web Services, Web Applications, Data Analysis, Hadoop.

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.