Data engineering to process user and log data for analytics. Performance engineering for storage, database, Hadoop, and user applications. Interested in database design and development using Redshift, Databricks, Hive, Oracle, and PostgreSQL. Building data pipelines with Python, Airflow, and Spark, including KPIs and enterprise data based on Salesforce. Specialties: designing and implementing large databases; cross-team development, including offshore and off-site teams.
-
Data Engineer
Workday, Jul 2021 - Mar 2023
Pleasanton, California, US
• Building Airflow data pipelines for ingestion and transformation with Python, EMR, Redshift, and S3, using APIs for Salesforce and Gainsight (Customer Success) data sources. Creating KPI semantic views for Tableau.
• Creating dimensional models for upstream tables.
• Researching, proposing, and implementing the conversion from Redshift to Databricks based on Delta tables.
-
Data Engineer
Insight Global, Nov 2020 - Jul 2021
Atlanta, Georgia, US
Workforce Intelligence Recommendation (Amazon), 11/2020 – 7/2021
• Building pipelines for Amazon internal candidate and site location data and external labor force data using Redshift, Glue, SageMaker, and Airflow.
• Generating nearby site pursuits based on commute-time regions, using MapBox and ArcGIS isochrone REST APIs, for SageMaker to build ML models.
• Publishing QuickSight dashboards with labor potential, competitor insight, and candidate intelligence based on SageMaker ML models and external Census data.
-
Data Engineer
Contractor, Feb 2020 - Nov 2020
• Designed and built data pipelines for a headcount analytics platform, combining headcount, recruitment, and hierarchical data to post to Essbase from Teradata.
• Integrated Airflow for hardware activation data, triggered by email events, to support downstream Tableau and Tableau Prep.
-
Data Engineer (Contractor)
K2 Partnering Solutions @ Fb, Jun 2019 - Jan 2020
London, England, GB
Built Presto data pipelines from MySQL, Oracle, and in-memory data objects. Created KPI metrics backends and dashboards. Reconciled data between Oracle and Presto.
-
Data Engineer (Contractor)
Intuit, Jul 2018 - Jun 2019
Mountain View, California, US
• Enhancing Hive queries to improve the eligibility pool of loan candidates by analyzing historical credit activities. Developing Lambda and DynamoDB code for AWS migration.
• Building a machine learning pipeline to analyze model performance, using Spark Python to extract data from a REST API and load flattened confidence scores into Hive. Developing Scala code to detect and assign inter-account transfers for revenue calculation.
• Building Hive metrics backends based on KPI data models for Marketing and Finance Tableau dashboards that monitor loan performance and revenue. Populating Hive tables with UDFs generated by H2O propensity score matching models.
-
Data Engineer/Applications Solution Engineer
Dell EMC, Jun 2010 - Jun 2018
Round Rock, Texas, US
Big Data Analytics
• Supported and secured a top TPCx-BB ranking in the Hadoop benchmark publication for Dell 14G servers and networking in the Scale Factor 10000 category. Tasks included tuning Hive configurations and the Spark Machine Learning Library with MKL (Intel Math Kernel Library), validating with Hive explain plans, and characterizing Hive on Tez. Developed automated performance code (TPCx-BB, IO characterization, and network throughput) in Python.
• Validated and designed on-prem BDaaS (Big Data as a Service) for BlueData. Characterized the performance of BlueData Docker multi-tenancy for Hadoop, Cassandra, and R-Studio. Compared NVIDIA Tesla V100 and Pascal P100 GPUs against CPUs for MapD databases and TensorFlow. Developed Python code to drive tests and collect results.
IoT and Analytical Insight Modules
• IoT data ingestion and ETL: parsed and flattened MongoDB JSON files to load into Impala. Contributed to the development of a predictive maintenance model for IoT devices.
• Executed scale-out performance testing for the EMC Analytical Insight Module in the Isilon Data Lake environment.
Hadoop and NoSQL
• Automated scale-out performance testing and tuned Hadoop performance for Isilon OneFS/WebHDFS.
• Produced competitive studies for Greenplum and Cassandra under different configurations, including virtual and bare metal.
• Developed Python scripts to automate and drive workloads for TPC benchmark, Terasort, and Cassandra tests; generated ETL scripts and datasets.
Virtualization of Greenplum Appliance
• Completed Oracle code conversion and data migration to virtualized Greenplum for the EMC DWBI platform. Used open source and customized Perl scripts to convert Oracle stored packages and DDLs, extract data from Oracle, and load it into Greenplum.
• Presented at EMC World.
Flash Drive/SSD Characterization for Data Warehouse, Hadoop and OLTP
• Analyzed and characterized performance for DSS, analytics, ETL, and OLTP workloads.
-
Sr. Software Developer
Verisign, Sep 2009 - Jul 2010
Reston, Virginia (VA), US
Migrated encrypted sensitive token data from RSA to the VeriSign platform.
-
Database Software Engineer Lead/Database Support
The Nielsen Company (Nielsen//Online), formerly Nielsen//Netratings, Jan 2003 - Jun 2009
New York, NY, US
Data Services, ETL, and Web Mining: Provided internet panelist activity data to clients and inter-company divisions.
- Processed data under Netezza and Oracle.
- Designed database structures and file layouts.
- Extracted and created files using Java, sh, and SQL.
- Transported data and sent notifications through TIBCO messaging.
Analytical Platform for Internet Panelist Behavior: Designed and developed OLAP solutions under Netezza to supply panelists' surfing activity data. Leveraged Cognos for additional analytical functions and export features.
- Batch-processed URL pattern matching to track the internet activity of panelists.
- Copied data from Oracle and other data sources.
Validated, developed, and designed reports for custom requirements.
Kai-Cheng Wu Skills
Kai-Cheng Wu Education Details
-
California State Polytechnic University-Pomona
Computer Science
Frequently Asked Questions about Kai-Cheng Wu
What is Kai-Cheng Wu's role at the current company?
Kai-Cheng Wu's current role is Data Engineer.
What is Kai-Cheng Wu's email address?
Kai-Cheng Wu's email address is ka****@****hoo.com
What is Kai-Cheng Wu's direct phone number?
Kai-Cheng Wu's direct phone number is +150849*****
What schools did Kai-Cheng Wu attend?
Kai-Cheng Wu attended California State Polytechnic University-Pomona.
What are some of Kai-Cheng Wu's interests?
Kai-Cheng Wu is interested in Kids, Exercise, Electronics, Home Improvement, Reading, and Home Decoration.
What skills is Kai-Cheng Wu known for?
Kai-Cheng Wu has skills such as Databases, Data Warehousing, Oracle, SQL, Database Design, Java, ETL, Agile Methodologies, Web Services, Web Applications, Data Analysis, and Hadoop.