John Cao

John Cao Email and Phone Number

Senior Data Engineer @ The Walt Disney Company
New York, NY, US
John Cao's Location
Reno, Nevada, United States, United States
John Cao's Contact Details

John Cao work email

About John Cao

An Experienced Data Engineer with a background in Data Science focused on resilient ETL designs, data modeling, and distributed systems.SkillsProgramming Languages: SQL, Python, Go, Scala (basic), JavaScript (basic), Rust (learning), HTML, CSSData Visualization: Tableau, Dash Plotly, Apache Superset, GrafanaDatabase and Storage: MySQL, MSSQL, AWS S3, Clickhouse, Vertica, Trino, InfluxDB, Azure CosmosDB, Azure Blob Storage, Google BigQuery, Neo4j, SFTP, Kafka, Delta LakeBig Data: Apache Spark, Databricks, Hadoop, Kubernetes, AWS EMRDevOps: Git, Azure DevOps, shell scripting, Regex, Jira, Confluence, Docker, Terraform, JFrog Artifactory DataOps: Apache Airflow, AWS Glue, Datahub, dbt core, SQLMesh

John Cao's Current Company Details
The Walt Disney Company

The Walt Disney Company

View
Senior Data Engineer
New York, NY, US
John Cao Work Experience Details
  • The Walt Disney Company
    Senior Data Engineer
    The Walt Disney Company
    New York, Ny, Us
  • Tesla
    Data Engineer
    Tesla Apr 2023 - Jun 2024
    Austin, Texas, Us
    - Orchestrated over 80 data pipelines using technologies like Airflow, Kubernetes, JFrog, Python, SQL, Docker, and Git to design and implement near real-time and batch data pipelines supporting manufacturing quality.- Pioneered the use of distributed processing techniques within the team, exemplified by a use case analyzing early-life failures of powertrain parts, processing upwards of 120GB of data weekly. Leveraged technologies such as Hadoop YARN, Hadoop Hive, Apache Spark, AWS EMR, and TrinoDB. Currently team expert in Spark.- Enhanced team's DevOps practices by implementing unit testing, data validation, DAG migration tool, common data utilities, secrets rotation, and data monitoring. Resulting in a 33% week-over-week increase in overall data pipeline uptime and reliability achieving SLA level of 99.9% availability.
  • Kpmg Us
    Data Engineer
    Kpmg Us 2019 - Apr 2023
    New York, Ny, Us
    - Built an ETL pipeline from scratch using databricks to call over 40 data tables averaging over 20 million records to assist the firm in building a big data distributed system. - Assisted clients in delivering mission-critical analyses of national security clearance application errors to over 200 agencies using advanced text mining and natural language process techniques presented in Tableau dashboards- Leveraged Azure Cosmos DB, Azure Blob Storage, and python for scripting to build an ETL pipeline that extracted and preprocessed over 60,000 public comments and file attachments for regulation proposals in minutes
  • Kpmg Us
    Data Analyst, Cognitive Automation Lab
    Kpmg Us 2016 - 2018
    New York, Ny, Us
    - Defined and tuned a type system with 60+ entities and 1,000+ dictionary entries in Watson Knowledge Studio to inform a named entity recognition and information extraction model utilized in a firm-wide recommender solution- Assisted the Data Scientist team in analyzing over 1,000 courses by using topic modeling and sentiment analysis on course feedback to discover unique findings such as food and environment led students to rate courses higher- Created a valuation model identifying the benefits of automation for a software contract compliance service line proposing $9.4 million in total bottom-line savings over 5 years
  • Kpmg Us
    Associate, Technology Enablement Management Consulting
    Kpmg Us 2015 - 2016
    New York, Ny, Us
    -Developed an investment case delivered to the CEO of a healthcare client highlighting a cognitive automation solution for procurement process approximating $9 million in savings to the operation. Assisted in developing the high-level design of the solution architect identifying new resolution steps and KPIs previously unrecognized. -Co-led a project management engagement to deliver cognitive automation to the firm. Worked with Technical Solution Architects and Consultants outside the firm to develop a cognitive automation tool. Led workshops and interviews with lead Partners, Managing Directors, and Managers.

John Cao Skills

Microsoft Excel Teamwork Access Microsoft Office Team Leadership Customer Service Business Strategy Data Analysis Powerpoint Skills Jd Edwards Sap Visio Restaurant Management Customer Relations Excel Oracle Gl Inquiry Quickbooks Proseries Timeslips Dreamweaver Web Design Photoshop Idea Imovie Eforms Seo Google Analytics Machine Learning

John Cao Education Details

  • Uc Irvine
    Uc Irvine
    Kpmg Ai University - Artificial Intelligence Fundamentals
  • Drexel University'S Lebow College Of Business
    Drexel University'S Lebow College Of Business
    And A Minor In Real Estate
  • Open Source Society University
    Open Source Society University
    Computer Science

Frequently Asked Questions about John Cao

What company does John Cao work for?

John Cao works for The Walt Disney Company

What is John Cao's role at the current company?

John Cao's current role is Senior Data Engineer.

What is John Cao's email address?

John Cao's email address is jo****@****pmg.com

What schools did John Cao attend?

John Cao attended Uc Irvine, Drexel University's Lebow College Of Business, Open Source Society University.

What skills is John Cao known for?

John Cao has skills like Microsoft Excel, Teamwork, Access, Microsoft Office, Team Leadership, Customer Service, Business Strategy, Data Analysis, Powerpoint Skills, Jd Edwards, Sap, Visio.

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.