Weiwei Wu

Weiwei Wu Email and Phone Number

Data Engineer, Architect @ Segway | Java, Scala, SQL, Apache Spark, Kafka, Hive, Hadoop, Data Warehouse, ETL, OLAP, BI
Weiwei Wu's Location
Malta, Malta
About Weiwei Wu

Highly skilled Data Engineer with 7+ years of hands-on experience in data engineering, data warehousing, data modeling, PB-level data processing, ETL, OLAP, and BI. Proficient in Java/Scala, Shell, and SQL. Skilled in Apache Spark, Kafka, Hive, Hadoop, HBase, and Doris.Seeking a Data Engineer position to contribute to data-driven growth.Welcome to my portfolio: https://barneywill.github.io/portfolio

Weiwei Wu's Current Company Details

Data Engineer, Architect @ Segway | Java, Scala, SQL, Apache Spark, Kafka, Hive, Hadoop, Data Warehouse, ETL, OLAP, BI
Weiwei Wu Work Experience Details
  • Segway
    Data Architect
    Segway Apr 2021 - Dec 2022
    Beijing, China
    Segway is an international hi-tech company with a mission to ‘Simplify Moving’. Skills: Java, Scala, SQL, Spark, Kafka, Hive, Hadoop, Apache Doris, FineBI, ELK, AWS, Internet of Vehicles(IoV).Highlights:1. Built a data warehouse with 1,100+ tables in 6 tiers: STG/ODS/DWD/DIM/DWS/ADS, occupying 600+ TB of storage, including daily data pipelines consisting of 1,300+ tasks (Hive tasks: 56%, ETL tasks: 29%, Shell tasks: 12%, Spark tasks: 3%), supporting 100+ BI dashboards for… Show more Segway is an international hi-tech company with a mission to ‘Simplify Moving’. Skills: Java, Scala, SQL, Spark, Kafka, Hive, Hadoop, Apache Doris, FineBI, ELK, AWS, Internet of Vehicles(IoV).Highlights:1. Built a data warehouse with 1,100+ tables in 6 tiers: STG/ODS/DWD/DIM/DWS/ADS, occupying 600+ TB of storage, including daily data pipelines consisting of 1,300+ tasks (Hive tasks: 56%, ETL tasks: 29%, Shell tasks: 12%, Spark tasks: 3%), supporting 100+ BI dashboards for data-driven decision-making.2. Achieved significant cost savings, €100,000 per year, by optimizing and tuning Hive/Hadoop/HBase/Kafka/ELK to free up 300+ CPU cores, 800+ GB memory, and 400+ TB disk.3. Improved customer experience by introducing an intelligent push notification service based on a 200-tag customer profiling system, and engineering a model to make accurate predictions like remaining mileage and charge time predictions.4. Developed incorrect IoV data monitoring, big tables fully synchronization optimization, and customer de-duplication.5. Recruited 6 data engineers and 1 data analyst to enlarge the big data team. Show less
  • Mzdata (Startup)
    Data Architect
    Mzdata (Startup) Apr 2019 - Nov 2020
    Beijing, China
    MZDATA is a start-up company that translated big data and machine learning into rapid growth for traditional retail chains.Skills: Java, Scala, SQL, Spark, Kafka, Hive, Impala, Kudu, CDH, XGBoost, Retailing.Impacts: Won 3 retail chains as clients, securing the first 2 rounds of funding.Highlights:1. Built a data warehouse with 100+ tables, combining batch and real-time data retrieval by Impala+Kudu+Hive+Parquet.2. Launched a specialized BI product for the retail… Show more MZDATA is a start-up company that translated big data and machine learning into rapid growth for traditional retail chains.Skills: Java, Scala, SQL, Spark, Kafka, Hive, Impala, Kudu, CDH, XGBoost, Retailing.Impacts: Won 3 retail chains as clients, securing the first 2 rounds of funding.Highlights:1. Built a data warehouse with 100+ tables, combining batch and real-time data retrieval by Impala+Kudu+Hive+Parquet.2. Launched a specialized BI product for the retail scenario, focusing on customer, product, and store analysis, with the ability to quickly integrate various data sources from different retailers in 1-2 days.3. Introduced a sales forecasting model to automate business processes by replacing human experience-based decisions with data-driven decisions, significantly reducing raw material waste and store manager recruitment costs through reduced workload and experience requirements. Show less
  • Jd.Com
    Data Engineer
    Jd.Com May 2017 - Apr 2019
    Beijing, China
    JD (NASDAQ: JD) is China's largest B2C E-Commerce company.Skills: Scala, SQL, Spark, Hive, Hadoop, HBase, DMP, User Profiling, Audience Targeting, Marketing.Highlights:1. Implemented an audience targeting system based on large-scale data: 1 billion members with trillions of behaviors, which was very challenging and required much effort to optimize performance, from minutes to seconds.2. Upgraded Hive from version 0.12 to 2.1, and migrated 10% of Hive tasks to Spark, on an… Show more JD (NASDAQ: JD) is China's largest B2C E-Commerce company.Skills: Scala, SQL, Spark, Hive, Hadoop, HBase, DMP, User Profiling, Audience Targeting, Marketing.Highlights:1. Implemented an audience targeting system based on large-scale data: 1 billion members with trillions of behaviors, which was very challenging and required much effort to optimize performance, from minutes to seconds.2. Upgraded Hive from version 0.12 to 2.1, and migrated 10% of Hive tasks to Spark, on an internal platform called Cloud Ocean with thousands of daily Hive tasks.3. Developed a range of data products including length of customer stay analysis, customer location analysis, store site selection(geohash), and advertising impact assessment. Show less
  • Netease
    Senior Java Engineer
    Netease May 2011 - Mar 2017
    Beijing, China
    NetEase (NASDAQ: NTES) is one of the largest internet companies in China.Skills: Java, SQL, Spark, Kafka, Zookeeper, Spring, Mybatis, Redis, MySQL, Nginx, Tomcat, Web Development, E-Commerce.Highlights:1. Developed a BI product and a data warehouse for event tracking analysis, funnel analysis, and A-B testing, resulting in 2X acceleration of product iteration.2. Implemented real-time analytics on Nginx access logs using Spark Streaming + Kafka.3. Led a team of 10 (2… Show more NetEase (NASDAQ: NTES) is one of the largest internet companies in China.Skills: Java, SQL, Spark, Kafka, Zookeeper, Spring, Mybatis, Redis, MySQL, Nginx, Tomcat, Web Development, E-Commerce.Highlights:1. Developed a BI product and a data warehouse for event tracking analysis, funnel analysis, and A-B testing, resulting in 2X acceleration of product iteration.2. Implemented real-time analytics on Nginx access logs using Spark Streaming + Kafka.3. Led a team of 10 (2 frontend devs, 2 app devs, 6 backend devs) to develop 2 e-commerce products: NetEase Mall for online shopping, and NetEase Movie Ticket for online booking. Show less

Weiwei Wu Education Details

Frequently Asked Questions about Weiwei Wu

What is Weiwei Wu's role at the current company?

Weiwei Wu's current role is Data Engineer, Architect @ Segway | Java, Scala, SQL, Apache Spark, Kafka, Hive, Hadoop, Data Warehouse, ETL, OLAP, BI.

What schools did Weiwei Wu attend?

Weiwei Wu attended Beihang University.

Not the Weiwei Wu you were looking for?

  • weiwei wu

    Strategy @ Panw | Ex-Bain | Phd In Chemistry
    Stanford, Ca
    2
    bain.com, stanford.edu

    1 (650) 7XXXXXXX

  • Weiwei Wu

    San Francisco Bay Area
    3
    fedex.com, mail.missouri.edu, hippo.com
  • Weiwei Wu

    Researcher @ Berkeley Lab | Physics & Operations Research @ Uc Berkeley
    Berkeley, Ca
  • Weiwei Wu

    San Jose, Ca
    4
    bakerhughes.com, yahoo.com, 163.com, uh.edu

    2 +171385XXXXX

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.