Highly skilled Data Engineer with 7+ years of hands-on experience in data engineering, data warehousing, data modeling, PB-level data processing, ETL, OLAP, and BI. Proficient in Java/Scala, Shell, and SQL. Skilled in Apache Spark, Kafka, Hive, Hadoop, HBase, and Doris.Seeking a Data Engineer position to contribute to data-driven growth.Welcome to my portfolio: https://barneywill.github.io/portfolio
-
Data ArchitectSegway Apr 2021 - Dec 2022Beijing, ChinaSegway is an international hi-tech company with a mission to ‘Simplify Moving’. Skills: Java, Scala, SQL, Spark, Kafka, Hive, Hadoop, Apache Doris, FineBI, ELK, AWS, Internet of Vehicles(IoV).Highlights:1. Built a data warehouse with 1,100+ tables in 6 tiers: STG/ODS/DWD/DIM/DWS/ADS, occupying 600+ TB of storage, including daily data pipelines consisting of 1,300+ tasks (Hive tasks: 56%, ETL tasks: 29%, Shell tasks: 12%, Spark tasks: 3%), supporting 100+ BI dashboards for… Show more Segway is an international hi-tech company with a mission to ‘Simplify Moving’. Skills: Java, Scala, SQL, Spark, Kafka, Hive, Hadoop, Apache Doris, FineBI, ELK, AWS, Internet of Vehicles(IoV).Highlights:1. Built a data warehouse with 1,100+ tables in 6 tiers: STG/ODS/DWD/DIM/DWS/ADS, occupying 600+ TB of storage, including daily data pipelines consisting of 1,300+ tasks (Hive tasks: 56%, ETL tasks: 29%, Shell tasks: 12%, Spark tasks: 3%), supporting 100+ BI dashboards for data-driven decision-making.2. Achieved significant cost savings, €100,000 per year, by optimizing and tuning Hive/Hadoop/HBase/Kafka/ELK to free up 300+ CPU cores, 800+ GB memory, and 400+ TB disk.3. Improved customer experience by introducing an intelligent push notification service based on a 200-tag customer profiling system, and engineering a model to make accurate predictions like remaining mileage and charge time predictions.4. Developed incorrect IoV data monitoring, big tables fully synchronization optimization, and customer de-duplication.5. Recruited 6 data engineers and 1 data analyst to enlarge the big data team. Show less -
Data ArchitectMzdata (Startup) Apr 2019 - Nov 2020Beijing, ChinaMZDATA is a start-up company that translated big data and machine learning into rapid growth for traditional retail chains.Skills: Java, Scala, SQL, Spark, Kafka, Hive, Impala, Kudu, CDH, XGBoost, Retailing.Impacts: Won 3 retail chains as clients, securing the first 2 rounds of funding.Highlights:1. Built a data warehouse with 100+ tables, combining batch and real-time data retrieval by Impala+Kudu+Hive+Parquet.2. Launched a specialized BI product for the retail… Show more MZDATA is a start-up company that translated big data and machine learning into rapid growth for traditional retail chains.Skills: Java, Scala, SQL, Spark, Kafka, Hive, Impala, Kudu, CDH, XGBoost, Retailing.Impacts: Won 3 retail chains as clients, securing the first 2 rounds of funding.Highlights:1. Built a data warehouse with 100+ tables, combining batch and real-time data retrieval by Impala+Kudu+Hive+Parquet.2. Launched a specialized BI product for the retail scenario, focusing on customer, product, and store analysis, with the ability to quickly integrate various data sources from different retailers in 1-2 days.3. Introduced a sales forecasting model to automate business processes by replacing human experience-based decisions with data-driven decisions, significantly reducing raw material waste and store manager recruitment costs through reduced workload and experience requirements. Show less
-
Data EngineerJd.Com May 2017 - Apr 2019Beijing, ChinaJD (NASDAQ: JD) is China's largest B2C E-Commerce company.Skills: Scala, SQL, Spark, Hive, Hadoop, HBase, DMP, User Profiling, Audience Targeting, Marketing.Highlights:1. Implemented an audience targeting system based on large-scale data: 1 billion members with trillions of behaviors, which was very challenging and required much effort to optimize performance, from minutes to seconds.2. Upgraded Hive from version 0.12 to 2.1, and migrated 10% of Hive tasks to Spark, on an… Show more JD (NASDAQ: JD) is China's largest B2C E-Commerce company.Skills: Scala, SQL, Spark, Hive, Hadoop, HBase, DMP, User Profiling, Audience Targeting, Marketing.Highlights:1. Implemented an audience targeting system based on large-scale data: 1 billion members with trillions of behaviors, which was very challenging and required much effort to optimize performance, from minutes to seconds.2. Upgraded Hive from version 0.12 to 2.1, and migrated 10% of Hive tasks to Spark, on an internal platform called Cloud Ocean with thousands of daily Hive tasks.3. Developed a range of data products including length of customer stay analysis, customer location analysis, store site selection(geohash), and advertising impact assessment. Show less -
Senior Java EngineerNetease May 2011 - Mar 2017Beijing, ChinaNetEase (NASDAQ: NTES) is one of the largest internet companies in China.Skills: Java, SQL, Spark, Kafka, Zookeeper, Spring, Mybatis, Redis, MySQL, Nginx, Tomcat, Web Development, E-Commerce.Highlights:1. Developed a BI product and a data warehouse for event tracking analysis, funnel analysis, and A-B testing, resulting in 2X acceleration of product iteration.2. Implemented real-time analytics on Nginx access logs using Spark Streaming + Kafka.3. Led a team of 10 (2… Show more NetEase (NASDAQ: NTES) is one of the largest internet companies in China.Skills: Java, SQL, Spark, Kafka, Zookeeper, Spring, Mybatis, Redis, MySQL, Nginx, Tomcat, Web Development, E-Commerce.Highlights:1. Developed a BI product and a data warehouse for event tracking analysis, funnel analysis, and A-B testing, resulting in 2X acceleration of product iteration.2. Implemented real-time analytics on Nginx access logs using Spark Streaming + Kafka.3. Led a team of 10 (2 frontend devs, 2 app devs, 6 backend devs) to develop 2 e-commerce products: NetEase Mall for online shopping, and NetEase Movie Ticket for online booking. Show less
Weiwei Wu Education Details
-
Mathematics And Applied Mathematics
Frequently Asked Questions about Weiwei Wu
What is Weiwei Wu's role at the current company?
Weiwei Wu's current role is Data Engineer, Architect @ Segway | Java, Scala, SQL, Apache Spark, Kafka, Hive, Hadoop, Data Warehouse, ETL, OLAP, BI.
What schools did Weiwei Wu attend?
Weiwei Wu attended Beihang University.
Not the Weiwei Wu you were looking for?
-
2bain.com, stanford.edu
1 (650) 7XXXXXXX
-
3fedex.com, mail.missouri.edu, hippo.com
-
4bakerhughes.com, yahoo.com, 163.com, uh.edu
2 +171385XXXXX
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial