Ziwei Li work email
- Valid
Ziwei Li personal email
Hardware Accelerator for Machine Learning and AI workloadGPU RTL design and Micro-Architecture design. (TileBuffer; Clipper; Cull; Viewport Transfer; Setup)Experience with Verilog and System Verilog coding. Experience with low power designExperience with area reductionExperience with timing fixExperience with ECOExperience with GPU architectureExperience with the use of Verdi, Lint, Synthesis.Understanding of semiconductor physics.
-
Gpu Design EngineerApple Mar 2023 - PresentCupertino, California, UsFocus on GPU Memory -
Asic Design EngineerGroq Aug 2021 - Feb 2023Mountain View, California, Us• Experienced in designing Hardware Accelerator for Machine Learning/AI workload.• Responsible for Micro-architecture spec and RTL design of fully deterministic Instruction Control Units (ICUs) thathandle instruction parsing, decoding and dispatching in the Tensor Streaming Processor pipeline.• Developed ICUs to support chip level die-to-die communications and multi-chip synchronizations.• Proposed and implemented new ISA to support chip level clock masking, clock period synthesis, chip levelsynchronization and preemptive voltage compensation.• Supported RAS by parity check implementation and fault detection.• Worked closely with Compiler team and innovated a new timing structure enable compiler program, whichtremendously simplify the software deterministic instruction scheduling effort.• Analyzed the complexity of supporting several different numerical formats.• Improved dynamic power by 30% with optimized clock control and clock-domain splitting.• Helped top, cluster and block level DV debugging the compiler program issue using DVE and Verdi.• Collaborated with DV and Compiler team to analyze Emulation result via Palladium.• Worked on Logical Synthesis flow with PD to fix the Congestion issues and Timing violations.• Implemented ICUs Functional Verification Coverage. -
Senior Gpu Rtl Design EngineerSamsung Austin Semiconductior Mar 2020 - Aug 2021• Owned U-arch spec and RTL design of synthesize-friendly and power-efficient Primitive Assembler that handles Clipping, Culling, Set-up process for interpolation, and Viewport Transfer in the GPU pipeline.• Owned U-arch spec and RTL design of Medium Grain Clock Gating via ICG cell, Fine Grain Clock Gating and clock-domain splitting for power reduction to meeting the goal.• Owned U-arch spec and RTL design of throughput doubling .• Reduced the area consumption with the methods of splitting memory, retiming logic removal and simplifying redundant logics.• Refined RTL code and re-organized the pipeline logic to meeting the timing requirements.• Worked with EMULATION team and PD team to help in emulation debug and ECO to minimize the impacted logics after RTL Freeze and Post-Silicon debug and ECO after tape-out.• Worked with FDV, AM, Formal and CSIM team to debug using Verdi, hit the target of functional coverage, resolve timing violation and performance loss.• Worked with other RTL unit design team for interface handshake and inter-unit’s communication.
-
Gpu Rtl DesignerSamsung Austin Semiconductior Aug 2017 - Mar 2020Tile Buffer Micro-Arch design/Low power design. • Owned U-arch spec and RTL design of synthesize-friendly and power-efficient smart memory in storing Depth, Stencil and Color information from Rasterization and Color blender units using System Verilog and modifying Yaml file.• Owned U-arch spec and RTL design of Packed Format/YUV format/MSAA/Adaptive De-sampling/Block Linear/Pitch Linear Flush, Normal/BINZ/Decompression Preload, Fast clear Data-path Control Logic and FSM.• Owned U-arch spec and RTL design of pipeline mode Sync Token Control Logic and timing synchronous mode NNOC Packet Control Logic.• Implemented D16 to D24 and D16 to FP32 Conservative Format Conversion Logic.• Fixed the timing violations by re-organizing pipeline logic and simplifying existed logic to meet timing requirements.• Worked with EMULATION team and PD team to help emulation debug and ECO to provide cheapest choice for the 2 tape-outs.• Worked with FDV, Formal and DVM team to fix SEQ mismatch and worked with Physical and Integration team for congestion issues.Primitive Assembler (clipper, cull, view port transfer, setup) Micro-Arch design/Low power design.• Owned U-arch spec and RTL design of synthesize-friendly and power-efficient Primitive Assembler that handles Clipping, Culling, Set-up process for interpolation, and Viewport Transfer in the GPU pipeline.
-
Trainee Of Bidding Department(Intership)Sieyuan Qingneng Electronics Corp Jul 2014 - Sep 2014Assisted in the editing and review of a project application formManaged the bidding documentsVisited workshops to acquire information
Ziwei Li Skills
Ziwei Li Education Details
-
University Of Southern CaliforniaElectrical And Electronics Engineering -
Huazhong University Of Science And TechnologyPower System
Frequently Asked Questions about Ziwei Li
What company does Ziwei Li work for?
Ziwei Li works for Apple
What is Ziwei Li's role at the current company?
Ziwei Li's current role is GPU design engineer.
What is Ziwei Li's email address?
Ziwei Li's email address is zl****@****roq.com
What schools did Ziwei Li attend?
Ziwei Li attended University Of Southern California, Huazhong University Of Science And Technology.
What skills is Ziwei Li known for?
Ziwei Li has skills like Microsoft Office, Microsoft Word, 领导力, Microsoft Excel, 管理人员, Verilog, Customer Service, C++, Cadence, 项目管理, System Verilog, 战略规划.
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial