Andrew Lavin Email & Phone Number
area 408
LinkedIn matched
Who is Andrew Lavin? Overview
A concise factual answer block for searchers comparing this professional profile.
Andrew Lavin is listed as Fellow at AMD, a with 44382 employees, based in San Jose, California, United States. AeroLeads shows phone signal with area code 408 and a matched LinkedIn profile for Andrew Lavin.
Andrew Lavin previously worked as Independent Researcher at Self-Employed and Distinguished Engineer at Phantom Ai. Andrew Lavin holds B.S., Mathematics, E&As (Double Major) from Caltech.
Email format at AMD
This section adds company-level context without repeating Andrew Lavin's masked contact details.
Review company-level records connected to Andrew Lavin before choosing the right outreach path.
About Andrew Lavin
I specialize in efficient algorithms for convolutional neural networks.
Listed skills include Linux, C++, Objective C, Programming, and 12 others.
Andrew Lavin's current company
Company context helps verify the profile and gives searchers a useful next step.
Andrew Lavin work experience
A career timeline built from the work history available for this profile.
Independent Researcher
CurrentPublished five years of original research in the paper "On the Efficiency of Convolutional Neural Networks." Major contributions include a theory of neural network efficiency that unifies model efficiency, computational efficiency, and latency; the Waterline performance model for sequences of parallel kernels, which corrects errors with the widely used Roofline analysis; memory-efficient CUDA kernels for ConvFirst and MBConv+SE blocks that achieve up to 14x and 5x speedup over PyTorch Inductor, respectively; the ConvFirst model that achieves ~4x speedup over ConvNeXt with equal accuracy when using our custom kernels. The paper shows that co-optimization of the model and program yields superior performance.Developed the Spio kernel library for PyTorch. Spio is a CUDA kernel framework with named tensors, run time compilation, kernel performance models, and torch.compile integration. The first Spio kernels perform grouped convolution several times faster than the builtin PyTorch kernels.
Distinguished Engineer
Developed memory-efficient CUDA kernels for MBConv+SE blocks and ConvFirst blocks, implementing entire blocks with fused kernels. Led the development of a graph rewriting compiler for neural networks. Improved the performance and reliability of core system-level code for a production ADAS system.
Principal Software Engineer
Developed an efficient inference engine for the NVIDIA Xavier SoC using block-fusion kernels written in CUDA and PTX. Achieved ~4x speedup on whole network object detection, semantic segmentation, and other tasks versus NVIDIA TensorRT. Developed a client-server framework for testing and benchmarking, enabling the compile and test framework to run in a Python desktop environment while the inference server runs on the embedded system. Administered the company's deep learning cluster.
Founder
Created the ConvFirst building block for convolutional neural networks, predating the similar ConvNeXt block by three years. Performed research on high-performance pre-processing pipelines for neural networks. Also devised multi-scale convnet models. Began contract work on the Phantom AI Inference Engine.
Applied Research Scientist Deep Learning Algorithms
As a contract employee, I researched machine learning algorithms and their deployment on graphics processors.
Principal Software Engineer
Evaluated the computer vision software stack for the autopilot team. Suggested architectural and algorithmic changes.
Machine Learning Consultant
Improved efficiency of neural network software on Intel Gen graphics processors using OpenCL. Achieved approximately 4x speedup over the existing codebase.
Computer Vision Software Engineer
Optimized a computer vision pipeline for desktop and embedded platforms over the course of a short term contract. Achieved speedup of 2.5X on embedded platform while reducing memory use by nearly half.
Independent Researcher
Introduced Winograd's fast convolution algorithms into convolutional neural networks while working as an independent researcher. Devised the mathematical formulation of fast tensor convolution as matrix products of coefficients in transform space. Presented the results at CVPR 2016 in the research paper "Fast Algorithms for Convolutional Neural Networks."cuDNN and other deep learning software libraries and ASIC and FPGA designs have all implemented the algorithm. The paper has more than 1,100 citations.Created the winCNN python module for the automatic generation of modified Cook-Toom (i.e., minimal Winograd) convolution algorithms.
R & D Engineer
Developed the world's first high efficiency convolution neural network kernel for NVIDIA GPUs, reaching 95% computational efficiency for popular deep learning network layers. Used the Maxas assembler and modified the SGEMM sample to perform direct convolution.Profiled the cuda-convnet2 deep learning framework and identified the image preprocessing bottleneck for multigpu systems. Removed bottleneck by performing scan line mean subtraction and color noise addition pipelined with scan line JPEG decoding.
Senior Software Architect
Designed and implemented a cross platform C++ library for parsing and rendering Apple Keynote files. Implemented code for iPad screen sharing. Evaluated codecs for next generation screen sharing. Implemented OpenGL based renderer for screen sharing and video conferencing.
Software Developer
Designed and implemented next generation video conference user interface. Cross platform code written in C++, with original rendering implementation written for OS X. Advised engineers who ported to Linux and Windows. Extended the Chameleon framework (for porting iOS apps to OS X) with support for drag and drop and tooltips, using Objective-C on OS X. Improved OS X screen capture. Contributed many bug fixes and UI enhancements on OS X and iOS using Objective-C.
Software Developer
Created C++ middleware for system programming on Linux, MS Windows, and Mac/iOS. Profiled and optimized existing visual matching algorithm. Designed and implemented API for approximate nearest neighbor search and benchmarked different algorithms.
Software Developer
Developed face detection and automatic image segmentation software. Maintained MS Windows APIs for legacy products. Profiled and optimized image segmentation algorithm. Developed commandline tool for automatic image alignment. Translated several computer vision algorithms from Matlab to C++.
Lead Software Engineer
Created a Cocoa server framework for home automation on OS X.Created a flexible, efficient framework for distributed objects on iPhone and OS X. Improved the reliability and speed of legacy home automation system software. Eliminated GUI blocking and communication failures by creating an asynchronous, multithreaded distributed object system using C++ and TCP/IP on Linux.Created a framework for the rapid implementation of RS232 device drivers. Added the facility to control HVAC by implementing a BACnet client protocol stack.
Software Engineering Consultant
Implemented Trax Systems’ Traceware package recognition system. Created the image analysis algorithm in just 2 weeks. Achieved perfect recognition accuracy for all well formed images regardless of perspective. Implemented the recognition module and web service interface on Linux using C++ and PHP.Wrote parsers for large, complex undocumented data files for the Thirty Meter Telescope Project. Wrote scripts to import daily data files into a relational database and a web site for data visualization. Used Perl, PostgreSQL, and PHP on OS X.Created the Picturepusher photo sharing service. Implemented a fast, cross platform photo management application, featuring a thumbnail viewer, camera import, and photo upload. Implemented image resizing algorithms that achieved high quality while remaining much faster than many publicly available libraries. Implemented a web site with user selectable access controls, transfer and storage quotas, and a scalable architecture. Used C++, PHP, Python, and PostgreSQL on LInux, MS Windows, and Mac.Implemented a sales reporting tool for a department store chain. Used PHP, PostgreSQL, and Oracle to pull daily sales data from the corporate database. Wrote parsers for the customer traffic monitoring system. Generated HTML reports comparing sales and profits per customer across any combination of stores.Designed and implemented open source libraries for cross platform sockets, threads and system logging.Contributed to the wxWidgets for Mac preemptive threads implementation.Created the Threadhandler classes for multi-threaded event handling in wxWidgets.
Software Engineer
7th employee in a Silicon Valley startup in digital imaging technologies innovation.Improved the speed of the immersive imaging viewer by an order of magnitude using hardware graphics acceleration. This was accomplished during the first 2 weeks of employment and was key in winning my employer a contract for the development of an immersive video renderer and viewer.Co-inventor of 4 US patents for immersive video.Created an immersive video viewer.Created the image processing and visualization modules for the well reviewed Powerstitch large scale photo panorama software.
Software Engineer
10th employee of a small research group in one of the world’s largest software companies.Improved the speed of Picture This Home’s modeling algorithm by a factor of 2 in just 2 days.Created C++ libraries for linear algebra, nonlinear optimization, and geometric operations.Created the solid modeling user interface for the Origami image based modeling prototype.Provided programming support for researchers from leading universities such as Caltech, MIT, and Stanford.
Colleagues at AMD
Other employees you can reach at amd.com. View company contacts for 44382 employees →
Akhila Nakhate
Colleague at AmdSanta Clara, California, United States
View →
SN
Scott Nixon
Colleague at AmdFort Collins, Colorado, United States
View →
GG
Gowtham G
Colleague at AmdHyderabad, Telangana, India
View →
AS
Azam Salehi
Colleague at AmdTehran, Tehran Province, Iran, Islamic Republic Of
View →
TU
Tharani Ulaganathan
Colleague at AmdKumarapalayam, Tamil Nadu, India
View →
V.
Vineeth .P
Colleague at AmdCoimbatore, Tamil Nadu, India
View →
JW
Jemmy Wu
Colleague at AmdTaipei, Taipei City, Taiwan, Province Of China
View →
TR
Tom Raisor
Colleague at AmdProvo, Utah, United States
View →
LZ
Lanjiang Zhou
Colleague at AmdSunnyvale, California, United States
View →
DW
Dilshan Wickramarathna
Colleague at AmdNugegoda, Western Province, Sri Lanka
View →
Andrew Lavin education
-
Caltech
Frequently asked questions about Andrew Lavin
Quick answers generated from the profile data available on this page.
What company does Andrew Lavin work for?
Andrew Lavin works for AMD.
What is Andrew Lavin's role at AMD?
Andrew Lavin is listed as Fellow at AMD.
What is Andrew Lavin's phone number?
AeroLeads has found 1 phone signal(s) with area code 408 for Andrew Lavin at AMD.
Where is Andrew Lavin based?
Andrew Lavin is based in San Jose, California, United States while working with AMD.
What companies has Andrew Lavin worked for?
Andrew Lavin has worked for Amd, Self-Employed, Phantom Ai, Subdivision Ai, and Intel Corporation.
Who are Andrew Lavin's colleagues at AMD?
Andrew Lavin's colleagues at AMD include Akhila Nakhate, Scott Nixon, Gowtham G, Azam Salehi, and Tharani Ulaganathan.
How can I contact Andrew Lavin?
You can use AeroLeads to view verified contact signals for Andrew Lavin at AMD, including work email, phone, and LinkedIn data when available.
What schools did Andrew Lavin attend?
Andrew Lavin holds B.S., Mathematics, E&As (Double Major) from Caltech.
What skills is Andrew Lavin known for?
Andrew Lavin is listed with skills including Linux, C++, Objective C, Programming, Software Development, Computer Vision, Unix, and Image Processing.
Search by job title, company, industry, location, and seniority. Export verified B2B contact data when you need it.
Start free trial