Andrew Lavin

Andrew Lavin Email and Phone Number

Fellow @ AMD
California, United States
Andrew Lavin's Location
San Jose, California, United States, United States
Andrew Lavin's Contact Details

Andrew Lavin personal email

Andrew Lavin phone numbers

About Andrew Lavin

I specialize in efficient algorithms for convolutional neural networks.

Andrew Lavin's Current Company Details
AMD

Amd

View
Fellow
California, United States
Website:
amd.com
Employees:
44382
Andrew Lavin Work Experience Details
  • Amd
    Fellow
    Amd
    California, United States
  • Self-Employed
    Independent Researcher
    Self-Employed Feb 2024 - Present
    Published five years of original research in the paper "On the Efficiency of Convolutional Neural Networks." Major contributions include a theory of neural network efficiency that unifies model efficiency, computational efficiency, and latency; the Waterline performance model for sequences of parallel kernels, which corrects errors with the widely used Roofline analysis; memory-efficient CUDA kernels for ConvFirst and MBConv+SE blocks that achieve up to 14x and 5x speedup over PyTorch Inductor, respectively; the ConvFirst model that achieves ~4x speedup over ConvNeXt with equal accuracy when using our custom kernels. The paper shows that co-optimization of the model and program yields superior performance.Developed the Spio kernel library for PyTorch. Spio is a CUDA kernel framework with named tensors, run time compilation, kernel performance models, and torch.compile integration. The first Spio kernels perform grouped convolution several times faster than the builtin PyTorch kernels.
  • Phantom Ai
    Distinguished Engineer
    Phantom Ai May 2022 - Feb 2024
    Mountain View, California, Us
    Developed memory-efficient CUDA kernels for MBConv+SE blocks and ConvFirst blocks, implementing entire blocks with fused kernels. Led the development of a graph rewriting compiler for neural networks. Improved the performance and reliability of core system-level code for a production ADAS system.
  • Phantom Ai
    Principal Software Engineer
    Phantom Ai Nov 2020 - May 2022
    Mountain View, California, Us
    Developed an efficient inference engine for the NVIDIA Xavier SoC using block-fusion kernels written in CUDA and PTX. Achieved ~4x speedup on whole network object detection, semantic segmentation, and other tasks versus NVIDIA TensorRT. Developed a client-server framework for testing and benchmarking, enabling the compile and test framework to run in a Python desktop environment while the inference server runs on the embedded system. Administered the company's deep learning cluster.
  • Subdivision Ai
    Founder
    Subdivision Ai Mar 2019 - Nov 2020
    Created the ConvFirst building block for convolutional neural networks, predating the similar ConvNeXt block by three years. Performed research on high-performance pre-processing pipelines for neural networks. Also devised multi-scale convnet models. Began contract work on the Phantom AI Inference Engine.
  • Intel Corporation
    Applied Research Scientist Deep Learning Algorithms
    Intel Corporation Mar 2018 - Aug 2018
    Santa Clara, California, Us
    As a contract employee, I researched machine learning algorithms and their deployment on graphics processors.
  • Tesla
    Principal Software Engineer
    Tesla Nov 2017 - Dec 2017
    Austin, Texas, Us
    Evaluated the computer vision software stack for the autopilot team. Suggested architectural and algorithmic changes.
  • Intel Corporation
    Machine Learning Consultant
    Intel Corporation Mar 2017 - Oct 2017
    Santa Clara, California, Us
    Improved efficiency of neural network software on Intel Gen graphics processors using OpenCL. Achieved approximately 4x speedup over the existing codebase.
  • Tesla
    Software Engineer
    Tesla Jun 2016 - Jan 2017
    Austin, Texas, Us
    Developed high-performance NVIDIA GPU kernels for the Autopilot system.
  • Daqri
    Computer Vision Software Engineer
    Daqri Feb 2016 - May 2016
    Los Angeles, Ca, Us
    Optimized a computer vision pipeline for desktop and embedded platforms over the course of a short term contract. Achieved speedup of 2.5X on embedded platform while reducing memory use by nearly half.
  • Self-Employed
    Independent Researcher
    Self-Employed Mar 2015 - Dec 2015
    Introduced Winograd's fast convolution algorithms into convolutional neural networks while working as an independent researcher. Devised the mathematical formulation of fast tensor convolution as matrix products of coefficients in transform space. Presented the results at CVPR 2016 in the research paper "Fast Algorithms for Convolutional Neural Networks."cuDNN and other deep learning software libraries and ASIC and FPGA designs have all implemented the algorithm. The paper has more than 1,100 citations.Created the winCNN python module for the automatic generation of modified Cook-Toom (i.e., minimal Winograd) convolution algorithms.
  • Ebay Inc
    R & D Engineer
    Ebay Inc Jul 2014 - Feb 2015
    San Jose, Ca, Us
    Developed the world's first high efficiency convolution neural network kernel for NVIDIA GPUs, reaching 95% computational efficiency for popular deep learning network layers. Used the Maxas assembler and modified the SGEMM sample to perform direct convolution.Profiled the cuda-convnet2 deep learning framework and identified the image preprocessing bottleneck for multigpu systems. Removed bottleneck by performing scan line mean subtraction and color noise addition pipelined with scan line JPEG decoding.
  • Fuze Box
    Senior Software Architect
    Fuze Box Jul 2012 - Dec 2013
    Boston, Ma, Us
    Designed and implemented a cross platform C++ library for parsing and rendering Apple Keynote files. Implemented code for iPad screen sharing. Evaluated codecs for next generation screen sharing. Implemented OpenGL based renderer for screen sharing and video conferencing.
  • Fuze Box
    Software Developer
    Fuze Box Oct 2011 - Jul 2012
    Boston, Ma, Us
    Designed and implemented next generation video conference user interface. Cross platform code written in C++, with original rendering implementation written for OS X. Advised engineers who ported to Linux and Windows. Extended the Chameleon framework (for porting iOS apps to OS X) with support for drag and drop and tooltips, using Objective-C on OS X. Improved OS X screen capture. Contributed many bug fixes and UI enhancements on OS X and iOS using Objective-C.
  • A9.Com
    Software Developer
    A9.Com Dec 2010 - Sep 2011
    Palo Alto, Ca, Us
    Created C++ middleware for system programming on Linux, MS Windows, and Mac/iOS. Profiled and optimized existing visual matching algorithm. Designed and implemented API for approximate nearest neighbor search and benchmarked different algorithms.
  • Flashfoto
    Software Developer
    Flashfoto Mar 2010 - Dec 2010
    Developed face detection and automatic image segmentation software. Maintained MS Windows APIs for legacy products. Profiled and optimized image segmentation algorithm. Developed commandline tool for automatic image alignment. Translated several computer vision algorithms from Matlab to C++.
  • Commandscape
    Lead Software Engineer
    Commandscape Jan 2006 - May 2009
    Created a Cocoa server framework for home automation on OS X.Created a flexible, efficient framework for distributed objects on iPhone and OS X. Improved the reliability and speed of legacy home automation system software. Eliminated GUI blocking and communication failures by creating an asynchronous, multithreaded distributed object system using C++ and TCP/IP on Linux.Created a framework for the rapid implementation of RS232 device drivers. Added the facility to control HVAC by implementing a BACnet client protocol stack.
  • Picturepusher Llc
    Software Engineering Consultant
    Picturepusher Llc Jan 2001 - Dec 2005
    Implemented Trax Systems’ Traceware package recognition system. Created the image analysis algorithm in just 2 weeks. Achieved perfect recognition accuracy for all well formed images regardless of perspective. Implemented the recognition module and web service interface on Linux using C++ and PHP.Wrote parsers for large, complex undocumented data files for the Thirty Meter Telescope Project. Wrote scripts to import daily data files into a relational database and a web site for data visualization. Used Perl, PostgreSQL, and PHP on OS X.Created the Picturepusher photo sharing service. Implemented a fast, cross platform photo management application, featuring a thumbnail viewer, camera import, and photo upload. Implemented image resizing algorithms that achieved high quality while remaining much faster than many publicly available libraries. Implemented a web site with user selectable access controls, transfer and storage quotas, and a scalable architecture. Used C++, PHP, Python, and PostgreSQL on LInux, MS Windows, and Mac.Implemented a sales reporting tool for a department store chain. Used PHP, PostgreSQL, and Oracle to pull daily sales data from the corporate database. Wrote parsers for the customer traffic monitoring system. Generated HTML reports comparing sales and profits per customer across any combination of stores.Designed and implemented open source libraries for cross platform sockets, threads and system logging.Contributed to the wxWidgets for Mac preemptive threads implementation.Created the Threadhandler classes for multi-threaded event handling in wxWidgets.
  • Enroute
    Software Engineer
    Enroute 1998 - 2000
    7th employee in a Silicon Valley startup in digital imaging technologies innovation.Improved the speed of the immersive imaging viewer by an order of magnitude using hardware graphics acceleration. This was accomplished during the first 2 weeks of employment and was key in winning my employer a contract for the development of an immersive video renderer and viewer.Co-inventor of 4 US patents for immersive video.Created an immersive video viewer.Created the image processing and visualization modules for the well reviewed Powerstitch large scale photo panorama software.
  • Autodesk
    Software Engineer
    Autodesk 1996 - 1998
    San Francisco, Ca, Us
    10th employee of a small research group in one of the world’s largest software companies.Improved the speed of Picture This Home’s modeling algorithm by a factor of 2 in just 2 days.Created C++ libraries for linear algebra, nonlinear optimization, and geometric operations.Created the solid modeling user interface for the Origami image based modeling prototype.Provided programming support for researchers from leading universities such as Caltech, MIT, and Stanford.

Andrew Lavin Skills

Linux C++ Objective C Programming Software Development Computer Vision Unix Image Processing C Object Oriented Design Ios Pattern Recognition Cuda Gpu Software Engineering Algorithms

Andrew Lavin Education Details

  • Caltech
    Caltech
    E&As (Double Major)

Frequently Asked Questions about Andrew Lavin

What company does Andrew Lavin work for?

Andrew Lavin works for Amd

What is Andrew Lavin's role at the current company?

Andrew Lavin's current role is Fellow.

What is Andrew Lavin's email address?

Andrew Lavin's email address is aj****@****ail.com

What is Andrew Lavin's direct phone number?

Andrew Lavin's direct phone number is +140856*****

What schools did Andrew Lavin attend?

Andrew Lavin attended Caltech.

What skills is Andrew Lavin known for?

Andrew Lavin has skills like Linux, C++, Objective C, Programming, Software Development, Computer Vision, Unix, Image Processing, C, Object Oriented Design, Ios, Pattern Recognition.

Who are Andrew Lavin's colleagues?

Andrew Lavin's colleagues are Malcolm Stevens, Shankarlal Suthar, Srikanth Kakkirala, Nurullah Akuş, Abdelrhman Hamada, Dani Hashweh, Pawan Kumar Rukmangada.

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.