πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat Email and Phone Number

"25 under 25: Top Data Science Contributors and Thought-Leaders" ➑️ Influencer through Data πŸ“ˆ Operationalizing enterprise-grade Responsible AI tools 🧰 @ Microsoft
Redmond, WA
πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's Location
Greater Seattle Area, United States, United States
πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's Contact Details

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat work email

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat personal email

n/a

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat phone numbers

About πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat

At the age of 17, I went 10,000 miles away from pretty much everyone I knew in my life to a diametrically opposite end of the Earth, when I arrived at UC San Diego. Without any exposure of life in the US or any programming background, well, I started learning quite a few things from scratch!I recently graduated Magna Cum Laude from UC San Diego specializing in Data Science and Machine Learning. Not only did I specifically major in 'Data Science', I've had a history of past industry experiences from spearheading Data Science efforts in a start-up to interning at 4 MNCs and Fortune 100 companies as a Data Scientist / ML Engineer. As an undergrad Data Science major, I've also had a proven track record of exceptionally securing Graduate-level internships as an undergrad, all while engaging in Machine Learning Systems research, securing multiple research scholarships and awards, guiding junior students with college mentorships programs and Model UN, and maintaining an increasing GPA trajectory to graduate within the top 6% of the graduating class of 2021. I have had applied experiences in Data Science and Machine Learning with some of the leaders in the Financial, Consulting, Pricing, and Enterprise Software domains, engaging in projects such as:➑️ Inventing a tool with a novel workflow to parse US Companies' Filings ➑️ Developing a chatbot for a $200,000 client proposal➑️ Formulating a pre-processing framework to automatically flag warnings for bad feature combinations for 50+ global pricing models ➑️ Implementing 4 optimization algorithms in large-scale Deep Learning systems dealing with petabytes of dataAt 21 years, I became the youngest member in the 4th Cohort of the Microsoft AI Development Acceleration Program (MAIDAP).Feel free to get in touch!Appointments: calendly.com/agemawatSpeaking Request: bit.ly/AdvityaGemawatSpeakerDisclaimer: All opinions posted on LinkedIn are my own and do not reflect or represent my employer or any other entity.Big Data | Advanced Analytics | Data Mining | Hadoop | Visualization | Deep Learning | Business Intelligence | Clustering | Database | SQL | PostgreSQL | Data Warehousing | Data Engineering | Java | R | Spark | PyTorch | MATLAB | Shell | Flask | HTML | CSS | JavaScript | jQuery | Ajax | Latex | Docker | MADlib | Greenplum | AutoML | Massively Parallel Processing (MPP) | Hyperband | Population Based Training (PBT) | Successive Halving Algorithm (SHA)

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's Current Company Details
Microsoft

Microsoft

View
"25 under 25: Top Data Science Contributors and Thought-Leaders" ➑️ Influencer through Data πŸ“ˆ Operationalizing enterprise-grade Responsible AI tools 🧰
Redmond, WA
Website:
microsoft.com
Employees:
10
Company phone:
0124 415 8000
πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat Work Experience Details
  • Microsoft
    Machine Learning Engineer 2 - Llm Evaluation
    Microsoft Feb 2024 - Present
    Redmond, Washington, Us
    Evaluation on Azure AI Foundry
  • Microsoft
    Machine Learning Engineer 2 - Responsible Ai
    Microsoft Sep 2023 - Jan 2024
    Redmond, Washington, Us
    Responsible AI for Computer Vision
  • Microsoft
    Machine Learning Engineer - Responsible Ai
    Microsoft Jun 2023 - Sep 2023
    Redmond, Washington, Us
    Responsible AI team under Azure ML & Azure AI Studio product lines | Azure AI Platform org
  • Microsoft
    Machine Learning Research Engineer
    Microsoft Jul 2021 - Jun 2023
    Redmond, Washington, Us
    Microsoft AI Development Acceleration Program (MAIDAP) [Program’s acceptance rate ~= 0.2%]Spring 2023 Rotation: Azure Machine Learning (AML) team, AI Platform org & Microsoft Research (MSR)Fall 2022 Rotation: AIOps Gandalf Team, Health & Standards org within Azure Edge & Platform (AEP)Spring 2022 Rotation: Gray Systems Lab (GSL), Azure Data orgFall 2021 Rotation: Signal Quality team, Azure Quality Infrastructure (AQI) orgOne of the most elite New Grad programs in all of Microsoft, located at the "most innovative square mile on the planet"!L60: Mar 2022 - PresentL59: Jul 2021 - Feb 2022 β€’ 7 mos
  • Uw'S Global Innovation Exchange
    Industry Mentor
    Uw'S Global Innovation Exchange Jan 2024 - Mar 2024
    Bellevue, Wa, Us
    Industry Mentor for the 'GenAI Evaluation Copilot' Grad Capstone Project, sponsored by Microsoft.
  • Uc San Diego Computer Science And Engineering Department (Cse)
    Deep Learning Systems Researcher, Advanced Data Analytics (Ada) Lab
    Uc San Diego Computer Science And Engineering Department (Cse) Sep 2020 - Jun 2021
    La Jolla, California, Us
    Ideated 'GraphGem: Optimized Scalable System for Graph Convolutional Networks' (Top 6 @ ACM SIGMOD SRC 2021) and advised by Prof Dr. Arun Kumar.Poster Link - https://cs.uwaterloo.ca/~xihe/srcposter2021/Posters/UG1-4.pdfAbstract Link - https://dl.acm.org/doi/abs/10.1145/3448016.3450573
  • Uc San Diego Computer Science And Engineering Department (Cse)
    Guest Speaker, Graduate Databases Seminar (Cse 239A)
    Uc San Diego Computer Science And Engineering Department (Cse) Oct 2020 - Oct 2020
    La Jolla, California, Us
    An undergraduate Guest Speaker in a Graduate-level Databases seminar course.Gave a talk on Project 'Vista', recipient of the HDSI Scholarship, HDSI Scholarship Project Award, and the CRA Outstanding Researcher Award nomination (Microsoft Research sponsored).Abstract:Scalable systems for ML are largely siloed into dataflow systems for structured data and DL systems for unstructured data. This gap has left workloads that jointly analyze both forms of data with poor systems support, leading to both low system efficiency and grunt work. We plan to bridge this gap for a class of such workloads: feature transfer from Deep NNs for analyzing unstructured (images, text etc) along with structured data. Vista is a new data system that resolves systems issues by elevating entire transfer learning workloads to a declarative level on top of Parallel Dataflow (PD) and DL systems. Vista automatically optimizes configuration and execution of this workload to reduce computational redundancy and improve workload reliability. In this talk, I talk about some of the extensions I worked on for Vista, along with some of my current efforts to expand Vista to support general transfer learning workloads, along with the broader goal to integrate Vista with Cerebro.Speaker bio:Advitya Gemawat is a fourth-year undergraduate student advised by Prof. Arun Kumar at UC San Diego. His research interests lie along the intersections of Machine Learning and scalable Systems, an emerging area which is increasingly referred to as Systems for ML. He's a recipient of the HDSI Undergraduate Scholarship, Siemens SRC Scholarship, and has been nominated for the CRA Outstanding Undergraduate Research Award (sponsored by Microsoft Research). He's previously interned at Fidelity International, PwC, HP, and recently at VMware, and has spearheaded hyperparameter optimization and AutoML capabilities in the Apache MADlib library's Deep Learning module, all of which have been released in the version 1.18.0.
  • Uc San Diego Computer Science And Engineering Department (Cse)
    Machine Learning Systems Research Assistant, Advanced Data Analytics (Ada) Lab
    Uc San Diego Computer Science And Engineering Department (Cse) Feb 2019 - Sep 2020
    La Jolla, California, Us
    Among 20 proposals selected from UC San Diego for HDSI's Undergraduate Research Scholarship Program for the 2020 budget year.β€’ Integrated support and declarative hyperparameter tuning for 5 downstream ML models into the Lab’s CNN workflow optimizer with PySpark.β€’ Evaluated, visualized and compared end-to-end runtimes of 6 downstream models by running experiments with Spark clusters under a distributed set-up.β€’ Automated run-time experiments by writing custom Shell scripts to initialize and run all processes in background, saving 5+ hours per manual experimental run.β€’ Systemized a gap in scalable feature transfer for multimodal analytics by identifying 4500x memory blow-ups with BERT inference workloads.Sponsored by VMware, NSF and NIH
  • HalΔ±cΔ±oğlu Data Science Institute, Uc San Diego
    Undergraduate Research Scholar [Hdsi Best Project Award Winner πŸ₯‡]
    Halıcıoğlu Data Science Institute, Uc San Diego Dec 2019 - Dec 2020
    San Diego, California, Us
    *Secured the HDSI Undergraduate Scholarship Project Award as part of the graduating class of 2021*Awarded $2500 scholarship as tuition reduction for 2020 to support my educational activities at UC San Diego by contributing to an 'End-to-End Declarative Transfer Learning System for Multimodal Analytics with Deep Neural Networks'.Project 'Vista': An End-to-End Declarative Transfer Learning System for Multimodal Analytics with Deep Neural Networks Mentor: Prof Dr. Arun KumarScalable systems for ML are largely siloed into dataflow systems for structured data and Deep Learning (DL) systems for unstructured data. This gap has left workloads that jointly analyze both forms of data with poor systems support, leading to both low system efficiency and grunt work. We plan to bridge this gap for a class of such workloads: feature transfer from Deep NNs for analyzing unstructured (images, text etc) along with structured data. Vista is a new data system that resolves systems issues by elevating entire transfer learning workloads to a declarative level on top of Parallel Dataflow (PD) and DL systems. Vista automatically optimizes configuration and execution of such workloads to reduce computational redundancy and improve workload reliability. Initial experimental results show that Vista can enable speedups up to 10x compared to existing baseline approaches.We base Vista on 3 design decisions: Declarativity to simplify specification, Execution Optimization to reduce runtimes, and Memory Management and Configuration Optimization to avoid crashes.
  • The Apache Software Foundation
    Apache Madlib Contributor
    The Apache Software Foundation Aug 2020 - Sep 2020
    Wilmington, Delaware, Us
    One among ~56 Apache MADlib contributors worldwide at the time. One of the youngest Apache MADlib contributors in the world. Deep Learning module: Hyperparameter Optimization for Model SelectionAdded support for the following UDFs: ➑️ `generate_model_configs` to perform grid and random search for deep learning model selection ➑️ `hyperband_schedule` utility function for users to pick and choose the schedule they wish to use for hyperband➑️ `madlib_keras_automl` with AutoML capabilities such as Hyperband and Hyperopt with novel algorithmic optimizations for Massively Parallel Processing (MPP) databasesAll contributions officially released as features in MADlib version 1.18.0!Contributions made as part of internship at VMware.
  • Vmware
    Deep Learning Engineer - Advanced Analytics, Greenplum R&D
    Vmware Jun 2020 - Sep 2020
    Palo Alto, Ca, Us
    Supposedly the first undergraduate student researcher in VMware's history to first-author a research blog on the company's product blog site ["Massively Parallel Automated Model Building for Deep Learning" | media attached below]Advanced Analytics team for Greenplum Database R&D | Modern Applications Platform Business Unit (MAPBU) | VMware Tanzu portfolio | 15 weeks full-time internshipβ€’ Introduced 5 hyperparameter search algorithms and 3 new APIs with python and postgres to automate deep learning model selection in Apache MADlib.β€’ Pioneered 2 AutoML capabilities with Hyperband and Hyperopt optimized for MPP and Model-Hopper Parallelism to accelerate end-to-end model selection.β€’ Revolutionized hyperparameter optimization MADlib’s DL module in released version 1.18.0, becoming one of the youngest Apache MADlib contributors in the world!
  • Hp
    Data Scientist - Global Pricing Analytics
    Hp Jan 2020 - Jun 2020
    Palo Alto, Ca, Us
    Exceptionally selected in HP's MS/PhD Data Science Intern program as an undergraduate. Global Pricing Analytics team, Chief Commercial Office Organizationβ€’ Formulated a pre-processing automation tool to flag warnings related to bad feature combinations for 50+ pricing models refreshed quarterly for all of HP’s products globally.β€’ Upgraded pre-processing functionality to parse features via 10+ declarative config options to provide warnings and time-series plots.β€’ Accelerated execution speed-ups from O(N^2) to O(N) by integrating on-demand capabilities with the tool.β€’ Amalgamated 4 internal metrics to measure impact of individual feature drivers around ad-spend, price, seasonality, inventory etc.
  • Metrim Data
    Product Manager, Analytics
    Metrim Data Jul 2019 - Sep 2019
    Contributed to a specialized Web Analytics platform to extract and predict insights related with demographics, user engagements and social ideologies from social media sites using language models and IBM Watson.
  • Metrim Data
    Data Scientist
    Metrim Data Mar 2018 - Jun 2019
    Joined the student-run projects group as the first Data Scientist.β€’ Mined 42000+ tweets with #dillydilly from December 2017 to April 2018 using tweepy to access the Twitter Application Program Interface (API).β€’ Extracted qualitative understanding about dillydilly’s usage in contexts of sports, social events, top users, public holidays etc as a whole and as 24-hour time windows through a Topic Model.β€’ Visualized the word embeddings extracted with Stanford's Global Vectors for Word Representation (GloVe) Model using t-distributed Stochastic Neighbor Embedding (t-SNE) and trained a seq2seq Recurrent Neural Network (LSTM) model on embedded tweet data to autocomplete tweets.β€’ Automated detection of token-oriented complaints in Reddit's Comments Data using the Valence Aware Dictionary for sEntiment Reasoner (VADER) model’s polarity and feature parameters.β€’ Engineered a β€˜Controversiality Detector’ with feature extraction and stacking Decision Tree based models on a 2nd-level eXtreme Gradient Boosting (XGBoost) Classifier on Reddit’s Comments Data to enhance accuracy by 62.7%. β€’ Predicted the Amazon Q/A data's close-ended answers by 99.726% with a Support Vector Machine (SVM) Classifier.β€’ Developed a Chatbot with Amazon Q/A dataset and text2query mappings and generated most similar questions to user input by leveraging Term Frequency - Inverse Document Frequency (tf-idf) vectors and Semantic Parsing.β€’ Designed a 'question2SQL' framework in a seq2seq NMT model with Keras embeddings and accurately evaluated the position of a query keyword by 80%.
  • Pwc
    Full Stack Data Scientist
    Pwc Jun 2019 - Sep 2019
    Gb
    LoS: Advisory Sub-SBU/Team: Analyticsβ€’ Designed an ER diagram to represent the data model stored after ETL processes on disparate city-wide government data.β€’ Optimized topic detection to analyze customer reviews with POS-tagging and integrated NLTK’s similarity dictionary to reduce redundancy in category representations by 54.26%.β€’ Fabricated the functionality and UI of a Chatbot supporting autocorrect, multi-language translations, transliteration, feedback using HTML, CSS, Ajax, jQuery and Flask as part of a proposal for a $200,000 project.
  • Fidelity International
    Data Scientist (Technology - Ftc)
    Fidelity International Jun 2018 - Aug 2018
    London, London, Gb
    β€’ Invented a novel tool to automate extraction and classification of specific information within US Companies' Filings by 100%. β€’ Spearheaded the construction of systems and pipelines involving Web Scraping, Information Retrieval and Natural Language Processing (NLP) for investment research purposes.β€’ Achieved accurate topic forecasting of corpora of financial texts by 70% using a Multinomial Naive Bayes Classifier.
  • Zilina Self Governing Region
    Foreign Exchange Program & Ε½ilina Model Un
    Zilina Self Governing Region Apr 2015 - Apr 2015
    Ε½ilina, Sk
    Went for ZAMUN and Foreign Exchange to Slovakia and stayed with a host family. Secured the Best Distinguished Delegate award as Delegate of Syria in UN Human Rights Council, discussing measures on eradicating Law Enforcement Corruption in regions with Human Trafficking.

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat Skills

Data Analysis Machine Learning Data Structures Python Data Mining Data Visualization Big Data Matlab Sql R Public Speaking Microsoft Excel Microsoft Word Postgresql Data Cleaning Microsoft Powerpoint Web Scraping Julia Leadership Rubik's Cube Java

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat Education Details

  • HalΔ±cΔ±oğlu Data Science Institute, Uc San Diego
    Halıcıoğlu Data Science Institute, Uc San Diego
    Data Science
  • Uc San Diego
    Uc San Diego
    Data Science

Frequently Asked Questions about πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat

What company does πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat work for?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat works for Microsoft

What is πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's role at the current company?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's current role is "25 under 25: Top Data Science Contributors and Thought-Leaders" ➑️ Influencer through Data πŸ“ˆ Operationalizing enterprise-grade Responsible AI tools 🧰.

What is πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's email address?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's email address is advitya.gemawat@hp.com

What is πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's direct phone number?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's direct phone number is +185840*****

What schools did πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat attend?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat attended HalΔ±cΔ±oğlu Data Science Institute, Uc San Diego, Uc San Diego.

What skills is πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat known for?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat has skills like Data Analysis, Machine Learning, Data Structures, Python, Data Mining, Data Visualization, Big Data, Matlab, Sql, R, Public Speaking, Microsoft Excel.

Who are πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's colleagues?

πŸ‘¨πŸ»β€πŸ’» Advitya Gemawat's colleagues are Bradley Wright, Emiko Shimono, Palash Agrawal, Sherin D., Pankaj Bharti, Anand Rajeswaran, Khasipa Khoirunnisa.

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.