π¨π»βπ» Advitya Gemawat Email and Phone Number
π¨π»βπ» Advitya Gemawat work email
- Valid
- Valid
- Valid
π¨π»βπ» Advitya Gemawat personal email
π¨π»βπ» Advitya Gemawat phone numbers
At the age of 17, I went 10,000 miles away from pretty much everyone I knew in my life to a diametrically opposite end of the Earth, when I arrived at UC San Diego. Without any exposure of life in the US or any programming background, well, I started learning quite a few things from scratch!I recently graduated Magna Cum Laude from UC San Diego specializing in Data Science and Machine Learning. Not only did I specifically major in 'Data Science', I've had a history of past industry experiences from spearheading Data Science efforts in a start-up to interning at 4 MNCs and Fortune 100 companies as a Data Scientist / ML Engineer. As an undergrad Data Science major, I've also had a proven track record of exceptionally securing Graduate-level internships as an undergrad, all while engaging in Machine Learning Systems research, securing multiple research scholarships and awards, guiding junior students with college mentorships programs and Model UN, and maintaining an increasing GPA trajectory to graduate within the top 6% of the graduating class of 2021. I have had applied experiences in Data Science and Machine Learning with some of the leaders in the Financial, Consulting, Pricing, and Enterprise Software domains, engaging in projects such as:β‘οΈ Inventing a tool with a novel workflow to parse US Companies' Filings β‘οΈ Developing a chatbot for a $200,000 client proposalβ‘οΈ Formulating a pre-processing framework to automatically flag warnings for bad feature combinations for 50+ global pricing models β‘οΈ Implementing 4 optimization algorithms in large-scale Deep Learning systems dealing with petabytes of dataAt 21 years, I became the youngest member in the 4th Cohort of the Microsoft AI Development Acceleration Program (MAIDAP).Feel free to get in touch!Appointments: calendly.com/agemawatSpeaking Request: bit.ly/AdvityaGemawatSpeakerDisclaimer: All opinions posted on LinkedIn are my own and do not reflect or represent my employer or any other entity.Big Data | Advanced Analytics | Data Mining | Hadoop | Visualization | Deep Learning | Business Intelligence | Clustering | Database | SQL | PostgreSQL | Data Warehousing | Data Engineering | Java | R | Spark | PyTorch | MATLAB | Shell | Flask | HTML | CSS | JavaScript | jQuery | Ajax | Latex | Docker | MADlib | Greenplum | AutoML | Massively Parallel Processing (MPP) | Hyperband | Population Based Training (PBT) | Successive Halving Algorithm (SHA)
Microsoft
View- Website:
- microsoft.com
- Employees:
- 10
- Company phone:
- 0124 415 8000
-
Machine Learning Engineer 2 - Llm EvaluationMicrosoft Feb 2024 - PresentRedmond, Washington, UsEvaluation on Azure AI Foundry -
Machine Learning Engineer 2 - Responsible AiMicrosoft Sep 2023 - Jan 2024Redmond, Washington, UsResponsible AI for Computer Vision -
Machine Learning Engineer - Responsible AiMicrosoft Jun 2023 - Sep 2023Redmond, Washington, UsResponsible AI team under Azure ML & Azure AI Studio product lines | Azure AI Platform org -
Machine Learning Research EngineerMicrosoft Jul 2021 - Jun 2023Redmond, Washington, UsMicrosoft AI Development Acceleration Program (MAIDAP) [Programβs acceptance rate ~= 0.2%]Spring 2023 Rotation: Azure Machine Learning (AML) team, AI Platform org & Microsoft Research (MSR)Fall 2022 Rotation: AIOps Gandalf Team, Health & Standards org within Azure Edge & Platform (AEP)Spring 2022 Rotation: Gray Systems Lab (GSL), Azure Data orgFall 2021 Rotation: Signal Quality team, Azure Quality Infrastructure (AQI) orgOne of the most elite New Grad programs in all of Microsoft, located at the "most innovative square mile on the planet"!L60: Mar 2022 - PresentL59: Jul 2021 - Feb 2022 β’ 7 mos -
Industry MentorUw'S Global Innovation Exchange Jan 2024 - Mar 2024Bellevue, Wa, UsIndustry Mentor for the 'GenAI Evaluation Copilot' Grad Capstone Project, sponsored by Microsoft. -
Deep Learning Systems Researcher, Advanced Data Analytics (Ada) LabUc San Diego Computer Science And Engineering Department (Cse) Sep 2020 - Jun 2021La Jolla, California, UsIdeated 'GraphGem: Optimized Scalable System for Graph Convolutional Networks' (Top 6 @ ACM SIGMOD SRC 2021) and advised by Prof Dr. Arun Kumar.Poster Link - https://cs.uwaterloo.ca/~xihe/srcposter2021/Posters/UG1-4.pdfAbstract Link - https://dl.acm.org/doi/abs/10.1145/3448016.3450573 -
Guest Speaker, Graduate Databases Seminar (Cse 239A)Uc San Diego Computer Science And Engineering Department (Cse) Oct 2020 - Oct 2020La Jolla, California, UsAn undergraduate Guest Speaker in a Graduate-level Databases seminar course.Gave a talk on Project 'Vista', recipient of the HDSI Scholarship, HDSI Scholarship Project Award, and the CRA Outstanding Researcher Award nomination (Microsoft Research sponsored).Abstract:Scalable systems for ML are largely siloed into dataflow systems for structured data and DL systems for unstructured data. This gap has left workloads that jointly analyze both forms of data with poor systems support, leading to both low system efficiency and grunt work. We plan to bridge this gap for a class of such workloads: feature transfer from Deep NNs for analyzing unstructured (images, text etc) along with structured data. Vista is a new data system that resolves systems issues by elevating entire transfer learning workloads to a declarative level on top of Parallel Dataflow (PD) and DL systems. Vista automatically optimizes configuration and execution of this workload to reduce computational redundancy and improve workload reliability. In this talk, I talk about some of the extensions I worked on for Vista, along with some of my current efforts to expand Vista to support general transfer learning workloads, along with the broader goal to integrate Vista with Cerebro.Speaker bio:Advitya Gemawat is a fourth-year undergraduate student advised by Prof. Arun Kumar at UC San Diego. His research interests lie along the intersections of Machine Learning and scalable Systems, an emerging area which is increasingly referred to as Systems for ML. He's a recipient of the HDSI Undergraduate Scholarship, Siemens SRC Scholarship, and has been nominated for the CRA Outstanding Undergraduate Research Award (sponsored by Microsoft Research). He's previously interned at Fidelity International, PwC, HP, and recently at VMware, and has spearheaded hyperparameter optimization and AutoML capabilities in the Apache MADlib library's Deep Learning module, all of which have been released in the version 1.18.0. -
Machine Learning Systems Research Assistant, Advanced Data Analytics (Ada) LabUc San Diego Computer Science And Engineering Department (Cse) Feb 2019 - Sep 2020La Jolla, California, UsAmong 20 proposals selected from UC San Diego for HDSI's Undergraduate Research Scholarship Program for the 2020 budget year.β’ Integrated support and declarative hyperparameter tuning for 5 downstream ML models into the Labβs CNN workflow optimizer with PySpark.β’ Evaluated, visualized and compared end-to-end runtimes of 6 downstream models by running experiments with Spark clusters under a distributed set-up.β’ Automated run-time experiments by writing custom Shell scripts to initialize and run all processes in background, saving 5+ hours per manual experimental run.β’ Systemized a gap in scalable feature transfer for multimodal analytics by identifying 4500x memory blow-ups with BERT inference workloads.Sponsored by VMware, NSF and NIH -
Undergraduate Research Scholar [Hdsi Best Project Award Winner π₯]HalΔ±cΔ±oΔlu Data Science Institute, Uc San Diego Dec 2019 - Dec 2020San Diego, California, Us*Secured the HDSI Undergraduate Scholarship Project Award as part of the graduating class of 2021*Awarded $2500 scholarship as tuition reduction for 2020 to support my educational activities at UC San Diego by contributing to an 'End-to-End Declarative Transfer Learning System for Multimodal Analytics with Deep Neural Networks'.Project 'Vista': An End-to-End Declarative Transfer Learning System for Multimodal Analytics with Deep Neural Networks Mentor: Prof Dr. Arun KumarScalable systems for ML are largely siloed into dataflow systems for structured data and Deep Learning (DL) systems for unstructured data. This gap has left workloads that jointly analyze both forms of data with poor systems support, leading to both low system efficiency and grunt work. We plan to bridge this gap for a class of such workloads: feature transfer from Deep NNs for analyzing unstructured (images, text etc) along with structured data. Vista is a new data system that resolves systems issues by elevating entire transfer learning workloads to a declarative level on top of Parallel Dataflow (PD) and DL systems. Vista automatically optimizes configuration and execution of such workloads to reduce computational redundancy and improve workload reliability. Initial experimental results show that Vista can enable speedups up to 10x compared to existing baseline approaches.We base Vista on 3 design decisions: Declarativity to simplify specification, Execution Optimization to reduce runtimes, and Memory Management and Configuration Optimization to avoid crashes. -
Apache Madlib ContributorThe Apache Software Foundation Aug 2020 - Sep 2020Wilmington, Delaware, UsOne among ~56 Apache MADlib contributors worldwide at the time. One of the youngest Apache MADlib contributors in the world. Deep Learning module: Hyperparameter Optimization for Model SelectionAdded support for the following UDFs: β‘οΈ `generate_model_configs` to perform grid and random search for deep learning model selection β‘οΈ `hyperband_schedule` utility function for users to pick and choose the schedule they wish to use for hyperbandβ‘οΈ `madlib_keras_automl` with AutoML capabilities such as Hyperband and Hyperopt with novel algorithmic optimizations for Massively Parallel Processing (MPP) databasesAll contributions officially released as features in MADlib version 1.18.0!Contributions made as part of internship at VMware. -
Deep Learning Engineer - Advanced Analytics, Greenplum R&DVmware Jun 2020 - Sep 2020Palo Alto, Ca, UsSupposedly the first undergraduate student researcher in VMware's history to first-author a research blog on the company's product blog site ["Massively Parallel Automated Model Building for Deep Learning" | media attached below]Advanced Analytics team for Greenplum Database R&D | Modern Applications Platform Business Unit (MAPBU) | VMware Tanzu portfolio | 15 weeks full-time internshipβ’ Introduced 5 hyperparameter search algorithms and 3 new APIs with python and postgres to automate deep learning model selection in Apache MADlib.β’ Pioneered 2 AutoML capabilities with Hyperband and Hyperopt optimized for MPP and Model-Hopper Parallelism to accelerate end-to-end model selection.β’ Revolutionized hyperparameter optimization MADlibβs DL module in released version 1.18.0, becoming one of the youngest Apache MADlib contributors in the world! -
Data Scientist - Global Pricing AnalyticsHp Jan 2020 - Jun 2020Palo Alto, Ca, UsExceptionally selected in HP's MS/PhD Data Science Intern program as an undergraduate. Global Pricing Analytics team, Chief Commercial Office Organizationβ’ Formulated a pre-processing automation tool to flag warnings related to bad feature combinations for 50+ pricing models refreshed quarterly for all of HPβs products globally.β’ Upgraded pre-processing functionality to parse features via 10+ declarative config options to provide warnings and time-series plots.β’ Accelerated execution speed-ups from O(N^2) to O(N) by integrating on-demand capabilities with the tool.β’ Amalgamated 4 internal metrics to measure impact of individual feature drivers around ad-spend, price, seasonality, inventory etc. -
Product Manager, AnalyticsMetrim Data Jul 2019 - Sep 2019Contributed to a specialized Web Analytics platform to extract and predict insights related with demographics, user engagements and social ideologies from social media sites using language models and IBM Watson. -
Data ScientistMetrim Data Mar 2018 - Jun 2019Joined the student-run projects group as the first Data Scientist.β’ Mined 42000+ tweets with #dillydilly from December 2017 to April 2018 using tweepy to access the Twitter Application Program Interface (API).β’ Extracted qualitative understanding about dillydillyβs usage in contexts of sports, social events, top users, public holidays etc as a whole and as 24-hour time windows through a Topic Model.β’ Visualized the word embeddings extracted with Stanford's Global Vectors for Word Representation (GloVe) Model using t-distributed Stochastic Neighbor Embedding (t-SNE) and trained a seq2seq Recurrent Neural Network (LSTM) model on embedded tweet data to autocomplete tweets.β’ Automated detection of token-oriented complaints in Reddit's Comments Data using the Valence Aware Dictionary for sEntiment Reasoner (VADER) modelβs polarity and feature parameters.β’ Engineered a βControversiality Detectorβ with feature extraction and stacking Decision Tree based models on a 2nd-level eXtreme Gradient Boosting (XGBoost) Classifier on Redditβs Comments Data to enhance accuracy by 62.7%. β’ Predicted the Amazon Q/A data's close-ended answers by 99.726% with a Support Vector Machine (SVM) Classifier.β’ Developed a Chatbot with Amazon Q/A dataset and text2query mappings and generated most similar questions to user input by leveraging Term Frequency - Inverse Document Frequency (tf-idf) vectors and Semantic Parsing.β’ Designed a 'question2SQL' framework in a seq2seq NMT model with Keras embeddings and accurately evaluated the position of a query keyword by 80%. -
Full Stack Data ScientistPwc Jun 2019 - Sep 2019GbLoS: Advisory Sub-SBU/Team: Analyticsβ’ Designed an ER diagram to represent the data model stored after ETL processes on disparate city-wide government data.β’ Optimized topic detection to analyze customer reviews with POS-tagging and integrated NLTKβs similarity dictionary to reduce redundancy in category representations by 54.26%.β’ Fabricated the functionality and UI of a Chatbot supporting autocorrect, multi-language translations, transliteration, feedback using HTML, CSS, Ajax, jQuery and Flask as part of a proposal for a $200,000 project. -
Data Scientist (Technology - Ftc)Fidelity International Jun 2018 - Aug 2018London, London, Gbβ’ Invented a novel tool to automate extraction and classification of specific information within US Companies' Filings by 100%. β’ Spearheaded the construction of systems and pipelines involving Web Scraping, Information Retrieval and Natural Language Processing (NLP) for investment research purposes.β’ Achieved accurate topic forecasting of corpora of financial texts by 70% using a Multinomial Naive Bayes Classifier. -
Foreign Exchange Program & Ε½ilina Model UnZilina Self Governing Region Apr 2015 - Apr 2015Ε½ilina, SkWent for ZAMUN and Foreign Exchange to Slovakia and stayed with a host family. Secured the Best Distinguished Delegate award as Delegate of Syria in UN Human Rights Council, discussing measures on eradicating Law Enforcement Corruption in regions with Human Trafficking.
π¨π»βπ» Advitya Gemawat Skills
π¨π»βπ» Advitya Gemawat Education Details
-
HalΔ±cΔ±oΔlu Data Science Institute, Uc San DiegoData Science -
Uc San DiegoData Science
Frequently Asked Questions about π¨π»βπ» Advitya Gemawat
What company does π¨π»βπ» Advitya Gemawat work for?
π¨π»βπ» Advitya Gemawat works for Microsoft
What is π¨π»βπ» Advitya Gemawat's role at the current company?
π¨π»βπ» Advitya Gemawat's current role is "25 under 25: Top Data Science Contributors and Thought-Leaders" β‘οΈ Influencer through Data π Operationalizing enterprise-grade Responsible AI tools π§°.
What is π¨π»βπ» Advitya Gemawat's email address?
π¨π»βπ» Advitya Gemawat's email address is advitya.gemawat@hp.com
What is π¨π»βπ» Advitya Gemawat's direct phone number?
π¨π»βπ» Advitya Gemawat's direct phone number is +185840*****
What schools did π¨π»βπ» Advitya Gemawat attend?
π¨π»βπ» Advitya Gemawat attended HalΔ±cΔ±oΔlu Data Science Institute, Uc San Diego, Uc San Diego.
What skills is π¨π»βπ» Advitya Gemawat known for?
π¨π»βπ» Advitya Gemawat has skills like Data Analysis, Machine Learning, Data Structures, Python, Data Mining, Data Visualization, Big Data, Matlab, Sql, R, Public Speaking, Microsoft Excel.
Who are π¨π»βπ» Advitya Gemawat's colleagues?
π¨π»βπ» Advitya Gemawat's colleagues are Bradley Wright, Emiko Shimono, Palash Agrawal, Sherin D., Pankaj Bharti, Anand Rajeswaran, Khasipa Khoirunnisa.
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records Γ $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial