I am a Machine Learning Engineer with a strong technical foundation in data science, software development, and AI-driven applications, built upon a Master’s in Computer Science from North Carolina State University. My expertise spans advanced technologies, including NLP, computer vision, and advancements like Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Generative AI, and MLOps. I’ve delivered impactful solutions like a stable diffusion Generative AI model fine-tuned and deployed using AWS SageMaker, a GCP-hosted NLP tool with 96% accuracy, a traffic monitoring system reducing false positives by 20%, and an interactive education platform that boosted user engagement by 40%.My background encompasses end-to-end project delivery, from data analysis and model development to deployment on cloud platforms such as AWS and GCP using Docker and FastAPI. With a solid command of frameworks like TensorFlow, PyTorch, and SQL, I excel in building robust, high-performance systems that align with business goals and deliver measurable outcomes. My full-stack skills with FastAPI, ReactJS, and TypeScript enable me to develop dynamic, user-centric applications that enhance product value.I’m currently seeking Machine Learning Engineering, Data Science, and Software Development(AI/ML) roles that allow me to leverage my technical expertise and innovative approach to drive meaningful, data-powered solutions. I am eager to join forward-thinking teams that are pushing the boundaries of AI, creating scalable, impactful products that elevate user experiences and business performance.
-
Genai DeveloperWhiztek Corp | It ServicesChicago, Il, Us -
Research Engineer @ Raise LabNorth Carolina State University May 2024 - PresentRaleigh, North Carolina, UsCurrently working on Leveraging LLMs in enhancing both feature extraction from unstructured data and user interaction.- Feature Extraction: Extract insights from unstructured data such as commit messages or issues using GPT-4.- User Interaction: Enable natural language querying and responses for user-friendly interfaces using ChatGPT.- Code and Issue Labeling: Categorize and evaluate the impact of code commits and issues using CodeBERT. -
Graduate Research Assistant @ Raise LabNorth Carolina State University Jan 2023 - Apr 2024Raleigh, North Carolina, UsMachine Learning Expertise for Open-Source Computational Science (OSCS) Health Prediction:- Developed a transferable prediction model (95% accuracy) using hierarchical BIRCH clustering with Random Forest Regressor, to assess OSCS project health, analyzing 150+ GitHub projects.- Conducted statistical tests to assess the accuracy of the predictive models.- Leveraged SHAP and LIME for interpretable insights, contributing to project sustainability and stakeholder trust.- Automated data mining from diverse sources (150+ GitHub projects) for large-scale health assessment.Optimized Regression Models for OSCS Health Metrics:- Implemented HyperOpt to optimize regression models for predicting OSCS health metrics, reducing mean squared error by 40%.- Improved model performance and efficiency for more accurate and timely health assessments.- Evaluated and compared various regression techniques for optimal model selection.- Conducted extensive literature review and documentation as a part of the research.Open-Source Contribution and Research Collaboration in Computational Science:- Developed and shared open-source tools utilizing data mining and CART-based models for OSCS health prediction, contributing to the field's advancement.- Participated in an NSF-funded research project on adapting Empirical SE to Computational Science, applying expertise in hierarchical clustering and ensemble learning(NSF Award: https://www.nsf.gov/awardsearch/showAward?AWD_ID=1931425). -
Data Science InternFidelity Investments Jun 2023 - Aug 2023Boston, Ma, UsDeveloped and Deployed Generative AI Model:- Built a production-ready text-to-image model using AWS SageMaker for Fidelity's internal platform.- Leveraged Prompt Engineering and LoRA to reduce errors and accelerate fine-tuning for specific use cases (portraits, digital art, Adobe Firefly).Implemented Comprehensive Data Analysis Pipeline:- Evaluated a potential Fidelity software for automated data analysis on a massive 2.1 million-participant dataset.- Performed rigorous Exploratory Data Analysis (EDA) including data cleaning, transformation, feature engineering, and advanced data visualization. Created comprehensive dashboards using AWS SageMaker and TensorBoard to monitor metrics, detect data leakage, analyze data correlation, and present statistical plots. - These visualizations enabled deeper insights into data patterns and facilitated the presentation of evaluations and recommendations to senior management, significantly impacting decisions on software adoption. -
Machine Learning EngineerOmdena May 2021 - Jul 2021New York, Ny, UsDesigned and deployed a user-driven NLP data solution:- Reduced information gathering time by 70% compared to manual methods.- Empowered users to define specific topics, interests, and timeframes for personalized information retrieval.Developed a high-accuracy web scraping tool (96%):- Leveraged popular Python libraries (Beautiful Soup, Scrapy, Selenium) for scraping and data extraction.- Employed deep learning models (LSTM, BiLSTM, Transformers) for document summarization, sentiment analysis, and natural language understanding.- Achieved 96% accuracy in data extraction from diverse sources (webpages, images, databases, social media).Seamless deployment and user interaction:- Deployed the tool on the Google Cloud Platform (GCP) for scalability and accessibility.- Developed a user-friendly Streamlit front-end for intuitive interaction and data exploration. -
Research FellowPune Institute Of Computer Technology Jan 2021 - May 2021Pune, Maharashtra, InDeveloped ensemble-based models for plant seedling classification and weed detection:- High accuracy: Achieved 95.32% recall and 99.7% overall accuracy, outperforming existing techniques.- Novel approach: Combined shallow CNN for feature extraction with K-Nearest Neighbors for classification, refined through cross-validation.- Unique data augmentation: Implemented a technique using bilateral blurring, LAB colorspace conversion, segmentation, and image masking for improved performance.- Presented at conference: Work showcased at ICACIE'21, demonstrating its significance in the field. -
Software Development InternCarikture Jul 2020 - Sep 2020Gurgaon, Haryana, InDeveloped and launched a data-driven educational platform for 5,000+ users:Technology stack: Electron.js, React.js (front-end), Node.js(backend)Impact:- Increased user engagement by 40% through personalized learning paths and interactive data dashboards (Tableau).- Enhanced personalization: Utilized machine learning models (recommendation systems) for tailored learning paths.- Improved user experience: Conducted A/B testing and optimized UI elements (ReactJS, CSS, Bootstrap).Key achievements:- Designed engaging data visualizations to track user progress and platform usage.- Integrated machine learning for personalized learning recommendations.- Optimized user experience through A/B testing and UI/UX improvements.- Increased user engagement and platform usage with data-driven insights.
Kunal Shah Education Details
-
North Carolina State UniversityComputer Science -
Pune Institute Of Computer TechnologyComputer Science Engineering -
Sanjay Ghodawat InstitutePcm
Frequently Asked Questions about Kunal Shah
What company does Kunal Shah work for?
Kunal Shah works for Whiztek Corp | It Services
What is Kunal Shah's role at the current company?
Kunal Shah's current role is GenAI Developer.
What schools did Kunal Shah attend?
Kunal Shah attended North Carolina State University, Pune Institute Of Computer Technology, Sanjay Ghodawat Institute.
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial