Swapnil Sinha

Swapnil Sinha Email and Phone Number

Full-stack Developer @ Struction
San Francisco, CA, US
Swapnil Sinha's Location
San Diego, California, United States, United States
About Swapnil Sinha

Machine Learning Engineer and Full-Stack Developer specializing in LLM optimization and scalable cloud systems. Currently pursuing an MS in Data Science at UC San Diego and innovating at Human AI Labs, where I'm working on model quantization for edge AI deployment.I transform complex technical challenges into practical solutions:- Optimized LLaMA3 models for edge devices, reducing model size by 70% while preserving accuracy- Architected microservices handling $500K+ in annual e-commerce revenue- Built ML systems from recommendation engines to computer vision solutions- Deployed large-scale cloud infrastructure on AWS supporting 5000+ usersMy sweet spot? Bridging the gap between cutting-edge ML research and production-ready software. Whether it's fine-tuning LLMs, building RAG systems, or developing full-stack applications, I focus on delivering solutions that drive real business impact.Currently exploring opportunities in AI/ML engineering and distributed systems. Let's connect if you're working on something interesting at the intersection of ML and scalable software!

Swapnil Sinha's Current Company Details
Struction

Struction

View
Full-stack Developer
San Francisco, CA, US
Swapnil Sinha Work Experience Details
  • Struction
    Full-Stack Developer
    Struction
    San Francisco, Ca, Us
  • Personal Ai
    Machine Learning Engineer
    Personal Ai Sep 2024 - Present
    San Diego, California, United States
    • Developing a real-time telephony system with personal AI using FastAPI, integrating NVIDIA Riva for speech-to-text and text-to-speech conversion.
  • Personal Ai
    Machine Learning Intern
    Personal Ai Aug 2024 - Present
    San Diego, California, United States
    Led the quantization of LLaMA3 and Personal Language Conversational AI Models using Qualcomm AI Stack and AIMET, preserving model accuracy for On-Device AI inference.Created an integration C++ API for the loading and execution of quantized LLM model binaries, for inference on Windows and Android edge devices.Integrated with FastAPI by creating a Python wrapper, enabling seamless deployment of the LLM as an application on the Windows platform.
  • Uc San Diego
    Student Software Engineer
    Uc San Diego Jan 2024 - Present
    San Diego, California, United States
    Data Science/Machine Learning Platform (DSMLP)• Update MTL-Attendance for student attendance tracking at UCSD, utilizing Python, AWS Lambda, S3, and Terraform.• Redesigned the Cluster and Pods Status dashboard using React.js, enhancing real-time monitoring of GPU and CPU resources, resulting in a 25% reduction in troubleshooting time.• Developed Airflow DAGs to fetch enrollment data, and manage user drops and adds, enhancing class roster management for over 150 courses.• Enhanced Canvas middleware integration with course management systems by implementing CI/CD regression testing pipelines, resulting in 3% faster enrollment processing and improved deployment stability.• Worked on spinning up Jupyter Notebook environment with GPU support on AWS EKS containerizing JupyterHub instance using Docker and Kubernetes, enabling the on-demand creation of Jupyter instances.
  • San Diego Supercomputer Center
    Machine Learning Researcher
    San Diego Supercomputer Center Jun 2024 - Aug 2024
    San Diego, California, United States
    • Released an LLM-based Retrieval-Augmented Generation (RAG) medical chatbot, leveraging Langchain and LlamaIndex, enabling faster and more efficient access to data and information from medical publications for researchers.• Fine-tuned the Llama3-8B model using LoRA resulting in a 15% increase in response accuracy improving the chatbot’s ability to provide precise medical information.
  • Qburst
    Data Engineer / Software Engineer
    Qburst Aug 2021 - Aug 2023
    Trivandrum, Kerala, India
    • Collaborated with the E-commerce service team to develop backend microservices using Python, Django, and Flask, directly contributing to an annual revenue increase of $500k.• Implemented a Customer onboarding microservice that led to a 20% increase in user registrations in Q1 2023. Enhanced user engagement through personalized onboarding experiences.• Developed an inventory management microservice incorporating real-time updates via Kafka, Lambda, and EventBridge, reducing inventory sync lag by 30% and minimizing stock discrepancies.• Created a delivery tracking microservice with Google Maps integration, which decreased delivery time delays by 8%.• Wrote SQL queries in PostgreSQL for supporting business operations including inventory management, and sales analytics• Built a Backend For Frontend (BFF) service using Node.js, which reduced API response times by 20% increasing the scalability of front-end applications.• Deployed LSTM models using PyTorch for future sales forecasting, achieving an 85% accuracy rate to generate actionable insights, enhancing the reliability of sales predictions, and improving decision-making.• Revamped the software development life cycle by implementing Agile methodologies; accelerated product release cycles by 40% and improved team collaboration.Leveraged collaborative filtering and content-based recommendation systems to personalize restaurant and productrecommendations, resulting in a 25% increase in order frequency and customer engagement.• Developed customer churn prediction models using historical transaction data and behavioral patterns, enabling proactiveretention strategies that reduced churn by 20%.• Implemented A/B testing frameworks to evaluate recommendation system performance, resulting in a 15% improvement in model accuracy and business metrics
  • Numeregion
    Machine Learning Engineer
    Numeregion Aug 2020 - Aug 2021
    Pune, Maharashtra, India
    • Co-authored published two papers on the topics• A Neural Network Model to Predict the Radiation Resistance of Dipole Antenna. (https://ieeexplore.ieee.org/document/9847856)• A Neural Network Model for Effective Dielectric Constant Prediction of a Two-Layered Microstrip TransmissionLine (https://ieeexplore.ieee.org/document/9848322)• Developed Deep Learning solutions for antenna and transmission line design challenges.• Calculated the ideal values for the parameters using Python, TensorFlow, PyTorch, Keras, and TensorFlow.js
  • Isro - Indian Space Research Organization
    Research Intern
    Isro - Indian Space Research Organization Oct 2020 - May 2021
    Worked under the project, Melioration of IEEE802.15.4 wireless communication standard under avionics department in VSSC

Swapnil Sinha Education Details

Frequently Asked Questions about Swapnil Sinha

What company does Swapnil Sinha work for?

Swapnil Sinha works for Struction

What is Swapnil Sinha's role at the current company?

Swapnil Sinha's current role is Full-stack Developer.

What schools did Swapnil Sinha attend?

Swapnil Sinha attended Uc San Diego, Apj Abdul Kalam Technological University.

Not the Swapnil Sinha you were looking for?

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.