NLP Engineer working on the development of applications using Large Language Models (LLMs).Some interesting projects I have worked on:๐๐๐๐๐๐ญ ๐จ๐ ๐
๐๐ฐ-๐๐ก๐จ๐ญ ๐๐ซ๐จ๐ฆ๐ฉ๐ญ๐ข๐ง๐ ๐๐ง๐ ๐๐ฏ๐๐ฅ๐ฎ๐๐ญ๐ข๐จ๐ง ๐จ๐ ๐๐๐๐ฌ ๐ฎ๐ฌ๐ข๐ง๐ ๐ญ๐ก๐ ๐๐ฅ๐๐ฎ๐ญ๐ก๐๐ซ๐๐ ๐๐๐ ๐๐๐ซ๐ง๐๐ฌ๐ฌ:โข Evaluated the generative performance of three language models - OPT, GPTNeo, and Dolly - across benchmark datasets (AI2โs Reasoning Challenge, Adversarial Natural Language Inference, and Winograd Schema Challenge) using various prompt settings: Zero-Shot, One-Shot, Three-Shot, and Five-Shot prompts.โข Observed that model performance on all the benchmarks linearly scales with an increase in model size and there is a significant increase in performance as the number of few-shot (in-context) examples increases in the prompt.๐๐๐๐๐๐ญ ๐จ๐ ๐๐ฉ๐ญ๐ข๐ฆ๐ข๐ณ๐๐ซ ๐๐๐ฅ๐๐๐ญ๐ข๐จ๐ง ๐๐ง๐ ๐๐ฒ๐ฉ๐๐ซ๐ฉ๐๐ซ๐๐ฆ๐๐ญ๐๐ซ ๐๐ฎ๐ง๐ข๐ง๐ ๐จ๐ง ๐๐ซ๐๐ข๐ง๐ข๐ง๐ ๐๐๐๐ข๐๐ข๐๐ง๐๐ฒ ๐๐ง๐ ๐๐๐ ๐๐๐ซ๐๐จ๐ซ๐ฆ๐๐ง๐๐:โข Investigated the impact of optimizer selection and associated hyperparameters on model performance during training across diverse tasks.โข Evaluated the performance of five different optimizers (AdamW, RMSProp, NAG, SGD with Momentum, and SGD) on various natural language processing tasks such as Sentiment Analysis, Question Answering, and Text Summarization. Analyzed the convergence of the best-performing models on each dataset.โข Fine-tuned DistilBERT, BERT, and FinBERT models for Sentiment Analysis on the StockTwits dataset, while DistilBERT, BERT, RoBERTa were fine-tuned for Question Answering on the CoQA dataset. For Text Summarization, BART, DistillBART, and T5 models were fine-tuned on the BillSum dataset.โข Empirical observations highlighted that more general optimizers like RMSProp and AdamW consistently performed as well as, if not better than, specialized optimizers like SGD, Nesterov, or Momentum, given appropriately selected hyperparameters.Active contributor to multiple popular open-source projects.๐๐ข๐ญ๐๐ฎ๐: https://github.com/awinml๐๐๐๐ก๐ง๐ข๐๐๐ฅ ๐ฌ๐ค๐ข๐ฅ๐ฅ๐ฌ:โข Languages: Pythonโข Machine Learning: Numpy, Pandas, Scikit-learn, Keras, PyTorchโข NLP: Hugging Face Transformers, HaystackOpen to interesting conversations and collaboration, happy to connect.๐๐ฆ๐๐ข๐ฅ: ashwinxmathur@gmail.com