PhD candidate studying computational biology & cheminformatics at the department of chemical engineering at Northwestern University. I am also a part of the new pathway development group at the Joint BioEnergy Institute within Lawrence Berkeley National Laboratory. My thesis focuses on the development of retrobiosynthesis software & machine learning models to help synthetic biologists discover novel metabolic pathways to key small-molecules for biomanufacturing. Prior to starting my PhD, I completed my undergraduate degree in chemical engineering at UC Berkeley. While at Berkeley, I took on additional courses in data science and machine learning, which eventually catalyzed my jump to computational biology in graduate school. I now have 4+ years of experience performing statistical analyses and working with popular data science libraries in python. On the classical ML side, I have trained several gradient-boosted tree models on tabular data using XGBoost while on the deep learning side, I have experience training message-passing graph neural networks using PyTorch. All my cheminformatics and molecular modelling projects have been accompanied by extensive use of open-source cheminformatics toolkits, such as RDKit, OpenEye, and Open Babel.• Programming languages: Python, SQL, bash• Technologies: Docker, Django, Celery, Redis, SQLite, PostgreSQL, MongoDB, Streamlit, parallel computing, distributed computing, high performance computing, git, github, gitlab• Packages I am proficient in: RDKit, Pandas, Dask, Numpy, Numba, Multiprocessing, PyTorch, Scikit-learn, XGBoost, eQuilibrator, BioPython, COBRApy, NetworkX, Pymoo, Seaborn, Matplotlib, plotly, bayesian-optimization, chemprop, datamol