Data Scientist
Current
- Built and implemented a Spanish-English machine translation system using a sequence-to-sequence model with attention to improve BLEU score; researched and implemented a pointer network to generate summaries based on.
- Built and deployed a facial recognition pipeline on Docker; implemented Faster R-CNN, with its region proposal network (RPN), and MTCNN for efficient object detection; researched and built FaceNet for facial recognition, improving the triplet loss.
- Devised a methodology for calculating the website traffic attributable to credit card advertising; implemented a contextual bandit UCB algorithm to optimize ad performance.
- Developed query understanding and rewriting to predict user intent; increased user click-through rate using Word2Vec (CBOW) and improved ad ranking by matching user preferences to financial products.
- Defined metrics and used geo experiments to estimate the causal effects of COVID-19 product feature changes on user experience; analyzed power and modeled network effects to decide the unit of randomization; designed.
- Built data pipelines that pre-aggregate users across experiments and treatment groups and pre-aggregate metric data.