I am a data scientist/statistician, and my research projects cover causal inference, clustering, spatial statistics, and classification, with applications to areas such as clinical trials, experimental design, time series modeling, and forensics.I have research and application experience in statistics and ML, in areas like public health, biotech, agriculture, consulting, and utilities.I am open to both full-time and contract statistician/data scientist positions.I also have Java development experience not in my profile, but I am open to Java developer positions or statistics + Java positions.
-
Statistician InternThe National Institutes Of Health Jun 2022 - Aug 2022Bethesda, Md, UsWrite an R package to analyze, visualize, and interpret Nanostring data (including calculating DEG, conducting GSVA/GSEA, and performing GO enrichment analysis/KEGG pathway analysis)Write a UI in R Shiny to more easily access/use the R package -
Statistician InternThe National Institutes Of Health Sep 2020 - Nov 2020Bethesda, Md, UsExtracted count data from several types of sequencing data using the TopHat-Stringtie-Ballgown pipeline and the GDC pipeline for genomic data analysis (e.g. RNA-seq, DNA-seq)Wrote statistical programs in R to help scientists identify about 10+ relevant genes from pool of about 1000+ genes to liver cancer treatment effectsWrote statistical analyses in R to identify about 100 genes from a pool of 3000 genes that influence lung cancer treatment effectsAnalyzed and interpreted predictive models for principal investigators to understand experiment results. -
Graduate StudentNorth Carolina State University Sep 2015 - Jan 2020Raleigh, North Carolina, UsResearch Projects:- Self-driving RC Cars, Jan 2019-May 2019Trained self-driving remote control cars using RL methods including SAC, DDPG, and PPO using Python (pytorch, simulation in Unity).-Mixed Model Thompson Sampling for Multi-Armed Bandit Problems, Mar 2018-May 2018Adapt mixed model Thompson sampling methods to contextual bandit problems with spillover rewards and inter-arm interactions; performance then compared to UCB methods, giving around a 5% improvement in regret.- Human Trafficking Detection with Recurrent Neural Networks, Jan 2018-May 2018Augmented a text-based NN model for detecting human traffickers with a CNN model in Python (pytorch) to detect potential victims of human trafficking (trained on roughly 2k images for roughly 1k ads).- Zillow Price Error Prediction, Aug 2017 - Oct 2017Predicted real estate price errors using a deep neural network regression model in Python (keras with tensorflow backend) based on various home data (including tax data and physical details)- Modeled disease progression in tobacco plants using logistic regression in R.Compared the model for spread of disease in South Carolina to one developed for farms in North Carolina, and suggested potential causes of differences -
Research AssistantNorth Carolina State University Sep 2017 - Dec 2017Raleigh, North Carolina, UsForensic Geolocation using Deep Learning ClassificationBuilt a predictive spatial classification deep neural network model to trace the origins of dust samples based on genetic signatures (roughly 10k signatures for about 200 locations). Proposed model achieved an accuracy of roughly 60%. -
Teaching AssistantNorth Carolina State University Sep 2015 - Sep 2015Raleigh, North Carolina, UsFall 2015 semester: Statistics by ExampleSpring 2016 semester: Statistics by ExampleFall 2016 semester: Fundamentals of Linear Models and RegressionSpring 2017 semester: Fundamentals of Statistical Inference IIFall 2018-Fall 2019: Introduction to SAS ProgrammingSpring 2020 semester: Introductory Statistics for Engineers• Tutored students in basic statistical concepts and diagnostic methods for linear regression.• Assisted professors with classroom instruction and record-keeping.• Met with students during weekly office hours. -
Data Science InternFirst Analytics Jun 2019 - Aug 2019Cary, North Carolina, UsDeveloped time series model in Python to forecast monthly demand for inventory control of 100+ food products at 1000+ locations. Created new predictive models of sales volume from sales history in Python and SQL to improve performance from existing model by roughly 2% -
Data Science InternXylem Inc. Jun 2018 - Aug 2018Washington, District Of Columbia, UsDeveloped and tested fundamental data analysis framework (includes linear models, activation functions, and time series models) in Python for the data science group to solve data problems.Modeled and predicted daily water usage data for commercial, residential and industrial sites in a city using ARIMA time series models in Python as test case for new data analysis framework. -
Application DeveloperBraindx, Llc Dec 2014 - Apr 2015Suwanee,, Ga, UsAdded color palette for visualizing brain measurements and loading bar features to brain imaging software using Java and MATLAB -
Undergraduate Teaching Assistant Of Acm95 - Introductory Methods Of Applied MathematicsCalifornia Institute Of Technology Oct 2013 - Mar 2014• Tutored students in complex analysis and methods of solving differential equations (including Laplace transforms, Fourier transforms, and Green’s functions).• Assisted professors with classroom instruction and record-keeping.• Prepared presentations for lectures and deliver lectures.• Met with students during weekly office hours.• Created and wrote materials such as visual aids, supplementary notes, and sample problems.
-
Research AssistantCalifornia Institute Of Technology Jun 2013 - Aug 2013• Applied Bayesian statistics to social science studies.• Developed Python programs for analyzing decision and reaction time data for social science studies using Hierarchical Bayesian estimation of the Drift-Diffusion Model.• Presented research results to Caltech students and professors.
-
Undergraduate Teaching Assistant Of Acm95 - Introductory Methods Of Applied MathematicsCalifornia Institute Of Technology Jan 2013 - Mar 2013• Tutored students in methods of solving differential equations (including Laplace transforms, Fourier transforms, and Green’s functions).• Assisted professors with classroom instruction and record-keeping.• Prepared presentations for lectures and deliver lectures.• Met with students during weekly office hours.• Created and wrote materials such as visual aids, supplementary notes, and sample problems.
-
Research AssistantQueen'S University Jul 2012 - Aug 2012Kingston, On, Ca• Used MATLAB’s Stateflow and Simulink modules to write programs for experimental trials for measuring an individual’s hand-eye coordination using specific equipment in the lab (KINARM)• Reported results to professor.
Benjamin Hu Education Details
-
CaltechComputational And Applied Mathematics -
North Carolina State UniversityStatistics
Frequently Asked Questions about Benjamin Hu
What is Benjamin Hu's role at the current company?
Benjamin Hu's current role is Actively looking for Data Scientist/Statistician position.
What schools did Benjamin Hu attend?
Benjamin Hu attended Caltech, North Carolina State University.
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial