I use data to derive insights that can improve the quality of life.By training, I am a statistician specializing in high dimensional data (i.e. number of parameters >> number of subjects) from biology, bioinformatics, epidemiology and to a more limited experience with clinical trials and finance. My software of choice is R (a free and powerful open source software), Unix tools (bash, awk, sed etc) and I am experienced with machine learning algorithms. I am learning python and deep learning techniques.Some of my areas of expertise:* Data cleaning, management and transformation* Exploratory data analysis of very large datasets* R programming* Clustering or segmentation* Linear reression (e.g. correlations, GLMs, ANOVA)* Hypothesis or A/B testing (e.g. t-test, proportion test, Chi-square, p-value, confidence intervals, etc)* Meta-analyses techniques (e.g. inverse-variance techniques, test of heterogeneity)* Machine learning techniques (e.g. support vectors machines, linear discriminant analysis, knn, decision trees etc)* Visualizations (e.g. boxplots, heatmaps)
Listed skills include Bioinformatics, R, Genetics, Genomics, and 20 others.