Yuan Huang

Yuan Huang Email and Phone Number

Machine Learning Architect | Analytics Lead | Software Lead | Principal Scientist, Data Engineer | Data Scientist @ Sion Power Corporation
tucson, arizona, united states
Yuan Huang's Location
Andover, Massachusetts, United States, United States
About Yuan Huang

10+ years of experience in leading and developing data pipeline, integrated data and software systems, and data science solutions. Solid hands-on experience in statistical analysis, machine learning and deep learning algorithms combined with strong programming, data engineering, Devops and MLOps skills as a certified AWS Machine Learning Specialty, Airflow DAG developer, Kubernetes Administrator and Jenkins developer. Excellent records of stakeholder management in building strong relationships with senior management and cross-functional teams and translating business processes to data science projects

Yuan Huang's Current Company Details
Sion Power Corporation

Sion Power Corporation

View
Machine Learning Architect | Analytics Lead | Software Lead | Principal Scientist, Data Engineer | Data Scientist
tucson, arizona, united states
Website:
sionpower.com
Employees:
59
Yuan Huang Work Experience Details
  • Sion Power Corporation
    Machine Learning Architect
    Sion Power Corporation Apr 2024 - Present
    Tucson, Arizona, United States
  • Zoetis
    Analytics Lead | Business Insights & Analytics (Bia), Zoetis Technology & Data
    Zoetis May 2022 - Dec 2023
    Boston, Massachusetts, United States
    ● Led R&D machine learning, AI, and data analytics projects with a team of 7 data scientists and software developers to deliver high quality data science products and solutions, using agile project management methodology● Translated business and scientific questions to machine learning, AI and data analytics projects, evaluated the feasibility and KPIs/metrics, and designed technical roadmaps of the projects by utilizing knowledge and experiences in NLP/LLM, time series, computer vision, supervised and unsupervised learning, and deep learning, as well as data engineering, software engineering, DevOps and MLOps.● Orchestrated the entire product lifecycle from ideation, feasibility exploration, Proof Of Concept (POC), launch, release, and adoption with technical standards in Python coding, unit tests, Github version control, and CI/CD for 8 projects within a span of 1.5 yearsHighlighted achievements:○ Developed production-level computer vision/segmentation analysis deep learning models (Pytorch/Pytorch-lightning, OpenCV and UNet for data processing and modeling, OOD for codebase design, Numpy, Pytest, github actions workflow for CI/CD, ML model package development, and FastAPI) to identify lesion regions on animal tissue images, reducing over 200 FTE hours annually, and resulting in more consistent evaluation of vaccine efficiency○ Completed and transferred a Python Scikit-learn predictive pipeline (starting from data cleaning/data dictionary definition to model deployment) for dairy cow diseases using Scikit-learn (Pandas, Numpy, Seanborn, Matplotlib, Random Forest, XGBoost, and Pipeline) to the Precision Animal Health (PAH) department○ Launched patient query, analysis, and visualization R Shiny web app/dashboard on Posit using Pyspark for big data query that accelerated the clinical recruitment process by at least one month
  • Vertex Pharmaceuticals
    Research Fellow, Comp Chem Software Lead | Modeling And Informatics
    Vertex Pharmaceuticals Jul 2021 - May 2022
    Boston, Massachusetts, United States
    ● Designed and developed MolProperty serverless AWS cloud computing system (AWS CDK, API Gateway, Lambda, Data API, and Aurora, Docker, OpenEye, Jchem, Pytest) for high performance molecular property calculation, storage, and query● Developed Python client package for computational chemists to query MolProperty computing system (Asyncio, Docopt, Pandas, Boto3 and Python package development)● Led cross-functional AWS modeling service project (including an external MLOps team) that includes CI/CD and model deployment for small molecule temporal predictive model training and inference pipelines. (JIRA, MLflow, Pytest, Docker, Python package development, Pandas, Scikit-learn, Random Forest, AWS lambda, Aurora, API gateway, CDK, Codebuild, Codepipeline, ECR, ECS, Sagemaker SDK, Cloudwatch)● Led DCS package Proof-of-Concept project by collaborating with the Software Engineering team● Hosted code reviews on design pattern, Python technology, and best coding practices
  • Bristol Myers Squibb
    Principal Scientist/Senior Manager, Data Engineering
    Bristol Myers Squibb Jan 2019 - Jul 2021
    Greater Boston
    ● Led the development of an antisense oligonucleotides bioinformatics calculation tool with computational biology and medicinal chemistry groups, reducing overnight processing to just 15 minutes utilizing R and bioconductor packages● Created and deployed data ingestion tools on AWS to query, download, transform, and store public RNA-Seq data from NCBI GEO data repository (Docker, Docker cli, shell script, Pandas, AWS EC2, ECR, ECS and S3).● Built and launched a data lake system as the central data catalog that integrated and managed a variety of data repositories (DynamoDB, Aurora MySQL, Redshift, S3 for Athena, data modeling for sql/non-sql and data warehouse). Developed the ETL data pipeline (Apache Airflow, AWS Step Functions, Lambda, API Gateway, GLUE, Sagemaker/PySpark, SAM, and Data Migration Service)● Designed and implemented front-end and back-end architectures of the Target Profiler web application that integrates genomic and omics data for data utilization and visualization to harmonize genomic data for 6 immunology diseases (AWS API gateway, Lambda, Athena, Aurora, data API, JavaScript/D3/crossfilter, HTML/CSS/bootstrap/Sass) ● Managed IBD clinical data from Crohn’s and Colitis Foundation and provided data mining support of EMR data● Led the development of a web application for fast data download of OpenTargets data by customized Presto Queries (AWS API gateway, Lambda, Athena, and R Shiny app). Initiated and led the POC test with Varada to further optimize query performance on big data
  • Tivo
    Staff Engineer, Data
    Tivo Aug 2018 - Jan 2019
    Greater Boston Area
    Data Science Group, Analytics and Advertising Engineering, Data Analytics R&D, Tivo• Performed large volume queries, data processing and wrangling for viewership data on AWS s3 using big data techniques (Athena, Presto query, Pymysql, Quoble, Pyspark and Pandas). (see my github for code demo)• Developed python scripts and Jupyter notebooks for time based and program based viewership data quality checking using average audience (AA), rating and data visualization (presto query, pandas and matplotlib). The scripts automatically transfer and store the generated plots and dataframes to AWS s3
  • Andover Innovative Medicines Institute (Aim), Eisai
    Data Scientist/Scientist | Bioinformatics
    Andover Innovative Medicines Institute (Aim), Eisai Apr 2012 - Aug 2018
    Andover, Ma 01810
    Scientist, bioinformatics/Data Scientist | Human Biology and Data Science Engine • Performed regression and classification analysis (linear regression, random forest and XGBoost), clustering analysis (hierarchical clustering and principal component analysis), association study, and hypothesis tests for RNA sequencing data using R (ggplot2, dplyr) and Python scientific packages (pandas, scipy.stats, statsmodels, seaborn and matplotlib)• Performed cleaning, alignment, and transformation of RNA-Seq data on DNAnexus cloud computing platform (Linux Shell, DNAnexus toolkit for cloud computing).• Implemented interactive R notebook (tidyr, ggplot2, dplyr, plotly, DT) and Shiny apps for data summary and visualization• Modified and maintained ETL data pipelines and database for high throughput DMPK data loading and storage (pipeline pilot, Oracle database, toad and SQL).• Created and built a Quality by Design (QbD) framework and software for fast liquid chromatography method development based on Design Of Experiment (DOE) and computation simulation, and published the work as a journal paper• Released VBA applications to DMPK department for the automatic processing and analysis of high-throughput microsomal stability screening assay that reduced processing time by more than 10 times• Created and executed scripts (pipeline pilot and SQL) for extracting, transferring, updating and integrating pharmacokinetics data from Japan database (PostgreSQL) to U.S. site (D360 and Oracle), which reduced data delay time from months to 24 h .• Designed and developed a cost calculation system to calculate and visualize the costs of synthetic routes (Java, MySQL, and Javascript) for the process chemistry group (Bio-IT World Conference 2015)• Served as a core technical team member of Allotrope Foundation; led the Proof-of-Concept team at Eisai for the installation and testing of Allotrope Proof-of-Concept software applications and Allotrope Data Format (ADF) converters
  • University Of Minnesota
    Research Associate
    University Of Minnesota Nov 2008 - Mar 2012
    Greater Minneapolis-St. Paul Area
    • Optimized 2D-LC instrument operational conditions using computer simulations including Monte Carlo simulation• Developed a MATLAB program for processing and visualizing on-line 2D-LC experimental data

Yuan Huang Education Details

Frequently Asked Questions about Yuan Huang

What company does Yuan Huang work for?

Yuan Huang works for Sion Power Corporation

What is Yuan Huang's role at the current company?

Yuan Huang's current role is Machine Learning Architect | Analytics Lead | Software Lead | Principal Scientist, Data Engineer | Data Scientist.

What schools did Yuan Huang attend?

Yuan Huang attended University Of Arizona, The University Of Texas At El Paso, Chinese Academy Of Sciences, Nanjing University Of Technology.

Who are Yuan Huang's colleagues?

Yuan Huang's colleagues are Lawrence Weinstein, Hector Mendoza, Leonor Vargas, Max Jimenez, Fairy Girl, Apolo J.r., Greg Lowe.

Not the Yuan Huang you were looking for?

  • Yuan Huang

    Engagement Manager At Mckinsey & Company
    New York, Ny
    4
    phila.gov, mckinsey.com, mckinsey.com, generationcitizen.org

    4 +121556XXXXX

  • Yuan H.

    Sr Scientist At Keystone Strategy - Core Ai | Ex-Amazon | Ph.D. In Economics
    Greater Seattle Area
  • Yuan Huang

    New York, Ny
    3
    fb.com, facebook.com, gmail.com

    1 (855) 6XXXXXXX

  • Yuan Huang

    New York City Metropolitan Area
    4
    gmail.com, goldmansachs.com, gs.com, blackstone.com

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.