Data Scientist Ii
Current● Saved $1.5 Million annually in vendor costs by forecasting traffic with xgboost (Python, Airflow)● Reduced financial planning time by 90% and improved sales forecast accuracy by 2% by creating automated time series forecasts for using pytorch and tensorflow (Python, Airflow, SQL, AWS) ● Won 4 awards for building source-of-truth reporting: data mart with 12 tables (130K queries/year) and 6 dashboards (58K views/year), including the 2 most used finance dashboards (SQL, Tableau) ● Increased annual sales 5% by building a structural equation model (SEM) to optimize marketing and inventory for each stage of digital and store conversion funnel (R, SQL)● Identified and led cross-functional team to add location data to traffic, identity resolution, and transaction pipelines, increasing geographic resolution up to 8M times, saving $100K annually in vendor costs; these data support models that contribute $75M in annual sales● Identified gaps in marketing and event data and built tables to fill the gaps; these data support models that contribute $64M in annual sales (SQL)● Built trust by managing data mart automation/security and API data extraction across 4 database migrations with 5 vendors and 4 teams (finance, engineering, product, analytics)● Optimized queries and pipelines to reduce run time and cost by up to 94% (SQL, Python, Airflow)● Reduced data pipeline failure rate by 40% by setting up data quality alerts (JSON, Pagerduty, SQL)● Audited board and quarterly earnings reports to ensure accuracy and consistent definitions (SQL)● Developed onboarding documentation used 800 times and SQL training used by 100-person team