Joseph M.

Senior Data Engineer @ Netflix
New York, NY, US

About Joseph M.

Over the last decade, I've built highly scalable distributed data platforms and helped companies scale to processing multiple exabytes of data. My mission is to bring software practices followed by top tech companies to data engineering and help data engineers level up.

I help data engineers land high-paying tech jobs and significantly upskill themselves.

If you want to learn more, you can join my newsletter:
→ GO HERE: https://www.startdataengineering.com/news-letter/

If you want to join my free Data Engineering 101 Program:
→ GO HERE: https://www.startdataengineering.com/email-course/

Twitter: @startdataeng
YouTube: @startdataengineering

Joseph M.'s Current Company Details
Netflix
Senior Data Engineer
New York, NY, US
Joseph M. Work Experience Details
  • Netflix
    Senior Data Engineer
    Netflix
    New York, NY, US
  • Startdataengineering
    Data Engineer & Founder
    Startdataengineering Mar 2020 - Present
  • LinkedIn
    Senior Data Engineer
    LinkedIn May 2022 - Mar 2024
    Sunnyvale, CA, US
    * Worked on migrating data pipelines from a legacy in-house system to Azure Databricks, reducing data latency from 24h to 6h.
    * Designed and built data quality systems to proactively catch issues, reducing user-reported issues by 60%.
  • Narrativ
    Senior Software Engineer, Data
    Narrativ Jul 2019 - May 2022
    New York, NY, US
    * Designed and built source-of-truth fact and dimension tables and ELT infrastructure with Fivetran and dbt. Established best practices and conventions, enabling other teams to build their data marts. This enabled data freshness monitoring, common source-of-truth tables, CI/CD for data pipelines, and better data quality tests.
    * Designed and built a data validation system using Great Expectations and k8s tasks to enable API-driven validation of data, providing results via a UI. This led to a reduction in engineering hours spent from about 3 days per sprint to less than 15 min of work for the end user (non-engineer).
    * Updated the product inventory ingestion data pipeline to use Snowflake instead of Postgres to get the new/updated product data. This led to a reduction in data processing time from about 2 hours to less than 10 min.
    * Designed and built a data pipeline factory in Airflow, with config metadata that can be modified via REST APIs. This reduced engineering hours spent from 2 days per sprint to less than 10 min of work for the end user (non-engineer).
    * Worked on real-time data processing and enrichment of clickstream events in Apache Storm with AWS DynamoDB to prevent over- or under-spending on the allocated budget. This enabled other systems to have accurate, up-to-date spend amounts.
    * Designed and built a cache data structure on Redis that enables fast lookup of bid metrics based on attributes of a click. The cache was refreshed via Airflow. This feature led to an increase in client spend of about 5 million in the first 2 months.
    * Designed and built a data pipeline to consolidate similar products without using unique IDs, using Word2Vec, Spark, Snowflake, and Airflow. This enabled bidding on additional products per auction, leading to an increase in spending of 9%.
  • Annalect
    Senior Data Engineer
    Annalect Feb 2018 - Jun 2019
    New York, NY, US
    * Designed and built data models for different types of TB-scale data, such as geolocation, clickstream, purchase, and viewership data.
    * Designed and built data pipelines using Spark. This reduced data processing time from about 7h to less than 1h.
    * Designed and built APIs to enable application users to send data to multiple partners. This provided users with one central control platform instead of manually uploading data into each individual partner portal.
    * Worked on migrating data from AWS Redshift to a properly partitioned dataset on S3. This enabled the use of AWS Redshift Spectrum, reducing warehouse cost by about 80%.
    * Set up Apache Airflow for data pipeline orchestration and scheduling, leading to a reduction in data freshness and correctness issues of about 75%.
    * Worked on a DSL to allow application users to join datasets visually. This enabled end users to perform complex joins and aggregations.
  • Hudson Data
    Data Scientist
    Hudson Data Apr 2016 - Feb 2018
    • Led a project to automate ETL pipelines for high availability of data, significantly reducing wait time for analyses.
    • Developed a graph analysis algorithm to capture fraud signals, resulting in savings of 5M dollars.
    • Designed and deployed an ML pipeline in Python to help clients take immediate action on possibly fraudulent policies in the vehicle insurance industry.
    • Reduced the execution time of a query from 45s to <1s by denormalizing the data.
  • Indus Valley Partners
    Associate Software Developer
    Indus Valley Partners Feb 2015 - Mar 2016
    New York, NY, US
    • Developed software for hedge fund clients focusing on the commercial real estate sector, using C#, ASP.NET, TortoiseSVN, and MS SQL.
    • Developed custom applications using AngularJS.
  • New York City Transit
    College Aide
    New York City Transit Jun 2014 - Dec 2014
    New York, NY, US
    • Built a web application for MTA employees using Java, JavaScript, and the Jetty servlet engine.
    • The web application incorporated multiple Python scripts, Oracle stored procedures, and C++ executable files.
    • The application was built on the Hibernate framework.
  • Polaris Financial Technology Limited
    Consultant
    Polaris Financial Technology Limited Oct 2012 - Jun 2013
    Chennai, IN
    • Worked on a website for the retail banking sector using technologies such as Java, EJB, JSP, and JavaScript.
    • Developed an automated error log system for the entire project using Java and log4j.

Joseph M. Skills

Java, MATLAB, C++, SQL, C, Hadoop, JavaScript, Python, MapReduce, Machine Learning, Hive, Algorithms, HTML, Big Data, GitHub, Eclipse, Ajax, ASP.NET, Databases, Software Development, R, CSS, LabVIEW, Apache Pig, AngularJS, C#, Data Structures, .NET, Spark, MSSQL

Joseph M. Education Details

  • Columbia University
    Electrical Engineering
  • Madras Institute of Technology Campus
    Electronics and Communications Engineering
  • St. Thomas MHSS

Frequently Asked Questions about Joseph M.

What company does Joseph M. work for?

Joseph M. works for Netflix.

What is Joseph M.'s role at the current company?

Joseph M.'s current role is Senior Data Engineer.

What schools did Joseph M. attend?

Joseph M. attended Columbia University, Madras Institute of Technology Campus, and St. Thomas MHSS.

What are some of Joseph M.'s interests?

Joseph M. is interested in Robo Investing, Process Driven Business, Data Science, Algorithms, Exciting Technologies, Health, Net, Asp, Signal And Information Processing, and Health And Fitness.

What skills is Joseph M. known for?

Joseph M. has skills like Java, MATLAB, C++, SQL, C, Hadoop, JavaScript, Python, MapReduce, Machine Learning, Hive, and Algorithms.
