specializing in ETL pipelines for big data projects and data analysis. Experienced with all stages of the development cycle for complex data loads and predictive analysis projects. I took several data science courses at Applied AI Course: Python programming, Machine Learning, Linear Algebra, Probability and Statistics, and Natural Language Processing. My curriculum prepared me well for the fundamentals of data science (i.e., computer science, statistics, and machine learning). I am technology-savvy and mathematically equipped, get a good kick out of analyzing data, and have solid hands-on experience processing big data using distributed frameworks (Spark, Scala, Python), a data warehouse tool (Snowflake), distributed databases (Cassandra, HBase), and distributed file systems (HDFS, Alluxio).

Machine Learning knowledge in:
• Unsupervised algorithms
- K-means clustering
- Hierarchical clustering
- DBSCAN
• Dimensionality reduction and visualization tools
- PCA
- t-SNE
• Supervised algorithms
- Naive Bayes
- Logistic Regression
- Linear Regression
- SVM
- Tree-based algorithms: Decision Trees, Random Forest, AdaBoost, Gradient Boosting Machine
• Deep Learning
- Multi-layer perceptron (MLP)
- Convolutional neural network (CNN)
- Recurrent neural network (RNN)
- Time Series Analysis
• Statistics
- Exploratory data analysis using statistics tools
- Probability theory
- Distribution functions

Programming and framework knowledge in:
• Programming languages
- Python
- Scala
- Shell scripting
- SQL
• Big data frameworks
- Spark
- Snowflake
- Hadoop
- Databricks
- NumPy
- SciPy
- Pandas
- Matplotlib
- NLTK
- TensorFlow
- Keras
• Big data filesystems
- HDFS
- Azure Blob
• Databases
- PostgreSQL
- SnowSQL
- HBase
- Hive
- Cassandra
• Schedulers
- Airflow
• Cloud
- Azure

SDE
Nike, Jul 2022 - Present
India

Senior Data Scientist
Accion Labs, Jul 2019 - Jul 2022
Bengaluru, Karnataka, India
• Envisioned, architected, and implemented a generic Azure data pipeline and Spark-based model for batch processing of complex enterprise-level retail data, saving $5000 per month.
• The pipeline reduced effort by 70% compared with the previous architecture for all loads.
• Sourced, analyzed, processed, validated, transformed, aggregated, and distributed data from more than five sources using the generic Azure data pipeline.
• Configured and developed an automated Databricks Spark replication system from PostgreSQL to Snowflake for 100 tables, which improved data loading speed by 75%.
• Developed and maintained a reporting tool to ensure 100% of errors were recorded and reported.
• Used Airflow to orchestrate the ETL solution, which helped improve the conversion rate by 20%.
• Mentored the team on, documented, and maintained best practices for Git usage in the data engineering project.
• Awarded the prestigious "Team Marvel" award for a zero-defect go-live of a complex feature.
• Received the "Customer Focus" award for architecting a complex data load.
• Received a customer accolade for my contribution on their LinkedIn page.

Big Data Developer
Reni Analytics Inc., Sep 2018 - Jul 2019
Bengaluru, Karnataka, India
• Implemented a predictive analysis project called 'Cell Site Degradation (CSD)' that trained an ML algorithm to collate raw network and weather data and predict network site degradation, resulting in a 32% increase in revenue.
• Helped design and implement a shipment tracking and alerting product called 'Dice-Platform' using Scala Spark, Hadoop, and Kafka, improving operating efficiency by 80%.
• Built an algorithm to match vendor delivery quotations against shipment tracking and alert on deviations, with 100% accurate results.
• Built an end-to-end, user-configurable CI/CD pipeline that was adopted by more than 20 projects in the company.
• Developed monitoring and alerting capabilities to ensure 100% of data pipelines were working.
• Implemented a 'Log Analyzer' using the Spark framework, Elasticsearch, and Kibana.

Big Data Engineer
Intellifour Software Pvt Ltd, Aug 2016 - Sep 2018
Bengaluru Area, India
• Implemented and automated a distributed ETL system for smart data access that aggregates call traffic over a network in a particular area at 15-minute, half-hour, weekly, and monthly intervals, which reduced manual monitoring effort by 24 hours.
• Created an alerting system and dashboard, and monitored them to ensure 100% of data was processed and transferred on time.
• Participated in Agile planning for 40+ data feature requests.
• Identified and tuned away a tombstone issue in Cassandra, speeding up inserts and updates by 80% for 10M transactions per day.

Umesh . Education Details
Electrical And Electronics Engineering
Frequently Asked Questions about Umesh .
What company does Umesh . work for?
Umesh . works for Nike.
What is Umesh .'s role at the current company?
Umesh .'s current role is SDE.
What schools did Umesh . attend?
Umesh . attended Visvesvaraya Technological University.
Who are Umesh .'s colleagues?
Umesh .'s colleagues are Ting Ting Low, Patrick Felipe, Caleb Llaban, Sabrina P., Brandon Everett, Dylan Heslop, Teekay Biti.