Data Engineer
Current- Developed Python code for processing images based on client’s requirements using the OpenCV-Python library and automated the pipeline for processing of images residing in one AWS S3 bucket and upload the processed.
- Implemented SQL queries to obtain a summary of the pricing history of client’s product and use the information to create a dashboard using the Python Dash library and Tableau. Twitter Sentiment Analysis:
- Demonstrated a machine learning pipeline where:
- Live tweets were extracted using Twitter API and processed using AWS Kinesis and stored in an S3 bucket.
- Exploratory Data Analysis (EDA) and Sentiment Analysis of the live tweets was performed on a Databricks platform using the Pyspark Machine learning library.
- The results of are stored in S3 bucket, tweet analysis data tables were created on Amazon Athena and tweet analysis dashboards were created using AWS QuickSight. Back Order Prediction: