Data Engineer
Current- Created various text semantics APIs using CherryPy and tested them using Postman API testing
- Performed text-based feature extraction on product information to match products across multiple e-commerce websites using various clustering algorithms and ranking them using tf-idf weights, using Python and Solr.
- Benchmarked the various algorithms and now working towards scaling it using Apache Spark with AWS