Data Scientist
CurrentAs a Data Scientist, I've played a role with specialized focus on data engineering, leveraging expertise in various programming languages and technologies, I am dedicated to create, improve and optimizing data workflows to drive business value.Development and processing baselines: particularly adept at establishing baselines for software and processes development focused on data lifecicle.- Kubernetes/Docker contenerized microservices- CI/CD pipelinesInhouse ETL tools: using my expertise on programming, initialize the creation of a inhouse Python library specialized on generic and configurable ETL processes, based on business needs. Some of the library modules include:- AWS integration- SQL and NoSQL databases generic import/export- Batch and parallel massive processingRecommender engine: collaborated with cross-functional teams to define and developed an event based inhouse recommender engine, communicating on-prem tools/processes with cloud event driven environment.- AWS usage: Kinesis, DynamoDB, S3, SageMaker, Lambda, EventBridge, Step Functions- On-premise tech: MongoDB, Kafka, custom Python processesDatabase modeling/management: involved in MongoDB, Athena and Redshift databases modeling, focused on improving data read and write.- Optimizing services interacting with raw and processed data- Improving data accesibility securely