João Souza Email and Phone Number
As a seasoned Data Engineer, I specialize in designing and implementing comprehensive data platforms that streamline the ingestion and processing of large data volumes. My expertise lies primarily in AWS technologies (I have worked with AWS since 2016), where I integrate Airflow, dbt, Spark, SQL, Python, AWS services, Snowflake, and Databricks to build robust platforms tailored to user needs. My approach combines a deep understanding of data engineering principles with the practical application of current technologies to solve complex data challenges. By creating scalable and efficient data pipelines, I enable organizations to unlock valuable insights from their data, supporting informed decision-making and strategic business growth. I am passionate about exploring innovative data solutions and continuously expanding my skill set to include the latest advancements in the field. My goal is to contribute to projects that not only meet but exceed user expectations, delivering high-quality and impactful data engineering solutions.
Specialist Data Engineer | Open Finance Brasil | State of São Paulo, Brazil
Specialist Data Engineer | Will Bank | Jan 2022 - Aug 2024
Focusing on data processing and consumption, I have been exploring solutions to enhance the performance of our data platform. By integrating AWS with lakehouse services such as Databricks and Snowflake, we aim to optimize our data management and analytics capabilities.
Data Engineer | Will Bank | Jan 2021 - Jan 2022
I was part of a team that develops data products, ingesting, processing, and making available a wide variety of data sources (internal and external APIs, SQL and NoSQL databases, FTP). I was the lead engineer on the data lake migration project, moving from a flow built on Pentaho Kettle and Redshift to a data lake structure in the AWS cloud. Our main data source is an Oracle database with more than a thousand tables and almost 2 terabytes of data; using AWS DMS, we brought all of this data into AWS S3 to create our Raw Zone. All processing between the Raw and Processed layers was performed on EMR (PySpark) and orchestrated by Airflow. The Processed Zone is the first accessible layer of the data lake, and we use AWS Athena as the SQL engine. In the Curated and Analytic layers, we implemented the data mesh concept, in which each business area is responsible for the rules of the entities within its own micro-universe of data. To give the business areas more autonomy, we use dbt (data build tool) as the framework to manage, orchestrate, and implement all tables up to the Analytic Zone.
Data Engineer | Cognitivo.Ai | Jun 2020 - Aug 2024
Working with large and medium-sized companies, I was responsible for the architecture and implementation of data lake projects. Using the AWS cloud, I structured ingestion services for APIs and for relational and non-relational databases (DMS, Lambda, Kinesis), data lake layer storage (S3), and the delivery of processed and enriched data to the business areas (EMR, Redshift, Snowflake, Athena, Presto). All services used in the projects were provisioned as code with Terraform, and I used Spark jobs (PySpark) for processing. In other projects I acted as Tech Lead Data Engineer, responsible for the technical team's deliveries: working closely with the customer, identifying needs, managing the backlog and deadlines, and designing solutions to a wide range of problems.
Data Engineer | Zoop | Jul 2019 - Jan 2021 | Rio de Janeiro and Region, Brazil
I was the data engineer responsible for the architecture and development of the data platform, and I helped build the data engineering team. Our data architecture was developed using AWS services such as DMS, ECS, S3, KMS, EMR (PySpark), Glue, Lambda, and Redshift. All processes were orchestrated by Airflow, and we maintained a data warehouse in Redshift, where we generated insights for the business areas.
Database Administrator / Data Engineer | M4U | Mar 2015 - Jul 2019 | Rio de Janeiro and Region, Brazil
Supported the development team with application deployments, query performance analysis, and reporting for business areas. Data lake: developed a batch data pipeline that used DMS to collect data from a Postgres database and store it in an S3 bucket as Parquet files, then used AWS Glue jobs to transform the data and generate insights for the product team. Developed a Python application to migrate clients from the legacy system to the new one.
Database Administrator | BTG Pactual | Dec 2012 - Mar 2015
DBA, Infrastructure and Development. Platforms: Oracle, SQL Server. Administration of SQL Server servers (versions 2000, 2005, 2008, and 2012). Automation of checklist processes, monitoring and maintenance of the production environment, trace analysis, query tuning, table partitioning, and execution of restores and backups. Installation and configuration of SQL Server stand-alone instances, clusters, Analysis Services, and Reporting Services. Administration of Oracle database servers. Management of users, schemas, roles, and locks. Development of PL/SQL and T-SQL procedures to streamline the work.
Siebel Intern | Bexpert | Sep 2011 - Dec 2012
BrasilCAP project: I was part of the project's Sadmin team. Administration of the Oracle Siebel CRM application, Siebel Tools, SRF compilation, repository merges, environment deployment, and patch application for version upgrades. Oracle BI Publisher and integration of the CTI toolbar with Siebel. Installation and configuration of Oracle Business Intelligence Enterprise Edition 11g (Informatica, Data Warehouse Administration Console, and BI Apps).
PL/SQL Intern | Sulamerica Ing | Mar 2010 - Sep 2011
Development of queries and procedures in Oracle. Proficiency with the PL/SQL and Oracle BI Discoverer tools. Assistance with requirements analysis and documentation for legal department projects, and follow-up of project execution with the software vendors. Support for the legal department's processes. Use of MS Project.
João Souza Education Details
Computer Science
Frequently Asked Questions about João Souza
What company does João Souza work for?
João Souza works for Open Finance Brasil.
What is João Souza's role at the current company?
João Souza's current role is Specialist Data Engineer (Engenheiro de Dados Especialista).
What schools did João Souza attend?
João Souza attended UERJ - Universidade do Estado do Rio de Janeiro and Instituto Infnet.