N P Email and Phone Number

Data Engineer at Broadridge

New York, New York, United States

N P's Location

United States

About N P

With over 8 years of professional experience as a Data Engineer, I possess a strong track record in planning, developing, deploying, and maintaining data services. My expertise lies in conceptualizing and implementing data pipelines, leveraging analytical programming languages such as Python, Scala, Java, and SQL. I have successfully designed and executed comprehensive data pipelines and Hadoop-based analytics solutions, utilizing technologies including HDFS, MapReduce, Spark, Hive, Kafka, and more. Additionally, I excel in optimizing cloud performance, enhancing big data processing, and collaborating with cross-functional teams to design and implement end-to-end data pipelines, fostering data accessibility and efficiency. My proficiency extends to AWS Lambda, Spark development, ETL integration, and Cosmos DB design, among other areas, making me adept at addressing diverse data engineering challenges with precision and professionalism.

N P's Current Company Details

Broadridge


New York, New York, United States

Website: broadridge.com
Employees: 9823

N P Work Experience Details

Data Engineer

Broadridge Apr 2022 - Present

New York, United States

• Optimizing Cloud Performance: Implemented solutions to optimize cloud performance for handling high-volume data, potentially using cloud services like AWS.
• Collaboration with Business Process Managers: Collaborated as a subject matter expert to transform large data volumes and create analytical data products for BI reporting needs.
• AWS Lambda: Created functions and assigned roles in AWS Lambda for event-driven processing with Python and Java.
• Spark Development: Developed Spark applications using Scala, DataFrames, and the Spark SQL API for efficient big data processing.
• ETL Integration with Spark: Designed and implemented ETL integration patterns using Python on Spark, analyzing SQL scripts and using PySpark for implementation.
• Cosmos DB: Designed and implemented Cosmos DB database schemas for efficient storage and management of large-scale datasets.
• Data Pipelines: Constructed data pipelines using Spark, Hive, Sqoop, and custom input adapters for ingesting, transforming, and analyzing operational data.
• Spark-SQL and Hive: Utilized Spark-SQL to load JSON data, create Schema RDDs, and load data into Hive tables, possibly hosted on AWS.
• Data Integration and Real-time Processing: Utilized Spark for interactive queries, streaming data processing, and integration with NoSQL databases to handle large data volumes. Integrated Cosmos DB with Azure Stream Analytics for real-time data processing.
• AWS Glue: Designed and implemented ETL processes in AWS Glue to migrate data from external sources into AWS Redshift.
• Elasticsearch Integration: Developed ETL parsing and analytics using Python/Spark to establish a structured data model in Elasticsearch for API and UI consumption.
• IAM and Security: Utilized IAM for creating accounts, roles, groups, and policies for AWS resource management and integration with AWS services.
• Snowflake: Authored complex SnowSQL scripts in the Snowflake cloud data warehouse for business analysis and reporting.

Data Engineer

Homesite Insurance Nov 2019 - Mar 2022

Boston, Massachusetts, United States

• Implemented the Azure cloud Delta lakehouse platform infrastructure, including RBAC, using IaC (Terraform) following industry best practices.
• Collaborated with architecture and security teams to remediate infrastructure-related concerns and recommendations, specifically focusing on Azure Databricks, Azure PaaS, and Azure DevOps (VSTS).
• Designed and implemented end-to-end data pipelines using Azure Data Factory and Databricks, enhancing data accessibility and efficiency.
• Developed data models and schemas for Azure SQL Databases and Azure Cosmos DB, ensuring data consistency and query performance optimization.
• Constructed a scalable data warehouse using Azure Synapse Analytics, enabling efficient data analysis and reporting for business users.
• Implemented an automated data quality monitoring system using Azure Data Factory and Azure Functions.
• Executed ELT processes using Azure Data Factory and Databricks, leveraging big data technologies and distributed processing capabilities.
• Designed and implemented Change Data Capture (CDC) mechanisms to capture and synchronize real-time updates from source systems to Cosmos DB.
• Created pipelines in Azure Data Factory for data extraction, transformation, and loading from various sources, including Azure SQL and Blob storage.
• Developed data pipelines for Copy Activity in Azure Data Factory, enabling data movement and transformation for on-cloud ETL processing.
• Crafted Spark applications using PySpark and Spark-SQL for data extraction, transformation, and aggregation from various file formats, uncovering insights into customer usage patterns.
• Conducted knowledge-sharing sessions and created comprehensive documentation about Cosmos DB design, data models, and deployment procedures, facilitating team skill development and seamless knowledge transfer.

Data Engineer

Citrix May 2017 - Oct 2019

• Spark Application Development: Developed Spark applications in Scala for data cleansing, validation, transformation, and summarization, showcasing expertise in big data processing.
• Data Pipeline Construction: Created data pipelines using Spark, Hive, and Sqoop for ingesting, transforming, and analyzing operational data.
• Real-time Data Processing: Streamed real-time data using Spark and Kafka for handling web server console log data, demonstrating real-time data processing skills.
• Cloud Platform Expertise: Provided Infrastructure as a Service (IaaS) solutions on the Microsoft Azure cloud platform, highlighting cloud infrastructure knowledge.
• SQL and Database Development: Designed database objects using T-SQL and developed processes for incremental data imports, showing database development skills.
• Agile Collaboration: Collaborated within an Agile environment, utilizing tools like Jira and GitHub version control for continuous builds, emphasizing teamwork and agile methodologies.

Big Data Developer

Ceequence Technologies Jul 2015 - Feb 2017

Hyderabad, Telangana, India

• Data Extraction and Transformation: Extracted data from Oracle, DB2, and Teradata sources, performed transformations, and exported data as per business needs, demonstrating expertise in ETL processes.
• Hive and MapReduce Processing: Imported data from diverse sources, applied Hive and MapReduce transformations, loaded data into HDFS, and used Sqoop for extracting from relational databases.
• Hive Development: Created Hive tables, loaded data with HQL scripts, developed custom Java-based Hive UDFs, and designed Hive external tables with dynamic partitioning and buckets.
• Data Processing Frameworks: Utilized Apache Spark with Scala and Python, integrated Kafka for real-time data processing, and employed NoSQL databases like HBase for semi-structured data storage.
• Performance Optimization: Optimized performance with distributed caching, Hive partitioning, bucketing, and map-side joins, showcasing efficiency in big data processing.
• Scheduling and Workflow Management: Scheduled data ingestions and transformations using Oozie and Maestro, highlighting workflow management skills.

Frequently Asked Questions about N P

What company does N P work for?

N P works for Broadridge.

What is N P's role at the current company?

N P's current role is Data Engineer.

Who are N P's colleagues?

N P's colleagues are Bo Baribeau, Lakshna Ramakrishnan, Bob Russell, Swetha Molakala, Tejinder Banwait, Surya Kiran Reddy Kolli, Matshitso Mpho.

