N P

N P Email and Phone Number

new york, new york, united states
N P's Location
United States, United States
About N P

With over 8 years of professional experience as a Data Engineer, I possess a strong track record in planning, developing, deploying, and maintaining data services. My expertise lies in conceptualizing and implementing data pipelines, leveraging analytical programming languages such as Python, Scala, Java, and SQL. I have successfully designed and executed comprehensive data pipelines and Hadoop-based analytics solutions, utilizing technologies including HDFS, MapReduce, Spark, Hive, Kafka, and more. Additionally, I excel in optimizing cloud performance, enhancing big data processing, and collaborating with cross-functional teams to design and implement end-to-end data pipelines, fostering data accessibility and efficiency. My proficiency extends to AWS Lambda, Spark development, ETL integration, and Cosmos DB design, among other areas, making me adept at addressing diverse data engineering challenges with precision and professionalism.

N P's Current Company Details
Broadridge

Broadridge

View
k
new york, new york, united states
Website:
broadridge.com
Employees:
9823
N P Work Experience Details
  • Broadridge
    Data Engineer
    Broadridge Apr 2022 - Present
    New York, United States
    • Optimizing Cloud Performance: Implemented solutions to optimize cloud performance for handling high-volume data, potentially using cloud services like AWS.• Collaboration with Business Process Managers: Collaborated as a subject matter expert to transform large data volumes and create analytical data products for BI reporting needs.• AWS Lambda: Created functions and assigned roles in AWS Lambda for event-driven processing with Python and Java.• Spark Development: Developed Spark applications using Scala, DataFrames, and the Spark SQL API for efficient big data processing.• ETL Integration with Spark: Designed and implemented ETL integration patterns using Python on Spark, analyzing SQL scripts and using PySpark for implementation.• Cosmos DB: Designed and implemented Cosmos DB database schemas for efficient storage and management of large-scale datasets.• Data Pipelines: Constructed data pipelines using Spark, Hive, Sqoop, and custom input adapters for ingesting, transforming, and analyzing operational data.• Spark-SQL and Hive: Utilized Spark-SQL to load JSON data, create Schema RDDs, and load data into Hive tables, possibly hosted on AWS.• Data Integration and Real-time Processing: Utilized Spark for interactive queries, streaming data processing, and integration with NoSQL databases to handle large data volumes. Integrated Cosmos DB with Azure Stream Analytics for real-time data processing.• AWS Glue: Designed and implemented ETL processes in AWS Glue to migrate data from external sources into AWS Redshift.• Elasticsearch Integration: Developed ETL parsing and analytics using Python/Spark to establish a structured data model in Elasticsearch for API and UI consumption.• IAM and Security: Utilized IAM for creating accounts, roles, groups, and policies for AWS resource management and integration with AWS services.• Snowflake: Authored complex Snow SQL scripts in the Snowflake cloud data warehouse for business analysis and reporting.
  • Homesite Insurance
    Data Engineer
    Homesite Insurance Nov 2019 - Mar 2022
    Boston, Massachusetts, United States
    Implemented Cloud (Azure) Delta lake house platform infrastructure, including RBAC, using IAC (Terraform) following industry best practices.Collaborated with architecture and security teams to remediate infrastructure-related concerns and recommendations, specifically focusing on Azure Databricks, Azure PaaS, and Azure DevOps (VSTS).Designed and implemented end-to-end data pipelines using Azure Data Factory and Databricks, enhancing data accessibility and efficiency.Developed data models and schemas for Azure SQL Databases and Azure Cosmos DB, ensuring data consistency and query performance optimization.Constructed a scalable data warehouse using Azure Synapse Analytics, enabling efficient data analysis and reporting for business users.Implemented an automated data quality monitoring system using Azure Data Factory and Azure Functions.Executed ELT processes using Azure Data Factory and Databricks, leveraging big data technologies and distributed processing capabilities.Designed and implemented Change Data Capture (CDC) mechanisms to capture and synchronize real-time updates from source systems to Cosmos DB.Created pipelines in Azure Data Factory for data extraction, transformation, and loading from various sources, including Azure SQL and Blob storage.Developed Data Pipelines for Copy Activity in Azure Data Factory, enabling data movement and transformation for on-cloud ETL processing.Crafted Spark applications using PySpark and Spark-SQL for data extraction, transformation, and aggregation from various file formats, uncovering insights into customer usage patterns.Conducted knowledge-sharing sessions and created comprehensive documentation about Cosmos DB design, data models, and deployment procedures, facilitating team skill development and seamless knowledge transfer.
  • Citrix
    Data Engineer
    Citrix May 2017 - Oct 2019
    • Spark Application Development: Developed Spark applications in Scala for data cleansing, validation, transformation, and summarization, showcasing expertise in big data processing.• Data Pipeline Construction: Created data pipelines using Spark, Hive, and Sqoop for ingesting, transforming, and analyzing operational data.• Real-time Data Processing: Streamed real-time data using Spark and Kafka for handling web server console log data, demonstrating real-time data processing skills.• Cloud Platform Expertise: Provided Infrastructure as a Service (IaaS) solutions on the Microsoft Azure cloud platform, highlighting cloud infrastructure knowledge.• SQL and Database Development: Designed database objects using T-SQL and developed processes for incremental data imports, showing database development skills.• Agile Collaboration: Collaborated within an Agile environment, utilizing tools like Jira and GitHub version control for continuous builds, emphasizing teamwork and agile methodologies.
  • Ceequence Technologies
    Big Data Developer
    Ceequence Technologies Jul 2015 - Feb 2017
    Hyderabad, Telangana, India
    • Data Extraction and Transformation: Extracted data from Oracle, DB2, and Teradata sources, performed transformations, and exported data as per business needs, demonstrating expertise in ETL processes.• Hive and MapReduce Processing: Imported data from diverse sources, applied Hive and MapReduce transformations, loaded data into HDFS, and used Sqoop for extracting from relational databases.• Hive Development: Created Hive tables, loaded data with HQL scripts, developed custom Java-based Hive UDFs, and designed Hive external tables with dynamic partitioning and buckets.• Data Processing Frameworks: Utilized Apache Spark with Scala and Python, integrated Kafka for real-time data processing, and employed NoSQL databases like HBase for semi-structured data storage.• Performance Optimization: Optimized performance with distributed caching, Hive partitioning, bucketing, and map-side joins, showcasing efficiency in big data processing.• Scheduling and Workflow Management: Scheduled data ingestions and transformations using Oozie and Maestro, highlighting workflow management skills.

Frequently Asked Questions about N P

What company does N P work for?

N P works for Broadridge

What is N P's role at the current company?

N P's current role is k.

Who are N P's colleagues?

N P's colleagues are Bo Baribeau, Lakshna Ramakrishnan, Bob Russell, Swetha Molakala, Tejinder Banwait, Surya Kiran Reddy Kolli, Matshitso Mpho.

Not the N P you were looking for?

  • N P

    Cloud Engineer At Western Union
    San Francisco Bay Area
  • N P

    Data Analytics And Business Consulting | Python, Etl, Ml, Tableau, Snowflake, Gcp|
    Annandale, Va
  • N P.

    Java Full-Stack Developer At Usaa
    San Antonio, Tx
  • N P.

    Chief Operating Officer, Co-Founder | Business Solutions, Operations Management
    United States
  • N P

    Fremont, Ca

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.