"Results-driven Senior ELK/Big Data Engineer with a proven record of architecting and deploying high-performance data solutions. Specializing in Hadoop, ELK (Elasticsearch, Logstash, Kibana), CI/CD, and SIEM, I bring expertise in a range of technologies to optimize data ecosystems. Proficient in managing and processing large datasets using technologies such as Apache Spark, Apache Flink, and Apache Kafka.Demonstrated success in designing and implementing end-to-end data pipelines, ensuring seamless integration and efficient data flow. Skilled in infrastructure orchestration using tools like Docker and Kubernetes, with a focus on scalability and reliability.My experience extends to cloud platforms such as AWS, Azure, and Google Cloud, enabling me to architect and deploy solutions that harness the power of cloud-native technologies. Familiarity with data governance, metadata management, and data quality assurance further strengthens my ability to deliver robust and compliant data architectures.Experienced Data Engineer with of leadership in team management and the creation of extensive data solutions for both on-premise and cloud environments. Proficient in designing data-driven solutions using cloud technologies such as Dataflow, BigQuery, Cloud Composer, Data Fusion, and more. Specialized in adapting to dynamic business requirements, delivering impactful, scalable, and high-performance data solutions to clients across diverse domains and geographies.Skills:📜 Programming: Python, Java, SQL📁 Big Data Stack: Hadoop, Spark, Sqoop, Hive, Hbase, Airflow, Python, Hue📗 Blockchain: Hyperledger, Ethereum, Quoram🏬 Warehouses: Teradata, BigQuery, Redshift, Oracle⚙️ 🔧 Tools/Frameworks: Power BI, Azure ML, Qlik ReplicateDevOps: Bamboo, Bitbucket, Jenkins, Maven, Tekton, Kubernetes, Ansible📈 ML: scikit-learn, Tensorflow, Keras, NLTK, OpenCV🏃 Monitoring & Scheduling: ELK, Grafana, Airflow, Control-M
-
Sr. Elk And Devops EngineerCi FinancialWallington, Nj, Us -
Sr. Elk/Devops EngineerCi Financial Aug 2020 - Present.Evaluating the current state of an organization's data systems, pinpointing areas for enhancement, and incorporating considerations related to Hadoop, ELK, CI/CD, and SIEM..Identifying and assessing innovative technologies and solutions for the modernization of data systems, integrating insights from Hadoop, ELK, CI/CD, and SIEM perspectives..Crafting a comprehensive data modernization strategy that aligns seamlessly with the overarching business goals, with a specific focus on Hadoop, ELK, CI/CD, and SIEM implementations..Executing seamless data migration from legacy systems to contemporary platforms, emphasizing data integrity and security throughout, with attention to Hadoop, ELK, CI/CD, and SIEM implications..Formulating and implementing robust data storage solutions on GCP, including but not limited to BigQuery, Cloud SQL, and Cloud Spanner, while considering integration with Hadoop and ELK ecosystems..Integrating data from diverse sources, encompassing on-premises systems and other cloud platforms, leveraging technologies such as Cloud Dataflow, Cloud Dataproc, and Cloud Data Fusion. .Orchestrating the creation and management of data pipelines for both real-time and batch processing, utilizing technologies like Cloud Dataflow, Cloud Composer, and Cloud Data Fusion, with seamless integration with Hadoop and ELK components..Ensuring the governance of data access, employing Cloud Identity and Access Management (IAM) and Cloud Key Management Service (KMS), while also considering security aspects related to Hadoop and ELK..Collaborating effectively with cross-functional teams, including data engineers, data scientists, and application developers, to guarantee proper utilization, integration, and security of data, while incorporating Hadoop, ELK, CI/CD, and SIEM principles..Vigilantly monitoring and optimizing the performance and cost efficiency of data storage and processing solutions, incorporating insights from Hadoop, ELK, CI/CD, and SIEM perspectives. -
Elk Engineer/Aws EngineerAon Dec 2017 - Feb 2020Orchestrated the migration initiative, transitioning from On-prem (Hadoop) and Teradata (IDW) to the GCP Data Engineering platform, integrating GCS, Dataproc, Datafusion, BigQuery, and Dataflow. This migration resulted in a substantial annual cost reduction in hardware infrastructure maintenance, along with a notable 25% improvement in performance.Provided valuable technical mentorship to junior team members, conducted thorough code and design reviews, and enforced coding standards and best practices, incorporating considerations related to Hadoop, ELK, CI/CD, and SIEM.Established a robust Cloud Warehouse in GCP's BigQuery, redirecting legacy data sources to BigQuery by reengineering existing ETL processes using PySpark/Talend, ensuring seamless integration with Hadoop and ELK ecosystems.Architected and implemented versatile frameworks for processing transactional data (30 million records/day) from SQLServer to the Cloud, utilizing Qlik Replicate and PySpark,Automated the ETL process across billion-row datasets, resulting in a 30% reduction in manual workload, with a focus on optimizing processes related to Hadoop, ELK, CI/CD, and SIEM.Ingested data from diverse sources, including SQLServer, Oracle, and Salesforce API, using Python to create data views for BI tools like Tableau/PowerBI, while ensuring compatibility with Hadoop.Leveraged Spark in Python to distribute data processing on large datasets, enhancing the speed of ingestion and migration, with a keen eye on Hadoop and ELK integration.Designed and executed migration pipelines from Hadoop to GCS/BigQuery, employing DISTCP utility and PySpark features for BigQuery-compatible conversion, while considering implications for Hadoop and ELK ecosystems.Implemented DevOps deployment methods for bulk DDL deployment and version management for BigQuery metadata, utilizing Terraform for maintaining object states, and integrating CI/CD principles, including considerations for Hadoop, ELK, CI/CD, and SIEM. -
Unix Linux AdministratorAccenture Sep 2015 - Nov 2017Our key objective is to develop and implement reusable frameworks, with a dedicated emphasis on Disaster Recovery and efficient Data Ingestion into Hive from diverse sources such as Mainframe systems, Teradata databases, and NAS files. This includes crafting frameworks and joblets such as File Capture, Snapshot Ingestion, SCD Ingestion, Load Status Joblet, DQ Check Joblet, ELI Joblet, Mainframe Files Ingestion, Mainframe Header Validation Joblet, and more.The MARS team consistently builds reusable frameworks and joblets tailored for constructing a Data Lake within the Big Data Hub. Additionally, we extend our expertise to Migelop various reusable frameworks on Talend Real-Time Big Data for both internal stakeholders and other client vendors. Our responsibilities encompass end-to-end integration and validation of these frameworks, incorporating considerations related to Hadoop, ELK, CI/CD, and SIEM principles.Our collaborative efforts focus on seamless integration with Hadoop ecosystems, implementing ELK solutions for effective log management and analysis, adhering to CI/CD practices for streamlined development and deployment processes, and ensuring the incorporation of SIEM principles for robust security monitoring. The MARS team consistently works towards enhancing the efficiency and security of data processing through a holistic approach that considers Hadoop, ELK, CI/CD, and SIEM aspects.
Frequently Asked Questions about Anu P.
What company does Anu P. work for?
Anu P. works for Ci Financial
What is Anu P.'s role at the current company?
Anu P.'s current role is Sr. ELK and DevOps Engineer.
Not the Anu P. you were looking for?
Free Chrome Extension
Find emails, phones & company data instantly
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial