Hey there!

Welcome to my digital canvas, where each pixel of data tells a story of innovation, strategy, and boundless curiosity. With over 8 years of experience in Data Engineering, Big Data Development, and Data Visualization, my career is a testament to turning the complex tapestry of data into a clear narrative.

Professional Highlights:

Versatile Expertise: My professional saga spans roles as a Data Engineer, Big Data Developer, and Analyst, where I've partnered with various clients to navigate the vast seas of data. My expertise serves businesses eager to unlock the true potential of their data, and it is driven by a passion for exploring the vast universe of data.

Technical Symphony: My technical repertoire extends beyond PySpark and SQL, embracing the rich possibilities of Hive, Airflow, Databricks, AWS Redshift, S3, Glue, and Snowflake. This diverse toolkit allows me to sculpt data workflows, optimize big data processes, and deploy scalable solutions.

PySpark, Python & SQL Mastery: At the heart of what I do are my skills in automating data processes and my expertise in PySpark and Python, which allow me to refine and prepare data for its final use. My journey has taken me through complex data challenges, improving performance and streamlining processes so that data is ready to deliver value.

Architect of Data Solutions: My narrative extends to creating blueprints for success through comprehensive technical documents and data flow diagrams. My approach ensures a foundation of precision and excellence, turning data visions into reality.

Strategic Business Insights: Collaborating closely with clients, I've transformed business needs into dynamic reports and dashboards in Tableau, weaving data into the fabric of decision-making. My dual role as a leader and hands-on expert has been instrumental in demystifying data modelling and warehouse design.

My Experience:
My journey with data has taken me across the exciting worlds of social media, payments, retail banking, and investment banking.

Why I Stand Out:
Beyond my technical acumen lies a fervent passion for elevating data from mere numbers to powerful narratives. My belief in data's transformative power fuels my mission to unlock new horizons for businesses, crafting data-driven strategies that illuminate paths to success.
-
Technical Writer
Medium
Toronto, ON, CA
-
Data Engineer @ Pinterest
Capgemini
Apr 2023 - Present
Toronto, Ontario, Canada
• Developed scalable and maintainable data pipelines using Apache Spark, Hadoop, and other Big Data technologies, resulting in improved data processing efficiency and performance.
• Successfully transformed complex analytical models using PySpark, Python, and SQL into scalable, production-ready solutions, enhancing the overall data processing capabilities of the organization.
• Led the development of scalable Databricks pipelines, leveraging Delta Lake for improved data reliability in a 10 TB lake, enhancing query performance by 40%.
• Extensive experience in ETL/ELT automation of data extraction, data cleaning, and data preparation using PySpark, Python, Spark SQL, AWS Glue, and Airflow.
• Collected, cleansed, modeled, and analyzed structured and semi-structured data used for business initiatives, which resulted in $1M+ in annual savings.
• Developed a complete automation framework using Python on Databricks that led the migration of more than 500 Spark jobs from Spark 2.4 to Spark 3.2, ensuring compatibility, optimizing performance, and delivering overall savings of over $1.5 million.
• Developed a comprehensive automation framework in Python, designed to dynamically optimize 3000+ Spark jobs across multiple parameters.
• Excelled in debugging complex data pipelines, resolving issues that led to a 50% reduction in downtime and significantly enhancing data flow efficiency between production and analytics environments.
-
Technical Writer
Medium
Sep 2023 - Present
Toronto, Ontario, Canada
-
PNC Bank: Big Data Engineer (Anti Money Laundering)
Tata Consultancy Services
Jun 2017 - Jan 2023
Pune, Maharashtra, India
• Implementing data pipeline architectures in big data for multi-vendor datasets with the help of PySpark and Cloudera HDFS services.
• Extracting, cleaning, and filtering the required data from raw formats into the warehouse.
• Writing PySpark scripts for the transformation and ingestion loads of warehouse data, along with scripts for creation of datamart/API tables.
• Performed end-to-end ETL automation of data extraction, data cleaning, and data preparation using PySpark, Python, and Spark SQL.
• Increased the efficiency of data pipelines that processed over 100 TB of data daily.
• Refining the existing workflows with optimization techniques and best code practices.
• Wrote scripts in Hive SQL for creating complex tables with performance-oriented features such as partitioning, clustering, and skewing.
• Using Git 2.x to maintain code versioning across the data scientists', data engineers', and analysts' teams.
• Getting and preparing requirements from business users.
• Understanding the input list of accounts provided by the business.
• Writing acceptance criteria for analysis and development stories.
• Preparing high-level and low-level design documents for ETL.
• Preparing complete end-to-end data flow architecture and design.
• Engaging with Application Support and administrators to validate the design for architectural approval.
• Suggesting alternative solutions that follow best practices and development and performance standards.
• Mapping the source and target Hive tables against the columns.
• Generating analytics-based reporting for Tableau dashboards.
• Involved in building reports in multiple views, dashboards, and storyboards using Tableau.
-
RBC Bank: Big Data Engineer / Analyst
Tata Consultancy Services
Jan 2016 - Jun 2017
Pune Area, India
• Translated business needs into technical specifications to deliver customized reporting systems.
• Designed, built, and maintained 100% of automated systems for reporting, analysis, and analytics.
• Used Python and PySpark scripts to reconcile statement and ledger files.
• Getting and preparing requirements from business users.
• Automated the manual process of reconciliation, which saved 25 hours of manual work.
• Writing acceptance criteria for analysis and development stories.
• Preparing high-level and low-level design documents for ETL.
• Preparing complete end-to-end data flow architecture and design.
• Engaging with Application Support and administrators to validate the design for architectural approval.
• Suggesting alternative solutions that follow best practices and development and performance standards.
• Generating analytics-based reporting for Tableau dashboards.
• Writing end-to-end PySpark scripts that consist of loading data from a file into Hive tables (or from a staging table into a partitioned Hive table), transforming DataFrames into the intended results, applying all the business validations, and writing the DataFrames to the Hive tables.
• Mapping the source and target Hive tables against the columns.
• Configuring the property file and configuration file alongside the PySpark file to keep the functionality flexible across different environments and input arguments.
• Developing data pipelines for various data sources.
-
Morgan Stanley Bank: Rules Developer / Analyst
Tata Consultancy Services
Jan 2015 - Dec 2015
Pune Area, India
This project deals with the client onboarding process at Morgan Stanley. Whenever a new client is onboarded, the client is evaluated against certain rules. Rules are applied on the basis of various norms defined by the client.
Environment: Java, XML, Unix, Sybase
Work:
1) Analyzing business requirements for the Rules project.
2) Creating and implementing rules by analyzing and inspecting the given requirements. Coding whole rules by Rule
3) Authoring internal tool with the help of Unix and Database
4) Supporting and maintaining the whole UI efficiently using Angular and Java skills.
-
C++ Developer
Tata Consultancy Services
Sep 2014 - Dec 2014
Hyderabad Area, India
PROJECT: Mapple (Training Project)
Brief: Mapple was an application modeling a mobile store, similar to the Apple Store, where you can select and buy any product from the store.
Environment: Unix for UI, C++ for backend code, Oracle (storage)
Work:
1) Designed an application to create a mobile store for buying any phone.
2) Requirement gathering.
3) Led and actively participated in team discussions.
Ankur Chopra Education Details
-
1st Division
Frequently Asked Questions about Ankur Chopra
What company does Ankur Chopra work for?
Ankur Chopra works for Medium.
What is Ankur Chopra's role at the current company?
Ankur Chopra's current role is Technical Writer.
What schools did Ankur Chopra attend?
Ankur Chopra attended Chatrapati Sahuji Maharaj Kanpur University, Kanpur, Guru Nanak Dev University.