I am a data engineer with over 2 years of experience in building and optimizing large-scale data pipelines using big data technologies like Apache Spark, Hadoop, and cloud platforms such as AWS and GCP. My expertise includes working with PySpark, SQL, and AWS Glue to manage and transform data, creating scalable solutions that drive business impact. I have successfully delivered data projects for industries such as telecommunications and healthcare, automating resource management and enhancing data processing efficiency.Currently pursuing a Master's in Computer Science at the University at Buffalo, I am further honing my skills in data engineering and cloud computing. I am actively seeking full-time data and software engineering opportunities starting December 2024, where I can leverage my experience and technical skills to contribute to impactful projects. Letโs connect to explore potential collaborations or opportunities!๐๐๐๐ก๐ง๐ข๐๐๐ฅ ๐๐ค๐ข๐ฅ๐ฅ๐ฌ : โ ๐๐ซ๐จ๐ ๐ซ๐๐ฆ๐ฆ๐ข๐ง๐ ๐ฅ๐๐ง๐ ๐ฎ๐๐ ๐๐ฌ: C/C++, Java, Python, JavaScript, SQL. โ ๐๐๐: React.js, Node.js, HTML, CSS, REST APIs โ ๐๐๐ญ๐: Spark, Beam, MySQL, Hadoop, Hive, Snowflake, Apache oozie, Docker. โ ๐๐ฅ๐จ๐ฎ๐ ๐ฉ๐ฅ๐๐ญ๐๐จ๐ซ๐ฆ๐ฌ:
-
Data Engineer InternHaver Ai Inc. Jun 2024 - Aug 2024โข Developed monthly data ingestion jobs using ๐๐๐ ๐๐๐ฆ๐๐๐ and ๐๐๐ ๐๐ฏ๐๐ง๐ญ๐๐ซ๐ข๐๐ ๐ to fetch claims data in FHIR format from ๐๐๐๐ ๐๐๐, leading to 100% reduction in manual effort. Enhanced system reliability with real-time notifications via ๐๐๐ ๐๐๐.โข Utilized ๐๐๐ ๐๐ฅ๐ฎ๐ and ๐๐ฒ๐๐ฉ๐๐ซ๐ค for preprocessing and integrating ๐๐๐๐ data from various sources into ๐๐๐ ๐๐๐๐ฌ๐ก๐ข๐๐ญ, achieving a 50% increase in data processing speed.โข Designed and executed backend functionality for serving ๐๐๐๐ ๐๐๐ requests using ๐๐๐ ๐๐๐ ๐๐๐ญ๐๐ฐ๐๐ฒ and ๐๐๐ฆ๐๐๐.โข Developed a scalable ๐๐๐๐ฉ๐ฌ ๐ฉ๐ข๐ฉ๐๐ฅ๐ข๐ง๐ that automates the execution of a 6-step machine learning workflow in ๐๐๐ ๐๐๐ ๐๐๐๐ค๐๐ซ, triggered by user interactions on the frontend. Utilized AWS SageMaker ๐๐ฎ๐ญ๐จ๐๐ to generate predictions as part of the pipeline. -
Data EngineerSkuad Dec 2022 - May 2023โข Implemented business Key Performance Indicators (๐๐๐๐ฌ) for a telecommunications client using ๐๐ฒ๐๐ฉ๐๐ซ๐ค on Google Cloud Platform (๐๐๐) and Amazon Web Services (๐๐๐) leveraging Customer 360 data.โข Leveraged ๐๐๐ ๐๐ฅ๐ฎ๐, S3, ๐๐ญ๐ก๐๐ง๐, ๐๐๐ญ๐๐ฉ๐ซ๐จ๐, ๐ฐ๐จ๐ซ๐ค๐๐ฅ๐จ๐ฐ ๐ญ๐๐ฆ๐ฉ๐ฅ๐๐ญ๐๐ฌ, and ๐๐ข๐ ๐๐ฎ๐๐ซ๐ฒ to ensure efficient data processing.โข Successfully migrated a client use-case from ๐จ๐ง-๐ฉ๐ซ๐๐ฆ๐ข๐ฌ๐๐ฌ infrastructure written in ๐ฌ๐ก๐๐ฅ๐ฅ to ๐๐ฒ๐๐ฉ๐๐ซ๐ค on ๐๐๐, resulting in improved performance and scalability.โข Played a key role in implementing a ๐ฎ๐ญ๐ข๐ฅ๐ข๐ญ๐ฒ ๐ฌ๐๐ซ๐ข๐ฉ๐ญ for dynamic cluster setup using ๐๐๐ซ๐ซ๐๐๐จ๐ซ๐ฆ. -
Data EngineerQuantiphi Nov 2021 - Nov 2022Mumbai, Maharashtra, Indiaโข Spearheaded the implementation of a cloud solution which was a ๐ฉ๐๐ซ๐๐ฅ๐ฅ๐๐ฅ ๐ฉ๐ซ๐จ๐๐๐ฌ๐ฌ๐ข๐ง๐ ๐๐ซ๐๐ฆ๐๐ฐ๐จ๐ซ๐ค to process 300,000 pdf documents on ๐๐๐ utilizing ๐๐ฅ๐จ๐ฎ๐ ๐ฏ๐ข๐ฌ๐ข๐จ๐ง ๐๐๐.โข Wrote ๐๐ฑ๐ญ๐๐ซ๐ง๐๐ฅ ๐ฌ๐๐ซ๐ข๐ฉ๐ญ for triggering the airflow pipelines concurrently with set ๐๐ฑ๐ฉ๐๐ซ๐ข๐ฆ๐๐ง๐ญ๐๐ฅ ๐ฉ๐๐ซ๐๐ฆ๐๐ญ๐๐ซ๐ฌ and ๐จ๐ฉ๐ญ๐ข๐ฆ๐ข๐ณ๐๐ the processing time by around 65% (9 days).โข Led end-to-end migration of critical workflow orchestration projects from ๐๐๐๐จ๐จ๐ฉ ๐๐๐จ๐ฌ๐ฒ๐ฌ๐ญ๐๐ฆ to GCP Composer and Dataproc, improving overall ๐ซ๐๐ฌ๐จ๐ฎ๐ซ๐๐ ๐ฎ๐ญ๐ข๐ฅ๐ข๐ณ๐๐ญ๐ข๐จ๐ง by 40%.โข Migrated Generador Interfaz workflow by creating on-demand child workflows using ๐๐๐๐ file properties. Leveraged ๐๐ข๐ง๐ฃ๐ ๐ญ๐๐ฆ๐ฉ๐ฅ๐๐ญ๐๐ฌ and a ๐๐๐ ๐๐ฎ๐๐ค๐๐ญ for seamless Airflow migration, resulting in a 15% ๐๐๐๐ข๐๐ข๐๐ง๐๐ฒ boost.โข Redesigned and implemented ๐๐๐ฅ๐ข๐๐๐ญ๐จ๐ซ ๐ฐ๐จ๐ซ๐ค๐๐ฅ๐จ๐ฐ ๐ข๐ง ๐๐ข๐ซ๐๐ฅ๐จ๐ฐ, enabling the triggering of 47 other workflows based on a ๐๐๐๐ schedule stored in an ๐๐๐-๐๐จ๐ง๐ญ๐ซ๐จ๐ฅ ๐ญ๐๐๐ฅ๐. -
Data Engineer InternQuantiphi Jul 2021 - Nov 2021Mumbai, Maharashtra, Indiaโข Intensive training program covering ๐๐ฅ๐จ๐ฎ๐ ๐ฉ๐ฅ๐๐ญ๐๐จ๐ซ๐ฆ๐ฌ (๐๐๐ ๐๐ง๐ ๐๐๐), ๐๐ ๐ฆ๐จ๐๐๐ฅ๐ข๐ง๐ , ๐๐ข๐ ๐๐๐ญ๐ ๐ฉ๐ซ๐ข๐ง๐๐ข๐ฉ๐ฅ๐๐ฌ, ๐๐๐๐จ๐จ๐ฉ, ๐๐ฉ๐๐๐ก๐ ๐๐ฉ๐๐ซ๐ค, ๐๐๐ tools like ๐๐ง๐๐จ๐ซ๐ฆ๐๐ญ๐ข๐๐, and ๐๐ง๐จ๐ฐ๐๐ฅ๐๐ค๐, data visualization tools like ๐๐๐๐ฅ๐๐๐ฎ, providing a holistic view of data engineering.โข Developed a real-time request simulation logic that individually sent requests from a fixed input file in Google Cloud Storage (๐๐๐) to a deployed machine learning model on ๐๐๐ซ๐ญ๐๐ฑ ๐๐.โข Created a custom ๐๐ฅ๐จ๐ฎ๐ ๐๐๐ญ๐๐๐ฅ๐จ๐ฐ template using ๐๐ฉ๐๐๐ก๐ ๐๐๐๐ฆ to perform basic transformations on data.โข Utilized ๐๐ฉ๐๐๐ก๐ ๐๐ข๐ซ๐๐ฅ๐จ๐ฐ as a workflow management platform to ๐ฌ๐๐ก๐๐๐ฎ๐ฅ๐ and ๐จ๐ซ๐๐ก๐๐ฌ๐ญ๐ซ๐๐ญ๐ daily processing jobs.
Harshdeep Mishra Education Details
-
Cgpa : 9.15/10
Frequently Asked Questions about Harshdeep Mishra
What is Harshdeep Mishra's role at the current company?
Harshdeep Mishra's current role is MS CSE @ SUNY Buffalo | Data Engineer | AWS | GCP | Python | Spark | Cloud | SQL | Node.js | C++ | JavaScript.
What schools did Harshdeep Mishra attend?
Harshdeep Mishra attended University At Buffalo, Thadomal Shahani Engineering College, Thakur College Of Science & Commerce.
Not the Harshdeep Mishra you were looking for?
-
-
Harshdeep Mishra
Student At Bhartiya Vidya Bhavans Sardar Patel Institute Of Technology Munshi Nagar Andheri MumbaiMumbai -
-
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records ร $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial