Data Scientist
Rahway, New Jersey, United States
- Developed and optimized data pipelines to support genomic data analysis, ensuring secure and scalable storage of large datasets, which improved data retrieval efficiency for advanced analytics and research.
- Implemented data ingestion processes for handling real-time data from various sources, reducing processing times and enabling timely insights for decision-making.
- Utilized traditional data warehousing and ETL tools (such as SQL and Python) for data modeling, ETL processes, and… Show more
- Utilized traditional data warehousing and ETL tools (such as SQL and Python) for data modeling, ETL processes, and transforming raw data into structured formats suitable for analysis.
- Managed the distributed processing of large-scale genomic datasets using tools like Hadoop and Spark, which reduced analysis times and enabled faster insights generation.
- Conducted performance tuning of SQL queries, optimizing execution times for faster access to critical data in research and analytics projects.