Sai Krishna

Sai Krishna Email and Phone Number

Senior Data Scientist | NLP, Big Data Architectures @ Signify Health
dallas, texas, united states
Sai Krishna's Location
San Bernardino, California, United States, United States
About Sai Krishna

As a Senior Data Scientist with over 8 years of experience, I leverage machine learning and data science to address complex business challenges. My expertise encompasses the entire data science lifecycle, from data acquisition and preprocessing to predictive modeling and visualization.I excel in Python (versions 2.x and 3.x) and R, utilizing libraries such as NumPy, Pandas, SciPy, Scikit-learn, XGBoost, LightGBM, Keras, TensorFlow, and PyTorch. My proficiency extends to SQL/PLSQL, enabling effective database manipulation and complex query writing. I am skilled in developing ETL pipelines using Informatica, and managing big data technologies including Hadoop, Apache Spark, Hive, and HDFS.Key Skills & Tools:Data Modeling & ETL: Designed and implemented 3NF, Star, and Snowflake schemas. Developed ETL processes using Informatica, Azure Data Factory (ADF), and performed data manipulation with SQL tools like Teradata SQL Assistant and Oracle SQL Developer.Big Data Technologies: Experienced with Apache Hadoop, Spark, Hive, and HDFS for scalable data processing and analysis.Database Management: Proficient in SQL Server, MySQL, PostgreSQL, MongoDB, Cassandra, and HBase for managing both relational and NoSQL databases.Data Visualization: Created interactive dashboards and reports using Tableau, Power BI, QlikView, and Alteryx.Advanced Analytics: Applied machine learning algorithms including Decision Trees, Random Forests, Naive Bayes, Logistic Regression, and Neural Networks. Utilized statistical techniques like PCA, Hypothesis Testing, and Time Series Analysis.Software & Platforms: Utilized PyCharm, RStudio, Azure ML, IBM Watson Studio, and other IDEs for development and analysis.At Signify Health, I developed and deployed machine learning solutions, while at Charter Communications, I managed big data projects and predictive analytics. My role at Northern Trust involved advanced statistical modeling and data warehousing, and my early career at Synechron Technologies and High Radius Technologies provided a solid foundation in SQL development and data engineering.I am dedicated to fostering a data-driven decision-making culture within organizations and continuously enhancing my technical skills to stay ahead in the rapidly evolving field of data science.

Sai Krishna's Current Company Details
Signify Health

Signify Health

View
Senior Data Scientist | NLP, Big Data Architectures
dallas, texas, united states
Employees:
1794
Sai Krishna Work Experience Details
  • Signify Health
    Senior Data Scientist
    Signify Health Jan 2023 - Present
    Dallas, Texas, United States
    · Collaborated with Data and IT Architects on data movement and storage, using ER Studio 18.5 for modeling.· Developed and deployed machine learning algorithms with Python libraries (pandas, NumPy, SciPy, scikit-learn, NLTK).· Implemented supervised models (Logistic Regression, Decision Trees, KNN, Naive Bayes) to improve predictions.· Used the Caffe Deep Learning Framework for designing neural networks for image and video analysis.· Worked with data formats like JSON and… Show more · Collaborated with Data and IT Architects on data movement and storage, using ER Studio 18.5 for modeling.· Developed and deployed machine learning algorithms with Python libraries (pandas, NumPy, SciPy, scikit-learn, NLTK).· Implemented supervised models (Logistic Regression, Decision Trees, KNN, Naive Bayes) to improve predictions.· Used the Caffe Deep Learning Framework for designing neural networks for image and video analysis.· Worked with data formats like JSON and XML; utilized Python for manipulation and analysis.· Managed data mining phases: collection, cleaning, model development, validation, and visualization.· Created NLP systems for automating customer service query classification.· Designed 3NF and dimensional data models (Star and Snowflake Schemas) for operational and OLTP systems.· Built Azure-based solutions with Azure Event Hub, Stream Analytics, PowerBI, and Azure ML for real-time analysis.· Led Big Data integration in Microsoft Azure to enhance analytics capabilities.· Developed JSON scripts and managed Azure Data Factory (ADF) pipelines for ETL.· Updated Python scripts for AWS Cloud Search integration and document classification.· Built solutions using Azure PaaS for visualization and assessing business impact.· Guided Agile teams in application development and ensured HIPAA compliance for data security.· Developed R and Python programs for data preparation and harmonization.· Conducted gap analysis for data discrepancies and optimization.· Created OLAP databases, scorecards, dashboards, and reports for insights.· Managed data from various sources using Nexus, Toad, Business Objects, Powerball, and SmartView.· Utilized Seaborn and Matplotlib for data visualizations and automated ETL with Azure Data Factory.· Maintained documentation of data processes and models; conducted training for data science tools.· Evaluated new data analytics technologies to stay current. Show less
  • Charter Communications
    Data Scientist
    Charter Communications Aug 2021 - Jul 2022
    Stamford, Connecticut, United States
    · Analyzed large datasets using Python, R, and Scala, leveraging TensorFlow and PyTorch for deep learning.· Managed big data with Apache Spark and Hadoop, using Spark MLlib and Hadoop MapReduce for scalable data operations.· Developed predictive models with XGBoost, RandomForest, and SVM to forecast customer behavior.· Conducted data profiling on traffic patterns and locations with Python and MATLAB.· Applied statistical modeling, including decision trees and regression models… Show more · Analyzed large datasets using Python, R, and Scala, leveraging TensorFlow and PyTorch for deep learning.· Managed big data with Apache Spark and Hadoop, using Spark MLlib and Hadoop MapReduce for scalable data operations.· Developed predictive models with XGBoost, RandomForest, and SVM to forecast customer behavior.· Conducted data profiling on traffic patterns and locations with Python and MATLAB.· Applied statistical modeling, including decision trees and regression models, to improve decision-making.· Implemented machine learning pipelines in Scikit-learn, TensorFlow, and Keras for automated model processes.· Designed real-time data processing pipelines with Kafka, Spark Streaming, and Flink for live data insights.· Created visualizations and dashboards using Tableau, PowerBI, and D3.js to support business decisions.· Cleansed, engineered features, and scaled data with Pandas and NumPy for analysis readiness.· Maintained SQL and NoSQL databases like Cassandra and MongoDB, ensuring data integrity.· Led A/B testing and data-driven campaigns to evaluate strategy effectiveness.· Optimized models with cross-validation, ROC curves, AUC; addressed overfitting using Lasso and Ridge.· Collaborated with teams to integrate machine learning into enterprise systems.· Wrote stored procedures and triggers in Oracle and PL/SQL to automate data workflows.· Managed end-to-end machine learning projects from data collection to deployment.· Developed shell scripts and used NZSQL/NZLOAD for efficient data loading.· Applied PCA for dimensionality reduction in high-dimensional data.· Conducted exploratory data analysis to identify key variables for model development.· Implemented rule-based systems for interpreting outputs and providing recommendations.· Analyzed user lifetime value using longitudinal data analysis.· Communicated technical results to non-technical stakeholders to influence decisions.· Enhanced data quality with validation scripts in SQL and Hive. Show less
  • Northern Trust
    Data Scientist
    Northern Trust Jan 2019 - Jul 2021
    Chicago, Illinois, United States
    · Collaborated with clients to gather data requirements and executed ETL processes to standardize formats.· Conducted advanced SQL queries for data retrieval and preliminary analysis.· Utilized Python's Pandas for data preprocessing, including cleaning, type casting, and table merging for EDA.· Applied feature engineering like PCA, normalization, and label encoding using Scikit-learn for high-dimensional data.· Explored data using correlation analysis and visualization tools… Show more · Collaborated with clients to gather data requirements and executed ETL processes to standardize formats.· Conducted advanced SQL queries for data retrieval and preliminary analysis.· Utilized Python's Pandas for data preprocessing, including cleaning, type casting, and table merging for EDA.· Applied feature engineering like PCA, normalization, and label encoding using Scikit-learn for high-dimensional data.· Explored data using correlation analysis and visualization tools like Matplotlib and Seaborn, focusing on healthcare data.· Developed and maintained data warehousing and Data Lake solutions, integrating SQL and NoSQL databases.· Analyzed large datasets with R, SAS, MATLAB, and Python, focusing on linear regression models.· Executed pattern recognition on financial time series data using ARMA, ARIMA, and exponential smoothing.· Designed and tested predictive models like Logistic Regression, SVM, Random Forest, XGBoost, and Neural Networks.· Managed end-to-end data workflows, including collection, storage, analysis, and model validation.· Performed data wrangling, cleaning datasets, and visualizing trends using Python and Matplotlib.· Implemented and optimized predictive models on AWS Lambda, selecting the best algorithms for performance.· Gathered post-deployment feedback to retrain models and improve accuracy.· Developed and maintained reports using Tableau, influencing business decisions.· Mentored junior data scientists, leading complex data projects.· Streamlined data pipelines using Apache Spark and Hadoop, enhancing processing efficiency.· Strengthened data security through governance and quality control measures.· Integrated AI and machine learning into broader tech stacks and business applications.· Contributed to developing proprietary algorithms, translating complex data into actionable insights. Show less
  • Synechron Technologies Llc
    Junior Data Scientist
    Synechron Technologies Llc Jul 2017 - Dec 2018
    India
    · Engaged in the full Software Development Life Cycle (SDLC), analyzing business requirements and mapping workflows using Agile methodologies.· Completed an advanced Data Science program, focusing on Data Manipulation, Visualization, Machine Learning, Python, SQL, and cloud platforms like AWS and Azure.· Utilized Python libraries (Pandas, NumPy, Matplotlib, scikit-learn) to develop and fine-tune machine learning models, processing complex data formats like JSON and CSV.· Performed… Show more · Engaged in the full Software Development Life Cycle (SDLC), analyzing business requirements and mapping workflows using Agile methodologies.· Completed an advanced Data Science program, focusing on Data Manipulation, Visualization, Machine Learning, Python, SQL, and cloud platforms like AWS and Azure.· Utilized Python libraries (Pandas, NumPy, Matplotlib, scikit-learn) to develop and fine-tune machine learning models, processing complex data formats like JSON and CSV.· Performed sentiment analysis and trend detection using NLP techniques with Python’s NLTK and spaCy libraries.· Collaborated with senior data scientists to uncover patterns in large datasets, applying statistical methods and machine learning algorithms like XGBoost and decision trees.· Enhanced data visualization and analytics using Python and R, creating advanced graphical representations and interactive dashboards with R Shiny.· Implemented predictive models for customer behavior analysis, employing time series analysis and machine learning techniques in R and Python.· Applied Generalized Additive Models (GAM) for user segmentation and preference prediction, focusing on non-linear relationships in large-scale data.· Conducted data cleaning and preprocessing, utilizing advanced techniques for managing missing data, such as multiple imputation.· Developed and validated predictive models, including neural networks and deep learning frameworks like TensorFlow and PyTorch.· Improved model performance through ensemble methods like stacking and boosting, using libraries like LightGBM and CatBoost.· Presented insights to senior management using Power BI, Tableau, and Google Data Studio, driving data-driven decision-making.· Proficient in Big Data technologies like Apache Hadoop and Hive for managing and analyzing large datasets in distributed environments.· Continuously updated technical skills, adopting new technologies to stay ahead in the evolving field of Data Science and Machine Learning. Show less
  • Highradius
    Sql Developer
    Highradius May 2016 - Jun 2017
    India
    • Developed data mapping, governance, and transformation rules for MDM architecture involving OLTP, ODS, and OLAP.• Provided source to target mappings for ETL team, enabling initial, full, and incremental loads into the data mart.• Conducted JAD sessions, collected requirements, and defined source to target data mappings and business rules.

Frequently Asked Questions about Sai Krishna

What company does Sai Krishna work for?

Sai Krishna works for Signify Health

What is Sai Krishna's role at the current company?

Sai Krishna's current role is Senior Data Scientist | NLP, Big Data Architectures.

Who are Sai Krishna's colleagues?

Sai Krishna's colleagues are Lane Salazar, Szymon Woźniak, Carolyn Hardy, Mba, Josephine Silverthorn, Tiante Shorter, Ericka Daniels, Aimee Getter.

Not the Sai Krishna you were looking for?

  • Sai Krishna

    Actively Looking For Data Analyst Role| Ex-Nvidia | Ex- Virtusa | Mckinsey & Company | Bny | Certified Oracle Developer | Certified Aws | Data Analyst | Oracle Pl/Sql | Python | Machine Learning | Etl| Power Bi | Tableau
    Cincinnati, Oh
  • Sai K.

    Golang Developer
    Plano, Tx
  • Sai Krishna

    Actively Looking For New Opportunities In C2C And C2H | Java Full Stack Developer | Front End Developer | Spring Boot 2 | Golang | Kotlin | Python | Javascript | Angular 6 | Oracle | Cassandra | Mongodb | Aws | Azure ||
    Greater Houston
  • Sai Krishna

    Overland Park, Ks
    2
    cognizant.com, gmail.com
  • Sai Krishna

    Sr. Java Software Engineer | Expert In High-Performance Systems & Scalable Microservices | Spring Boot & Cloud Solutions Specialist|React & Angular|Aws|
    United States

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.