Sameer Khan Email and Phone Number
● Highly efficient Data Engineer with around 9 years of experience in Data Analysis and Statistical Analysis, and strong functional knowledge of business processes and the latest market trends.
● Experienced in Data Acquisition, Data Validation, Predictive Modeling and Data Visualization. Proficient in statistical programming languages such as R and Python.
● Proficient in managing the entire project life cycle and actively involved in all of its phases.
● Extensive experience in Text Analytics, developing Statistical Machine Learning and Data Mining solutions to various business problems, and generating data visualizations using R, Python and Tableau.
● Deep understanding of Statistical Modeling, Multivariate Analysis, model testing, problem analysis, model comparison and validation.
● Skilled in data parsing, data manipulation and data preparation, with methods including describing data contents, computing descriptive statistics, regex, split and combine, remap, merge, subset, reindex, melt and reshape.
● Experience in using various packages in R and libraries in Python.
● Solid working experience with SQL, including MySQL and MS SQL Server. Strongly skilled in writing stored procedures, triggers and complex queries containing multiple joins, subqueries and window functions to create reports and perform analysis.
● Experience in creating dashboards in Tableau for reporting and data visualization, guiding business decision-making for multiple stakeholders.
● Experience in developing Python scripts for the whole data engineering project life cycle, including data acquisition, data cleaning, data exploration and data modeling, using libraries such as Pandas and Sklearn.
● Experience in designing ETL processes using the Talend tool to load data from sources to targets through data transformations.
● Experience in building ETL pipelines to extract, transform and load data into analytical databases, and in scheduling and automating pipelines using Apache Airflow.
● Solid experience working with cloud platforms such as AWS and Google Cloud.
● Experience working with shell scripting in operating systems such as Linux and with version-control tools such as Git.
● Detail-oriented self-starter with strong communication skills, presenting results of analysis to both technical and non-technical audiences, and experience collaborating within cross-functional teams.
● Experience in designing, developing and tracking Key Performance Indicators (KPIs) and creating dashboards to monitor them.
Bank Of America
Website: bankofamerica.com
Employees: 232061
-
Senior Data Scientist and AI-ML Engineer and ML Ops, Bank Of America
United States
-
Senior Data Engineer, Bank Of America
Sep 2023 - Present, Plano, Texas, United States
● Skilled in data cleansing and preprocessing using Python, creating data workflows with SQL queries in Alteryx, and preparing Tableau Data Extracts (TDE).
● Executed process improvements in data workflows using the Alteryx processing engine and SQL.
● Strong experience in Teradata, Informatica, Python and UNIX shell scripting for processing large volumes of data from varied sources and loading it into databases such as Teradata and Oracle.
● Strong experience with Informatica Designer, Workflow Manager, Workflow Monitor and Repository Manager.
● Involved in understanding requirements and in modeling the attributes identified from different source systems in Oracle, Teradata and CSV files. Data is staged, integrated, validated and finally loaded into the Teradata warehouse using Informatica and Teradata Utilities.
● Extensive experience in designing and developing data integration solutions using ETL tools such as Informatica PowerCenter and Teradata Utilities for handling large volumes of data. Built performant, scalable ETL processes to load, cleanse and validate data.
● Worked on the development, support and maintenance of ETL (Extract, Transform and Load) processes using Talend.
● Created SSIS packages to back up the database and compress the database backup.
● Provided support for data processes, including monitoring data, profiling database usage, troubleshooting, tuning and ensuring data integrity.
● Collaborated with team members and stakeholders in the design and development of the data environment.
● Prepared associated documentation for specifications, requirements and testing.
● Used TensorFlow for text summarization and optimized the TensorFlow model for efficiency.
● Used Storm as an automatic mechanism to analyze large amounts of non-unique data points with low latency and high throughput.
● Developed MapReduce jobs in Python for data cleaning and data processing.
-
Data Engineer, Sigma Llc
Apr 2021 - Dec 2022, Delhi, India
● Worked with Agile methodologies and the Scrum process.
● Performed data profiling to learn about user behavior and merged data from multiple data sources.
● Participated in all phases of data mining: data collection, data cleaning, model development, validation and visualization, and performed gap analysis.
● Performed K-means clustering, multivariate analysis and Support Vector Machines in Python and R.
● Developed clustering algorithms and Support Vector Machines that improved customer segmentation and market expansion.
● Professional Tableau user (Desktop, Online and Server).
● Data storyteller, mining data from different data sources such as SQL Server, Oracle, Cube Database, Web Analytics and Business Objects.
● Provided ad hoc analysis and reports to the executive-level management team.
● Manipulated and aggregated data from different sources using Nexus, Toad, Business Objects, Power BI and Smart View.
● In the Unix development environment, used batch processes and models written in Perl and Korn shell scripts, with partitions and subpartitions on an Oracle database, for financial application reports.
● Developed analytics and strategy to integrate B2B analytics in outbound calling operations.
● Implemented analytics delivery on cloud-based visualization using Shiny tools for Business Objects and the Google Analytics platform.
● SPOC Data Engineer and predictive analyst creating annual and quarterly business forecast reports.
● Main source of the business regression report.
● Created various B2B predictive and descriptive analytics using R and Tableau.
● Created and automated ad hoc reports.
● Responsible for planning and scheduling new product releases and promotional offers.
● Worked on NoSQL databases such as Cassandra.
● Parsed data and produced concise conclusions from raw data in a clean, well-structured and easily maintainable format.
● Worked with different data formats such as JSON and XML and applied machine learning algorithms in R and Python.
-
Data Engineer, Info Builders
Jan 2018 - Mar 2021, Mumbai, Maharashtra, India
● Involved in sprint planning sessions and participated in the daily Agile Scrum meetings.
● Worked on the project from gathering requirements to developing the entire application.
● Worked in the Anaconda Python environment: created, activated and programmed in Anaconda environments.
● Used the Python modules urllib, urllib2 and Requests for web crawling. Used ML techniques including clustering, regression, classification and graphical models.
● Involved in the development of web services using SOAP for sending and receiving data from the external interface in XML format. Used other packages such as Beautiful Soup for data parsing.
● Worked on the development of SQL and stored procedures on MySQL.
● Responsible for creating reporting dashboards and performing data mining and analysis to understand customer purchase behavior.
● Collaborated with the marketing team to analyze marketing campaign data and perform analysis involving segmentation and cohort analysis.
● Designed MySQL table schemas and implemented stored procedures to extract and store customer purchase and session data.
● Actively involved in designing A/B tests, defining metrics to validate new user interface features, calculating sample sizes and checking statistical assumptions for tests.
● Performed statistical analysis such as hypothesis testing, regression analysis, and confidence interval and p-value calculation using R to find insights to increase click-through rate and sales, and built web applications for ad hoc interactive dashboards.
● Performed Exploratory Data Analysis to identify trends using Tableau and Python (Matplotlib, Seaborn, Plotly Dash).
● Developed Python scripts for data preprocessing for predictive models, including missing value imputation, label encoding and feature engineering.
● Communicated key findings from data to multiple stakeholders to facilitate data-driven decisions using MS PowerPoint, Tableau and Jupyter Notebook.
-
Data Engineer, HDFC Bank
Aug 2014 - Dec 2017, Bengaluru, Karnataka, India
● Developed predictive models using various machine learning techniques such as XGBoost, Random Forests and Logistic Regression to predict credit risk and potentially fraudulent customers with an accuracy of 98.5%.
● Utilized time-series forecasting algorithms such as ARIMA, SARIMA and Holt-Winters to forecast operational needs of the company, including worker productivity, sales, revenue and other vital metrics for business needs and planning.
● Conducted large-scale exploratory data analysis and data mining to identify patterns, correlations and anomalies in structured and unstructured data to create business-driven solutions and analysis using AWS Redshift, Glue and S3.
● Working experience in Data Warehouse ETL design and implementation of complex big data pipelines.
● Developed data visualizations to provide insights with more impact, specifically in relation to book-of-business outcomes reporting and client-specific projects, using Tableau, R, Python and Excel.
● Collaborated with various teams such as Operations, Marketing, Sales and Human Resources to drive business value through data-driven solutions, dashboards, KPIs and ad hoc reports using Tableau.
● Designed and developed complex data models using Python and SQL to ensure data reliability and accuracy, which led to a reduction in anomalies and data redundancy in data warehouse storage such as Redshift and S3.
● Worked closely with cross-functional teams to identify opportunities to improve data quality, data mapping and data modeling, and to optimize data processing flows and the analytical framework.
● Used correlation analysis and graphical techniques in Matplotlib, Seaborn and ggplot to gain insights into customer loan patterns, market segmentation and anomalies.
● Collaborated constantly with the Data Engineering and Data Services teams for optimal extraction, transformation and loading of data from a wide variety of data sources using SQL, Python and AWS cloud-native technologies.
Frequently Asked Questions about Sameer Khan
What company does Sameer Khan work for?
Sameer Khan works for Bank Of America
What is Sameer Khan's role at the current company?
Sameer Khan's current role is Senior Data Scientist and AI-ML Engineer and ML Ops.
Who are Sameer Khan's colleagues?
Sameer Khan's colleagues are Diane Tinsman, John Ritter, Lawrence Pan, Teegan Howell, Richard G., Nancy Mazur, Rosette Nampija.