Arif Khan personal email
- Valid
Principal Engineering Leader/ Software Engineer , Seattle, WA.Open source contributionSimply Index Logger (SIL), developed in JAVA:● Idea, initiation of the project, designed & developed v1 of the product● Code is open-sourced & open for contribution from other developers● Design & source code is available at the following locationhttps://bitbucket.org/akindexedlogger/indexedlogger/wiki/HomePassionate to share knowledge, connect with people, help others where I can.Successfully provided 35 hours of free online training on big data in six weeks as instructor.http://khipustechlearning.blogspot.com/2017/08/free-big-data-hadoop-online-training.htmlPassionate about solving problems in the big data domain.Architected/ developed /delivered systems processing petabytes of data using big data & cloud technologies.Developer and founder of Windows Store App "Koya (Mailbox Analytics)". Know more about Koya on my blog.http://technicalpostsbyarifkhan.blogspot.com/2013/12/koya-mailbox-analytics.htmlAuthored a training manual on C programming, which is used as a training material for students across multiple locations of reputed training center in India.Blog posts1) Impact of technology on businesses http://technicalpostsbyarifkhan.blogspot.com/2013/12/impact-of-technology-on-businesses.html2) Sequential vs Random I/O and database systems http://technicalpostsbyarifkhan.blogspot.com/2013/12/random-vs-sequential-io-and-effects-on.html3)B+ tree data structure explainedhttp://technicalpostsbyarifkhan.blogspot.com/2014/01/b-tree-data-structure-explained.html4) Is ACIDIty killing relational database systemshttp://technicalpostsbyarifkhan.blogspot.com/2014/09/is-acidity-killing-relational-database.html5) Random I/O and key look-upshttp://technicalpostsbyarifkhan.blogspot.com/2013/12/influence-of-random-ios-on-key-look-ups.html6) Know more about Koya (Mailbox Analytics)http://technicalpostsbyarifkhan.blogspot.com/2013/12/koya-mailbox-analytics.html
-
Founder & Software Engineer (Zinzu)Zinzu Jul 2024 - PresentRedmond, Wa, UsBuilding an awesome SaaS analytics platform to find patterns in large volumes of datasets......follow the data sequence, uncover the story.....https://zinz.io Zinzu.ioConnecting the Dots in Your DataAt Zinzu, we transform scattered data points into insightful stories by uncovering patterns hidden within sequences. Our innovative platform empowers businesses to gain deeper insights with ease, making raw data accessible without complex queries. With seamless integration across AWS, GCP, and Azure, Zinzu standardizes disparate datasets into a common schema, treating each record as an event on a timeline.Whether it's tracking campaign performance, optimizing supply chains, or enhancing customer journeys, Zinzu's no-code platform simplifies complex data analysis through a user-friendly drag-and-drop interface. We go beyond traditional data retrieval methods, offering a more intuitive, flexible approach to understanding your data.Join us in revolutionizing how you connect the dots in your data.No vendor lock-in, pay as you go, and AI-powered natural language queries. Zinzu is here to complement your existing systems and fill the gaps in current analytics models.Check out these two short videos to learn more about Zinzu and its sequencing:https://www.youtube.com/@Zinzu-SequenceAnalytics -
Founder & Software Engineer (Tema)Zinzu May 2024 - PresentRedmond, Wa, UsReleased initial version of tema app (transforming agriculture with data.)https://play.google.com/store/apps/details?id=com.zinzu.tema -
Principal Engineering LeaderResonate May 2023 - Mar 2024Reston, Va, Us -
Staff Engineering Leader / Big Data ArchitectLiveramp Mar 2022 - Apr 2023San Francisco, Ca, Us● Tech lead for Segments delivery platform.● As a tech lead, spent time understanding legacy code running as Microservices onKubernetes (GK8) & hadoop jobs on GCP’s dataproc clusters, with minimum to no help,shared knowledge with the rest of the team and improved team's velocity.● Analyzed large volumes of service calls to identify cache inefficiency, worked with thepartner team to change calling patterns for efficient usage of pre-processed data.● Designed & led an effort with two other devs to implement auto routing methodology todifferent GCP's dataproc clusters. This enabled us to increase the number ofpreemptible nodes and reduced cost by 20% and improved sla’s by 40%.● Data is serialized as thrift objects in our large datasets, which need custom deserializers,troubleshooting was pain, designed and led an effort with another dev to create deserializer on large datasets and write output to Google’s bigquery db in a human readable format, this enabled us fast turnaround times on data quality issues.● Analyzed audience data sets on non-used segments, designed & implemented a self learning & auto adjusting service to cut unused data, further improving data filtering performance.● Worked with the SRE team to identify missing datadog alerts and improved monitoring by adding required alerts and reducing noisy ones.● Simplified zookeeper’s znode structure for coordination between microservices -
Open Source ContributionOpen Source Jan 2021 - Mar 2023Open source contribution (personal project during free time) Simply Index Logger (SIL):● Idea, initiation of the project, built a team of contributors, designed & developed v1 of the product● Code is open sourced & open for contribution from other developers● SIL is developed in JAVA.● Design & source code is available at the following locationhttps://bitbucket.org/akindexedlogger/indexedlogger/wiki/Home
-
Big Data ArchitectMicrosoft Apr 2017 - Mar 2022Redmond, Washington, Us● Led an effort to develop the export process to fulfill GDPR’s DSR. This was a cross team effort between onsite and offshore teams.● Analyzed data and implemented auto tagging process to separate out non-private & privacy data● Led cross team effort across the org to build a platform performing events sequencing using statemachine models in a most generic way. This was a complex effort involving Engineers, Data Scientists, PM’s and communicating to the leadership team. Successfully led, designed & delivered a scalable platform.● Led an effort across research, data scientists and different teams to evaluate various anomaly detection algorithms on windows telemetry data, successfully delivered cross group feature.● Worked closely with different teams in Windows org on a regular basis to identify gaps in events processing system, triage and prioritize feature requests with management & pm’s.● Mentoring new hires and actively participating in interviewing candidates.● Core member in the architectural review team in our org.● Designed / developed backend data processing system using Java MapReduce, Impala on azure& Cosmos for Anomaly detection framework.● Developed a prototype to re-use existing .net code on Linux using Mono and communicate withJVM using Apache Thrift. This was done to move processing to spark and also re-use most ofthe existing logic written in C#.● Worked with various teams in Office org to improve surveys performance.● Led a V team with data engineers & data scientists to analyze various sampling strategies toreduce data volume and successfully implemented sampling strategy for the org.● Designed data storage layer for experimentation system in COSMOS (internal map-reduce bigdata system) to enable backfilling and reduce storage (GDPR) guidelines.● Led cross team effort to implement GDPR policies on all data assets. -
Principal ArchitectPointinside Apr 2015 - Apr 2017● Architected Analytics platform for retail partners using Big Data and NoSql solutions on Amazon’s aws cloud.● Go to person and big data architect across the company.● Developed scalable solution to process data on aws EMR clusters and load processed data intoElasticSearch cluster to provide querying capabilities.● Built an automated solution to onboard new partners which includes various services (dns -dynect , sftp, setting up infrastructure on aws cloud and monitoring).● Worked as an architect to search team to reduce data load times to solr(designing a system toprovide incremental loads to solr with combination of emr processing)● Developed a prototype to build a cloud based self serving analytics platform to retail partners,which can be used by their in-house data stores / ui interfaces.● Designed and built a framework to streamline data loads to ElasticSearch cluster.● Performance tuned ElasticSearch indices and data load processes.
-
Senior Software Design EngineerMicrosoft Feb 2007 - Apr 2015Redmond, Washington, Us● Joined Bing Ads when the group was a startup, has worked on various iterations of ad campaign& editorial systems until Yahoo & Bing ad systems were merged.● Primary owner of ads editorial system, designed and developed a scalable platform to facilitatemanual verification of ads/keywords (serving editors from different locations).● Member of a team that successfully delivered scalable ads platform during crucial Yahoo andMicrosoft search engine merger under tight schedules.● Worked as a mentor and technical lead to offshore teams in India and China.● Travelled to Bangalore, India to provide training on Bing’s advertisement platform to newengineers who moved from Yahoo to Microsoft.● Architected a system to scale and distribute processing of windows app store telemetry datausing consistent hashing. Designed it in such a way , new servers can be added and accounts could be moved to new servers with no row copy of data.3● Designed and developed a caching solution to improve throughput of telemetry data processing by more than 40%, this started as a side project and later productionalized.● Successfully led an effort to make Windows Reliability Telemetry system stable and working with minimal intervention during crucial release of Windows 8.● Developed auto scalable windows service (similar to mapreduce) for XBOX supply chain to distribute processing of enterprise relational data across multiple servers and bulk transfer reduced data to persistent store, improved performance by more than 50%● Designed and developed editorial processing system for Bing ads, this was done using functional partitioning and queueing methodologies, improved performance by 60% and reduced cost of customer support by 20%.● Led a team of 3 developers to build B2B interface with external digital marketing provider “ExactTarget” to deliver emails targeted towards xbox users. -
Software EngineerConsult At Verizon, Dallas Tx Jan 2002 - Dec 2005
Arif Khan Skills
Frequently Asked Questions about Arif Khan
What company does Arif Khan work for?
Arif Khan works for Zinzu
What is Arif Khan's role at the current company?
Arif Khan's current role is Founder & Software Engineer.
What is Arif Khan's email address?
Arif Khan's email address is ar****@****hoo.com
What skills is Arif Khan known for?
Arif Khan has skills like Software Design, C#, Agile Methodologies, Sql, .net, Software Development, Software Engineering, Databases, Asp.net, Distributed Systems, Scrum, Visual Studio.
Free Chrome Extension
Find emails, phones & company data instantly
Aero Online
Your AI prospecting assistant
Select data to include:
0 records × $0.02 per record
Download 750 million emails and 100 million phone numbers
Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.
Start your free trial