Mehul Shah

Mehul Shah Email and Phone Number

CEO and co-founder @ Aryn
Saratoga, CA, US
About Mehul Shah

In the past decade or more, two technology trends have intersected: the cloud, with its abundance of on-demand, computing resources, and the ubiquity of data. This makes it cheap to learn from data and makes previously intractable problems feasible. In my work, I have leveraged this to build more efficient, smarter, and easier to use cloud data systems.I'm currently focusing on the most important things in life - family and pursuing my passions in cloud and data.At Google, I was VP of Engineering for Streams and Lakes - the data integration, streaming, and open source analytics services on Google Cloud. At AWS, I ran Search Services which includes Amazon OpenSearch/Elasticsearch Service, the OpenSearch project, Open Distro, and Amazon CloudSearch. I also launched and ran two fast-growing cloud services, AWS Lake Formation and AWS Glue, and managed engineering teams in Amazon Redshift.Prior to Amazon, I was co-founder and CEO of Amiato (2011-2014), a managed ETL service in the cloud (acquired by Amazon). From 2004-2011, I was a principal scientist at HP Labs where my work spanned large-scale data management, distributed systems, and energy-efficient computing. This work has been published in top-tier database and systems conferences and has won several awards. Prior to HP, I received my PhD from U.C. Berkeley (2004) for adding parallelism, fault-tolerance, and load-balancing to the TelegraphCQ data-stream processing system. In 1999, I worked on the IBM DB2/UDB database. I received an MEng in 1997 and BS in Computer Science and Physics in 1996, all from MIT. In my spare time, I serve on the Sort Benchmark committee.

Mehul Shah's Current Company Details
Aryn

Aryn

View
CEO and co-founder
Saratoga, CA, US
Website:
aryn.ai
Mehul Shah Work Experience Details
  • Aryn
    Ceo And Co-Founder
    Aryn
    Saratoga, Ca, Us
  • Aryn
    Ceo And Co-Founder
    Aryn Aug 2022 - Present
    Mountain View, Ca, Us
  • Google
    Vp, Engineering, Streams And Lakes, Google Cloud Analytics
    Google Jan 2022 - Jun 2022
    Mountain View, Ca, Us
    I ran engineering for "Streams and Lakes" - the data integration, streaming, and open source analytics services on Google Cloud. These include Dataproc, Dataflow, PubSub, Dataplex, DPMS, Data Catalog, Composer, Data Fusion, and more. Customers use these "first mile" services to move, prepare, organize, and analyze their data in data lakes and data warehouses.
  • Amazon Web Services (Aws)
    Director, Gm, Opensearch, Amazon Cloudsearch Service, Aws Lake Formation
    Amazon Web Services (Aws) 2020 - Jan 2022
    Seattle, Wa, Us
    I ran the Amazon OpenSearch Service (successor to Amazon Elasticsearch Service), the OpenSearch project, Amazon CloudSearch Service, and AWS Lake Formation. I was responsible for go-to-market, product, engineering, and operations. I helped set strategy and oversaw the fork of OpenSearch from Elasticsearch.I was part of analytics at AWS. We were re-architecting big data systems for the cloud, where resources are plentiful, data is abundant, and everything is a service. I had a chance to put my crazy ideas into practice and deliver them to thousands of customers.
  • Amazon Web Services (Aws)
    Director, General Manager, Aws Lake Formation And Aws Glue
    Amazon Web Services (Aws) 2015 - Oct 2020
    Seattle, Wa, Us
    I launched and ran engineering, operations, and go to market for two fast-growing cloud services: AWS Lake Formation and AWS Glue. We are a multi-disciplinary team of distributed system and database architects, front-end and UX specialists, and ML experts.AWS Lake Formation is a new service that makes it easy to setup, secure, and manage Data Lakes. Data Lakes -- the evolution of enterprise data warehouses -- are curated repositories of structured and unstructured data that allow self-serve analytics for modern use-cases: IoT, ML model training, data science, and more. AWS Lake Formation leverages cutting-edge ML techniques for data ingestion and cleaning, and simplifies data security and governance. We envision it as the locus of control for all data in an enterprise. AWS Glue is a serverless data integration service which powers AWS Lake Formation. AWS Glue offers a centralized metadata service -- Data Catalog -- with crawlers that automatically extract and index metadata to enable data discovery. It also provides an ETL (extract-transform-and-load) engine that automates much of the undifferentiated heavy-lifting for moving and transforming data sets. At its core, it uses the Schema-lift technology from Amiato that makes it easy to handle modern semi-structured data sets.
  • Amazon Web Services (Aws)
    Sr. Manager, Amazon Redshift
    Amazon Web Services (Aws) May 2014 - Sep 2015
    Seattle, Wa, Us
    I led the development and public launch of two headlining features in Amazon Redshift: interleaved sort keys (Z-indexing) and user-defined-functions (UDFs). Interleaved sort keys are an alternative to indexing and projections for columnar databases that allow fast search on tables across multiple dimensions. UDFs allow users to customize Redshift analyses in Python for modern big-data use-cases.
  • Uc Berkeley Electrical Engineering & Computer Sciences (Eecs)
    Visiting Faculty
    Uc Berkeley Electrical Engineering & Computer Sciences (Eecs) Jan 2018 - Jun 2018
    Berkeley, Ca, Us
    In the Spring 2018 semester, I taught the Introduction to Database Systems course, an upper-division course for juniors and seniors in the EECS department. The course had 450+ students enrolled and a staff of 10 TAs.
  • Amiato
    Ceo And Co-Founder
    Amiato Sep 2011 - May 2014
    Amiato was a fully managed, real-time ETL cloud service. It bridged the gap between unstructured data and the world of structured business intelligence (BI) tools. Schema-lift was the technology powering Amiato. Schema-lift automatically infers the structure of semi-structured data (e.g. JSON logs), transforms it into tables, and loads data warehouses.As CEO and co-founder, my responsibilities spanned: fund-raising, recruiting, sales, go-to-market, and managing customer and partner relationships. I led the M&A process for our successful acquisition by Amazon. I raised our funding which included notable investors: Andreessen-Horowitz, Ignition, and YC. I also helped build early prototypes and design Schema-lift.
  • Hewlett-Packard Laboratories
    Principal Research Scientist
    Hewlett-Packard Laboratories Sep 2004 - Sep 2011
    Houston, Texas, Us
    My work at HP spanned large-scale data management, distributed computing and energy-efficiency. Below I list my most significant projects in reverse chronological order.Armonia: Principal investigator for Armonia project -- a scalable, distributed, main-memory data management platform that offers strongly consistent low-latency operations and complex on-the-fly analytics. Applications include financial trading and social networking.HP-KVS: Built a highly available, low-cost key-value service for the cloud. HP-KVS is an eventually consistent, erasure-coded, large object store that spans multiple geographies.Sinfonia: A highly scalable, distributed transactional store for building data-center infrastructure applications. Built a large-scale distributed B-tree, clustered file-system, and group communication using Sinfonia. Basis for the Armonia project. Won best paper, SOSP 2007.Energy-efficient systems: Characterized and optimized the energy use of computer systems as a whole, from storage to memory to compute. Inventor and maintainer of the JouleSort bechmark, the first holistic energy efficiency benchmark, which has inspired efficient server designs and influenced other benchmarks. Investigated energy efficiency of DB workloads.Other work: Designed software and hardware for non-volatile RAM technologies like NAND Flash and Memristors. Developed methods for long-term preservation of digital information.
  • University Of California, Berkeley
    Graduate Student
    University Of California, Berkeley 1997 - 2004
    Berkeley, Ca, Us
    Thesis: “Flux: A Mechanism for Building Highly-Available, Fault-Tolerant, Scalable Dataflows”: In the TelegraphCQ system, my dissertation focused on making parallel CQ dataflows – computations that analyze high-throughput streaming data in real time – highly available, fault-tolerant, and automatically load-balancing.Continuously Adaptive Continuous Queries (CACQ): Developed an adaptive query processing system that executes numerous long-running queries simultaneously over streaming data.AMDB: A debugger and profiler for search indexes on non-traditional data types like audio and images. Designed UI for navigating high-fanout search trees. (Released open-source).
  • Ibm Almaden Research Center
    Research Intern
    Ibm Almaden Research Center Jan 1999 - Oct 1999
    Armonk, New York, Ny, Us
    Investigated alternative strategies for implementing collection types in IBM DB2/UDB. Designed language extensions for querying collection types. Gained experience with administration and software development in DB2/UDB.
  • At&T Labs, Inc.
    Intern
    At&T Labs, Inc. Jun 1996 - Jan 1997
    Dallas, Tx, Us
    MEng Thesis (jointly done at MIT): "ReferralWeb: A Resource Location System Guided by Personal Relations." The first system to automatically discover and extract social networks by mining publicly available data on the web. ReferralWeb also automatically finds experts on user-specified topics and recommends paths in the extracted social graph to connect users with those experts.
  • Bell Laboratories
    Intern
    Bell Laboratories Jun 1995 - Aug 1995
    Murray Hill, Nj, Us
    Built a prototype of a content-based image search and retrieval system.
  • Bell Laboratories
    Intern
    Bell Laboratories Jun 1994 - Aug 1994
    Murray Hill, Nj, Us
    Built tools to integrate simulated navigation systems with context-relevant websites.

Mehul Shah Skills

Distributed Systems Databases C++ Python C Cloud Computing Java Data Management Postgresql Analytics Software Development Scalability Computer Science Programming Algorithms Linux Db2

Mehul Shah Education Details

  • University Of California, Berkeley
    University Of California, Berkeley
    Computer Science (Databases)
  • Massachusetts Institute Of Technology
    Massachusetts Institute Of Technology
    Electrical Engineering And Computer Science
  • Massachusetts Institute Of Technology
    Massachusetts Institute Of Technology
    Computer Science
  • Montgomery Blair High School
    Montgomery Blair High School

Frequently Asked Questions about Mehul Shah

What company does Mehul Shah work for?

Mehul Shah works for Aryn

What is Mehul Shah's role at the current company?

Mehul Shah's current role is CEO and co-founder.

What is Mehul Shah's email address?

Mehul Shah's email address is ma****@****ail.com

What is Mehul Shah's direct phone number?

Mehul Shah's direct phone number is +151068*****

What schools did Mehul Shah attend?

Mehul Shah attended University Of California, Berkeley, Massachusetts Institute Of Technology, Massachusetts Institute Of Technology, Montgomery Blair High School.

What are some of Mehul Shah's interests?

Mehul Shah has interest in Kids, Cooking, Exercise, Investing, Traveling, Outdoors, Electronics, Home Improvement, Reading, Music.

What skills is Mehul Shah known for?

Mehul Shah has skills like Distributed Systems, Databases, C++, Python, C, Cloud Computing, Java, Data Management, Postgresql, Analytics, Software Development, Scalability.

Free Chrome Extension

Find emails, phones & company data instantly

Find verified emails from LinkedIn profiles
Get direct phone numbers & mobile contacts
Access company data & employee information
Works directly on LinkedIn - no copy/paste needed
Get Chrome Extension - Free

Aero Online

Your AI prospecting assistant

Download 750 million emails and 100 million phone numbers

Access emails and phone numbers of over 750 million business users. Instantly download verified profiles using 20+ filters, including location, job title, company, function, and industry.