Feature Store Resources
Overview
Main implementations:
- Amazon SageMaker Feature Store - announced at AWS re:Invent in Dec. 2020.
 
- Google Cloud Vertex AI Feature Store.
 
- Tecton - Three founders from Uber Michelangelo - $60m total - Series C.
 
- Logical Clocks - Hopsworks - €1.3m ($1.6m) - seed.
 
- Feast - open source project initiated by Google Cloud and Gojek. A key Gojek Feast developer joined Tecton on 2020-11-17.
 
- Databricks Feature Store.
 
- Molecula - $23.6M total - Series A.
 
- Zipline - Airbnb’s feature store - not (yet) open-sourced.
 
- Splice ML Manager - little documentation on its feature store feature. Company has folded as of 2021-08.
 
SageMaker Feature Store
Basic
AWS Documentation
Blog
News
Articles
Google Cloud
Tecton
Basic
About
Funding
Blog
Articles
Presentations
- Accelerating the ML Lifecycle with an Enterprise-Grade Feature Store - Atlassian - video - slides - DAIS 2020
 
Logical Clocks - Hopsworks
Basic
Hopsworks
About
Funding
- Crunchbase - Logical Clocks - €1.3M ($1.6m) - seed round - 2018-11-20
 
- Note: this implies they are profitable enough to support a staff of 25.
 
Blog
RonDB
- TLDR
- RonDB is a stable distribution of MySQL NDB Cluster, a key-value store with SQL capabilities. 
 
- RonDB is brought to you by the RonDB team at Logical Clocks AB and the development team at iClaustron AB.
 
 
- Blog
 
- https://github.com/logicalclocks/rondb 
 
- Mikael Ronström - Developer of AXE VM, NDB Cluster, MySQL Cluster, MySQL InnoDB, Scaling MySQL Server, MySQL Partitioning, MySQL Threadpool and now RonDB
 
Articles and resources
Hopsworks and Databricks
Notes
- Runs on AWS, Azure, GCP and on-prem.
 
- Integrates with SageMaker, Databricks, Kubernetes and Cloudera.
 
- Have offline and online feature stores.
 
- Offline is powered by Hive and Hudi.
 
- In conversations with Databricks and Matei.
 
- Interested in Delta, but waiting for OSS Delta to become more like Databricks Delta (e.g. z-order).
 
- Have only 25 people, question of priorities.
 
- Hoping to get to it by the end of the year (source dated Sep. 2020, so presumably 2020).
 
- Don’t use MLflow since they have their own experiment tracking system - with projects.
 
Molecula
Basic
Funding
- Crunchbase - Molecula - total $23.6M 
- $17.6m - Series A - 2021-01-13
 
- $6m - Seed - 2021-08-22 - The Seraph Group, Lontra Ventures, Velar Capital, Capital Factory, Andrew Busey and Jason Dorsey
 
 
- Articles
 
Product
- https://www.molecula.com/products
- Molecula Enterprise Feature Store - Molecula provides centralized access to all your big data by reducing the dimensionality of the original source data, into a highly-optimized format that is natively predisposed for real-time machine-scale analytics and AI
 
 
- A New Paradigm to Data Access - Datasheet
 
Articles
Rasgo
Basic
Founders
- Jared Parker - CEO - director of sales at Domino Data Lab - LinkedIn
 
- Patrick Dougherty - CTO - LinkedIn
 
Funding
- Crunchbase - Rasgo - total $25.1M
- $20m - Series A - 2021-06-24 - Insight Partners and Unusual Ventures.
 
- $5.1M - Seed - 2020-07 - Unusual Ventures
 
 
- Articles
 
Kaskada
Basic
Articles
Funding
Pinecone
- https://www.pinecone.io
- Vector database for machine learning
 
- We are engineers who built large machine learning platforms, databases, and search engines at AWS (SageMaker), Facebook, Yahoo, and Google
 
 
- Team
- Edo Liberty - CEO
 
- Amir Sadoughi - Head of Engineering
 
- Lior Ehrenfeld - COO
 
- Greg Kogan - VP Marketing
 
 
- Funding
 
Alteryx Featuretools
Feast
Databricks Feature Store
Zipline
Featurestore.org
Splice Machine
Articles
Misc