Feature Store Resources
Overview
Main implementations:
- Amazon SageMaker Feature Store - announced in Nov. 2020.
- Google Cloud Vertex AI Feature Store.
- Tecton - three founders from Uber Michelangelo - $60M total - Series C.
- Logical Clocks - Hopsworks - €1.3M ($1.6M) - seed.
- Feast - open source project initiated by Google Cloud and Gojek. A key Gojek Feast developer joined Tecton on 2020-11-17.
- Databricks Feature Store.
- Molecula - $23.6M total - Series A.
- Zipline - Airbnb’s feature store - not (yet) open-sourced.
- Splice ML Manager - little documentation on its feature store feature. Company has folded as of 2021-08.
SageMaker Feature Store
Basic
AWS Documentation
Blog
News
Articles
Google Cloud
Tecton
Basic
About
Funding
Blog
Articles
Presentations
- Accelerating the ML Lifecycle with an Enterprise-Grade Feature Store - Atlassian - video - slides - DAIS 2020
Logical Clocks - Hopsworks
Basic
Hopsworks
About
Funding
- Crunchbase - Logical Clocks - €1.3M ($1.6M) - seed round - 2018-11-20
- Note: with only a seed round on record, this suggests revenue is sufficient to support a staff of 25.
Blog
RonDB
- TLDR
- RonDB is a stable distribution of MySQL NDB Cluster, a key-value store with SQL capabilities.
- RonDB is brought to you by the RonDB team at Logical Clocks AB and the development team at iClaustron AB.
- Blog
- https://github.com/logicalclocks/rondb
- Mikael Ronström - Developer of AXE VM, NDB Cluster, MySQL Cluster, MySQL InnoDB, Scaling MySQL Server, MySQL Partitioning, MySQL Threadpool and now RonDB
Articles and resources
Hopsworks and Databricks
Notes
- Runs on AWS, Azure, GCP and on-prem.
- Integrates with SageMaker, Databricks, Kubernetes and Cloudera.
- Has both offline and online feature stores.
- The offline store is powered by Hive and Hudi.
- In conversations with Databricks and Matei.
- Interested in Delta, but waiting for OSS Delta to become more like Databricks Delta (e.g. z-order).
- Has only 25 people, so this is a question of priorities.
- Hoping to get there by the end of the year (source dated Sep. 2020, so presumably 2020).
- Does not use MLflow because it has its own experiment tracking system (with projects).
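The offline/online split mentioned in the notes above can be illustrated with a small sketch. This is a toy in-memory model, not the Hopsworks API: the names `FeatureRow`, `ToyFeatureStore`, `get_online`, and `get_as_of` are hypothetical, and the Hive/Hudi storage layer is replaced by a plain Python list. It only shows the general idea that an offline store keeps the full history for building training sets, while an online store keeps the latest value per entity for low-latency lookups.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Dict, List, Optional


@dataclass(frozen=True)
class FeatureRow:
    """One observation of an entity's features (hypothetical schema)."""
    entity_id: str
    event_time: datetime
    values: Dict[str, float]


class ToyFeatureStore:
    """Illustrative only: a list stands in for the offline store,
    a dict keyed by entity stands in for the online store."""

    def __init__(self) -> None:
        self.offline: List[FeatureRow] = []      # full history, append-only
        self.online: Dict[str, FeatureRow] = {}  # latest row per entity

    def ingest(self, row: FeatureRow) -> None:
        # Every row is kept in the offline history.
        self.offline.append(row)
        # The online store only keeps the most recent row per entity.
        current = self.online.get(row.entity_id)
        if current is None or row.event_time >= current.event_time:
            self.online[row.entity_id] = row

    def get_online(self, entity_id: str) -> Optional[Dict[str, float]]:
        """Serving-time lookup: latest known feature values."""
        row = self.online.get(entity_id)
        return dict(row.values) if row else None

    def get_as_of(self, entity_id: str, as_of: datetime) -> Optional[Dict[str, float]]:
        """Training-time lookup: values as they were known at `as_of`."""
        candidates = [
            r for r in self.offline
            if r.entity_id == entity_id and r.event_time <= as_of
        ]
        if not candidates:
            return None
        return dict(max(candidates, key=lambda r: r.event_time).values)


store = ToyFeatureStore()
store.ingest(FeatureRow("user_1", datetime(2020, 9, 1), {"trips_7d": 3.0}))
store.ingest(FeatureRow("user_1", datetime(2020, 9, 8), {"trips_7d": 5.0}))

print(store.get_online("user_1"))
print(store.get_as_of("user_1", datetime(2020, 9, 5)))
```

The as-of lookup is what keeps later feature values from leaking into training data. A real offline store would do this with time-travel or point-in-time joins rather than a linear scan.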
Molecula
Basic
Funding
- Crunchbase - Molecula - total $23.6M
- $17.6m - Series A - 2021-01-13
- $6m - Seed - 2021-08-22 - The Seraph Group, Lontra Ventures, Velar Capital, Capital Factory, Andrew Busey and Jason Dorsey
- Articles
Product
- https://www.molecula.com/products
- Molecula Enterprise Feature Store - "Molecula provides centralized access to all your big data by reducing the dimensionality of the original source data, into a highly-optimized format that is natively predisposed for real-time machine-scale analytics and AI"
- A New Paradigm to Data Access - Datasheet
Articles
Rasgo
Basic
Founders
- Jared Parker - CEO - director of sales at Domino Data Lab - LinkedIn
- Patrick Dougherty - CTO - LinkedIn
Funding
- Crunchbase - Rasgo - total $20M
- $20m - Series A - 2021-06-24 - Insight Partners and Unusual Ventures.
- $5.1M - Seed - 2020-07 - Unusual Ventures
- Articles
Kaskada
Basic
Articles
Funding
Pinecone
- https://www.pinecone.io
- Vector database for machine learning
- "We are engineers who built large machine learning platforms, databases, and search engines at AWS (SageMaker), Facebook, Yahoo, and Google"
- Team
- Edo Liberty - CEO
- Amir Sadoughi - Head of Engineering
- Lior Ehrenfeld - COO
- Greg Kogan - VP Marketing
- Funding
Alteryx Featuretools
Feast
Databricks Feature Store
Zipline
Featurestore.org
Splice Machine
Articles
Misc