Definition
A feature store is a centralized repository for the engineered inputs — the features — that machine-learning models consume. It sits between raw data and the model, ensuring that a feature is defined once and then reused consistently across training and production. Its core purpose is to eliminate training-serving skew: the silent failure where a model is trained on features computed one way but scored on features computed slightly differently in production.
Online and offline halves
A feature store is typically a dual-database system. The offline store is a columnar, high-volume repository of historical feature values used for model training, batch inference, and feature discovery. The online store is a low-latency, row-oriented database that serves precomputed features to live applications in milliseconds. The same feature definition writes to both, so a fraud-detection or real-time personalization model gets identical logic whether it is being trained on last year's data or scoring a transaction happening right now.
Why it earns its keep
Beyond consistency, a feature store adds versioning, sharing, and lineage. Teams can discover and reuse features instead of re-deriving them, version a feature so old models stay reproducible, and trace exactly which data produced a given value. For self-hosted AI builders, that reproducibility is also a sovereignty win: features are documented and auditable rather than buried in throwaway scripts.
A feature store is fed by the upstream data pipeline / ETL and often draws its raw inputs from a data lake before features are computed and stored.
In Simple Terms
A feature store is a centralized repository for the engineered inputs — the features — that machine-learning models consume. It sits between raw data and…
