Skip to content

Bitcoin accepted at checkout  |  Ships from Laval, QC, Canada  |  Expert support since 2016

Inverted File Index (IVF)

Sovereign AI

Definition

An inverted file index (IVF) is a partitioning scheme that makes vector search scale to billions of items. At build time, a coarse quantizer, usually k-means, groups all vectors into a fixed number of clusters or cells (the count is called nlist). Each cell holds an inverted list: the IDs of every vector assigned to that centroid. Rather than comparing a query against every vector, IVF compares it only against vectors in the handful of cells whose centroids are closest.

The nprobe trade-off

At search time you set nprobe, the number of nearest cells to actually scan. Probe one cell and the search is blazing fast but may miss neighbours that landed just across a cell boundary; probe more cells and recall climbs at the cost of latency. A common rule of thumb is to probe enough cells to cover a few percent of the dataset, which typically recovers high recall while still skipping the vast majority of vectors. Tuning nprobe is the main recall-versus-speed dial for a self-hosted index.

IVF plus product quantization

IVF only narrows which vectors to examine; it does not shrink them. In practice it is combined with compression: the popular IVF-PQ index uses IVF to pick candidate cells and product quantization to store the vectors inside those cells compactly. This pairing is how libraries like Faiss fit large corpora into modest RAM on hardware you own.

To go deeper on the compression half of that pairing, see our product quantization entry, and compare IVF's cell-based pruning with the graph-based approach described under vector quantization for choosing the right index for your own retrieval stack.

In Simple Terms

An inverted file index (IVF) is a partitioning scheme that makes vector search scale to billions of items. At build time, a coarse quantizer, usually…

Explore the Full Glossary

Browse all Bitcoin mining terms from A to Z. Whether you are a beginner or expert, deepen your understanding of the mining ecosystem.

Mining Glossary

ASIC Miner Database

Compare 500+ miners with real-time profitability data, home mining scores, and detailed specs.

Compare Miners