FAISS is an open-source library for similarity search and clustering of vectors. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. From Wikipedia
New guides detail fully local setups that keep enterprise data on‑device, reducing API costs.