Retrieval-augmented generation (RAG) is a technique that enables large language models (LLMs) to retrieve and incorporate new information. With RAG, LLMs do not respond to user queries until they refer to a specified set of documents. (Source: Wikipedia)
A new guide shows how real-time retrieval from vector stores reduces hallucinations and keeps LLM knowledge fresh.
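To make the retrieval step concrete, here is a minimal sketch in Python. It is not the guide's implementation. The names `embed`, `ToyVectorStore`, and `build_prompt` are hypothetical, and the bag-of-words "embedding" is a stand-in assumption for a real embedding model and vector database. The sketch only illustrates the flow: embed the query, rank stored documents by cosine similarity, and place the best match in the prompt before the model answers.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding (assumption): lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToyVectorStore:
    """Hypothetical in-memory store holding (document, vector) pairs."""

    def __init__(self):
        self.items = []

    def add(self, doc):
        # Documents can be added at any time, so later queries see them.
        self.items.append((doc, embed(doc)))

    def search(self, query, k=1):
        # Rank every stored document by similarity to the query vector.
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [doc for doc, _ in ranked[:k]]


def build_prompt(query, store, k=1):
    """Prepend the retrieved context so the model answers from it."""
    context = "\n".join(store.search(query, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


store = ToyVectorStore()
store.add("the office closes at 6 pm on fridays")
store.add("the cafeteria serves lunch from noon")

prompt = build_prompt("when does the office close", store)
print(prompt)
```

The resulting `prompt` would then be sent to an LLM of your choice. Because the store is queried at request time, adding or replacing documents changes what the model sees without retraining it.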