Today's guest is Yujian Tang from Zilliz, one of the big players in the vector database market. This is the first episode in a series of episodes we’re doing on vectors and vector databases. We start with the basics, what is a vector? What are vector embeddings? How does vector search work? And why the heck do I even need a vector database?
RAG models for customizing LLMs is where vector databases are getting a lot of their use. On the surface, it seems pretty simple, but in reality, there's a lot of tinkering that goes into taking RAG to production.
Yujian explains some of the tripwires that you might run into and how to think through those problems. We think you're going to really enjoy this episode.
Timestamps
02:08 Introduction
03:16 What is a Vector?
07:01 How does Vector Search work?
14:08 Why need a Vector database?
15:11 Use Cases
17:37 What is RAG?
20:34 RAG vs fine-tuning
29:51 Measuring Performance
32:32 Is RAG here to stay?
35:43 Milvus
37:17 History of Milvus
47:44 Rapid Fire
X