Presentation
Are Streaming Engines and Vector Databases Integrated Well?
DescriptionRecently, streaming processing engines (SPEs) support plugin-based connector to streamline online indexing and online query serving. Users can seamlessly integrate the online indexing/serving stack with a few code modifications. SPEs transform in-flight massive datasets to high-dimensional vector embeddings and delegates to store the embeddings on the vector database. In addition, it queries to the vector database to find similar data. However, loose coupling of the streaming engine and vector database does not recognize the internal operations of each engine, which can lead to performance bottlenecks in online data indexing and query serving scenarios. Through a preliminary experiment, we observed a high tail latency of query serving when data indexing is overlapped. Based on the results, we suggest the potential performance bottleneck cases that cause a high tail latency of query serving and present a future work that can mitigate this problem.