Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 9 Next »

Current state: Under Discussion

ISSUE: https://github.com/milvus-io/milvus/issues/9481

PRs: 

Keywords: load balance, handoff, query node, query coord, sealed segment

Released:

Summary

This article focuses on the importance of handoff and load balance segment features, as well as proposing a scenario for handoff and load balance segments without interrupting queries.

Motivation

The query node of milvus currently has the following issues:

  1. In the load collection/partition process, query nodes not only load the sealed segments, but also watch dmChannels so that they can receive vectors inserted in real time, and if new vectors are continuously inserted after the load collection operation, the query nodes will create a large number of growing segments, which can only service query requests in a near-brutal manner. At the same time, data node may have flushed these vectors into persistent storage and created the corresponding index.
  2. Query nodes may experience unbalanced sealed segments during dynamic expansion. This results in a relatively slow query for high-load query node, which will becomes the final bottleneck for queries.

So we want to do the following:

  1. handoff:After datanode has flushed segment to persistent storage, query node will load the segment's binlogs from the persistent storage and replaces the growing segment with sealed segment.
  2. load balance:Automatically balance all sealed segments on query nodes to avoid idle computing and storage resources,and speed up the overall query.

Public Interfaces

Handoff and load balance are transparent to users.

Design Details

Query node logically manages the sealed segments and the growing segments in two parts, historical and streaming. As shown in the following figure, G3 represents the growing segment 3, and S8 represents the sealed segment 8.


Based on the diagram above, it can be concluded that the handoff and load balance process consists of two main steps:

  1.  Load the sealed segment onto a query node
  2. Remove the corresponding sealed segment or growing segment from a query node

However, there are several scenarios to consider in the implementation:

  1. What if some query node associated with the handoff or load balance process is suddenly down
  2. If there is a sudden downtime on query coord, how to properly resume the handoff and load balance tasks after the restart
  3. If the same sealed segment exists on different query nodes at the same time, whether it will have an impact on the final query results
  4. Different query nodes may not process query messages at the same speed, and how to ensure that different query nodes have the same global sealed segmentIDs when processing the same query message. If not handled well, it may cause the query to time out

Based on the above considerations, we propose the following designs:

Prerequisite: The cache meta of querycoord records which sealed segment, growing segment, and dmchannels are on each querynode. These metas will be stored in etcd synchronously.

1.First of all, querycoord automatically generates handoff tasks and load balance tasks and writes them into etcd, and then clears them from etcd until the task is completed, ensuring that querycoord can accurately restore the task after restarting.

2. Next, query coord uses its own allocation strategy to determine which querynode the sealed segment of balance or handoff should be allocated to. The allocation strategy considers whether the total size of segments on each query node is balanced, and whether the segments of each collection are evenly distributed across all querynode

3.Querycoord directly loads the allocated sealed segments to the corresponding query node, and then the same segment will exist on different querynodes. We de-duplicate the query result based on the primary key during local reduce and global reduce to avoid affecting the query result.

4.After query coord successfully loads sealed segments on querynode, update the sealed segments list of each querynode in cache meta. For example, after balancing sealed segment 8 from node 1 node 2, the meta of node1 changes from {S6, S7,S8} to { S6, S7}, node2's meta is changed from {S5} to {S5, S8}. If the query node suddenly goes down, directly follow the meta records of the coord to recover the query node. While querycoord updates the meta, it broadcasts a change info of sealed segments to querychannel.


  • No labels