Failed to reload the collections after restarting the milvus cluster. #33135
-
@yhmo @xiaofan-luan Hello! I was continually inserting data in batches when the Milvus cluster suddenly crashed. I checked the k8s events and the pod logs from that time; they report that some nodes have disk pressure. I am sure there is enough disk space, so I guess the disk pressure comes mainly from a high disk I/O rate, possibly due to the large volume of insert logs and index files being persisted, plus the segment loading operations? The weird thing is that if I restart the Milvus cluster (with helm), it crashes again; I have tried 5 times and it always ends up crashing. Finally, I tried releasing these collections as soon as the restarted Milvus came up. Then I created a new collection so that I could insert the data again into it. I have some questions about this.
I hope you can give me some advice :) Thank you all so much!
Replies: 4 comments 4 replies
-
Milvus does not limit the maximum number of entities in a collection. The loading process uses multiple threads to load segments, so the disk read pressure might be high.
If a querynode has 2 CPU cores, there might be 10 * 2 = 20 threads reading files in parallel. Shards define the number of "data-channels" of a collection. Each data-channel consumes data from pulsar/kafka to generate segments. Segments that come from the same data-channel can be merged; segments that come from different data-channels cannot. By default, each segment is 512MB to 1024MB, so the number of segments depends mainly on how much data is inserted and only slightly on the number of shards. Compaction has its own machinery to manage tasks. I don't think there is much difference between shards=1 and shards=8.
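To make the rough numbers in this reply concrete, here is a small back-of-the-envelope sketch. It is not Milvus code: the function names are hypothetical, the factor of 10 threads per core is simply the figure quoted above, and the segment estimate ignores shards and compaction, which the reply says have only a small effect.

```python
import math

# Multiplier quoted in the reply above ("10 * 2 threads" for 2 cores).
# Treated here as an assumption, not a verified Milvus constant.
THREADS_PER_CORE = 10


def estimate_load_threads(cpu_cores: int) -> int:
    """Rough upper bound on parallel file-reading threads for one querynode."""
    return THREADS_PER_CORE * cpu_cores


def estimate_segment_count(total_data_mb: float, segment_size_mb: float = 512.0) -> int:
    """Rough number of sealed segments for a given amount of inserted data.

    Ignores shards and compaction; segment_size_mb should be somewhere in the
    512 to 1024 range mentioned in the reply.
    """
    if total_data_mb <= 0:
        return 0
    return math.ceil(total_data_mb / segment_size_mb)


if __name__ == "__main__":
    print("load threads (2 cores):", estimate_load_threads(2))
    # 100 GB of data, at the small and large ends of the default segment size
    print("segments @ 512MB:", estimate_segment_count(100 * 1024, 512))
    print("segments @ 1024MB:", estimate_segment_count(100 * 1024, 1024))
```

The point of the estimate is only that a large collection means hundreds of segment files, and reloading them after a restart with tens of reader threads per querynode can plausibly saturate disk I/O even when free space is plentiful.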
-
Generally, the Milvus config looks good to me.
-
What disk and index are you using?
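The problem is this: with an IVFPQ index, there is only the quantized data and no raw data.
So Milvus loads a copy of the raw data into memory, and the default directory is /var.
If you don't need the raw data, you can simply disable warmup; the way to do that has already been given.
If you need to query the raw data, you need to configure the directory correctly.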