A CPU memory leak is observed during GPU inference, even when NativeOps is not used (i.e., with `libllm_sharp_ops.so` removed).

CPU memory keeps growing while inferencing, while the diagnostics from `torch.Tensor.TotalCount` and `torch.Tensor.PeakCount` stay stable across multiple chat turns. GPU memory is also stable, and no GPU memory leak is observed.

I profiled the program with valgrind's massif and memcheck tools; the logs give no obvious clue about where the leak comes from.
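For reference, a minimal sketch of the per-turn monitoring used to collect these numbers. `torch.Tensor.TotalCount` and `torch.Tensor.PeakCount` are TorchSharp's live-tensor diagnostics; the `RunOneChatTurn` call is a hypothetical stand-in for the actual inference entry point, and `WorkingSet64` is used here as a rough proxy for native CPU memory:

```csharp
using System;
using System.Diagnostics;
using TorchSharp;

static class MemoryProbe
{
    // Log managed tensor counts vs. process working set after each chat turn.
    // If TotalCount/PeakCount are flat but the working set keeps growing,
    // the leak is in native allocations, not in undisposed tensors.
    public static void Log(int turn)
    {
        long workingSetMiB = Process.GetCurrentProcess().WorkingSet64 / (1024 * 1024);
        Console.WriteLine(
            $"turn {turn}: live tensors={torch.Tensor.TotalCount}, " +
            $"peak tensors={torch.Tensor.PeakCount}, " +
            $"working set={workingSetMiB} MiB");
    }
}

// Hypothetical usage, assuming some RunOneChatTurn() inference call:
// for (int turn = 0; turn < 100; turn++)
// {
//     RunOneChatTurn();
//     MemoryProbe.Log(turn);
// }
```

With this probe, the tensor counts and the working set were observed to diverge, which is why the leak does not appear to come from undisposed managed tensors.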
massif.out.gz
vgdump.gz