
How to perform inference on large datasets? #17

Open
abdulfatir opened this issue Mar 18, 2024 · 0 comments
Labels
FAQ Frequently asked question

Comments

@abdulfatir
Contributor

abdulfatir commented Mar 18, 2024

Opening this as a FAQ.

The `pipeline.predict` interface accepts either a 1D/2D tensor or a list of tensors. If you want to perform inference on a large dataset, you can do either of the following:

  • Send batches of shape `[batch_size, context_length]` to the `predict` function in a loop over batches of your dataset. Note: if the time series don't all have the same length, you would need to pad them with `torch.nan` on the left.
  • (Easier) Send lists of tensors of length `batch_size` to the `predict` function in a loop over batches of your dataset. No padding is needed here; it is done internally.

If you run out of memory (OOM), decrease the `batch_size`.
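As a minimal sketch of the two options above (the pipeline construction and model name are illustrative assumptions; only `pipeline.predict` and its accepted inputs come from this issue):

```python
import torch


def left_pad(batch, context_length=None):
    """Left-pad a list of 1D tensors with torch.nan to a common length.

    This is only needed for the first option (a [batch_size, context_length]
    tensor); the list-of-tensors option pads internally.
    """
    if context_length is None:
        context_length = max(t.shape[-1] for t in batch)
    padded = torch.full((len(batch), context_length), torch.nan)
    for i, series in enumerate(batch):
        # Keep at most the last `context_length` observations,
        # placed at the right edge so the nan padding sits on the left.
        padded[i, -min(series.shape[-1], context_length):] = series[-context_length:]
    return padded


def batched(items, batch_size):
    """Yield successive slices of `items` of length at most `batch_size`."""
    for i in range(0, len(items), batch_size):
        yield items[i : i + batch_size]


# Hypothetical usage sketch -- model name and ChronosPipeline import are
# assumptions, not taken from this issue:
#
# from chronos import ChronosPipeline
# pipeline = ChronosPipeline.from_pretrained("amazon/chronos-t5-small")
#
# forecasts = []
# for batch in batched(all_series, batch_size=32):
#     # Option 2 (easier): pass the list of tensors directly.
#     forecasts.append(pipeline.predict(batch))
#     # Option 1: forecasts.append(pipeline.predict(left_pad(batch)))
```

If you hit OOM, shrinking `batch_size` in the loop above is the only change needed; the padding logic is unaffected.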

@abdulfatir abdulfatir added the FAQ Frequently asked question label Mar 18, 2024
@abdulfatir abdulfatir changed the title How to perform inference for large datasets? How to perform inference on large datasets? Mar 18, 2024
@lostella lostella pinned this issue Mar 18, 2024
@abdulfatir abdulfatir unpinned this issue Mar 26, 2024