Skip to content

Whether can be in "datasets. GeneratorBasedBuilder. _generate_examples" load model in data processing. When using non-streaming load_dataset, whether the model can be removed to avoid VRAM occupation #6204

Closed Answered by mariosasko
aihao2000 asked this question in Q&A
Discussion options

You must be logged in to vote

Yes, feel free to do this. The only thing non-streaming HF datasets depend on are Arrow files that are memory-mapped when they are loaded (the .cache_files attribute returns them)

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by aihao2000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants