Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

alluxio fs distributedLoad loading failure causes the fluid cache to be unusable #18548

Open
yangcheng-dev opened this issue Mar 15, 2024 · 5 comments
Labels
type-bug This issue is about a bug

Comments

@yangcheng-dev
Copy link

yangcheng-dev commented Mar 15, 2024

Alluxio Version:
alluxio-dev:2.9.0

Describe the bug
I use alluxioruntime cache through fluid to connect to a web page file webufs service. The specific error scenario is that when dataloading a large file (10g), an error will be reported, and then the file will stop pulling. If the file is smaller, the probability of this problem is much smaller. Please help analyze it!
I would like to ask everyone in the Alluxio community whether there is any maximum parameter limit here. The bottom layer of Fluid's DataLoad uses the alluxio fs distributedLoad command.
To Reproduce
Steps to reproduce the behavior (as minimally and precisely as possible)

  1. use fluid create a dataset and alluxioruntime
  2. For data preheating, use dataload.The bottom layer of Fluid's DataLoad uses the "alluxio fs distributedLoad" command.
  3. Fluid's dataLoad interrupt

Expected behavior
dataLoad successed
Urgency
DINGTALK_IM_540086843
DINGTALK_IM_729206639
DINGTALK_IM_3748145450

Are you planning to fix it
Please indicate if you are already working on a PR.

Additional context
Add any other context about the problem here.

@yangcheng-dev yangcheng-dev added the type-bug This issue is about a bug label Mar 15, 2024
@yangcheng-dev
Copy link
Author

yangcheng-dev commented Mar 19, 2024

@jasondrogba
@YichuanSun
please help!

@jasondrogba
Copy link
Contributor

jasondrogba commented Mar 19, 2024

you can take a look on #16094 and #16096
you can try to increase MaxDirectMemorySize.

export ALLUXIO_FUSE_JAVA_OPTS="-XX:MaxDirectMemorySize=128m"

@yangcheng-dev
Copy link
Author

you can take a look on #16094 and #16096 you can try to increase MaxDirectMemorySize.

export ALLUXIO_FUSE_JAVA_OPTS="-XX:MaxDirectMemorySize=128m"

Thank you for your help, but I have configured the value to 32g. The problem still persists, similar to the issue below
#11611

@jasondrogba
Copy link
Contributor

Maybe it is a problem with fluid #11611 (comment)

@yangcheng-dev
Copy link
Author

Maybe it is a problem with fluid #11611 (comment)

Yes, but the problem has not been truly resolved so far.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type-bug This issue is about a bug
Projects
None yet
Development

No branches or pull requests

2 participants