Recommended Train input image size #12149
@GreatV
Q1. EastRandomCropData seems to be an augmentation step. I would like to know the exact input image size of the detection model. Is there a way to check it?
Q2. With DetResizeForTest, how is an image changed when it is larger or smaller than the fixed size? If the image is smaller than the fixed size, is the remaining space padded?
Q3. I would also like to know how images are processed in the recognition and classification models. How are they handled during training when they are smaller or larger than the fixed size?
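To make Q2 concrete, the two behaviors I can imagine are sketched below in plain Python. This is only an illustration of the question, not PaddleOCR's actual code: the function names `scale_to_fit` and `pad_to_fixed` and the target of 640 are my own assumptions.

```python
def scale_to_fit(h, w, target=640):
    """Hypothetical option A: resize so the longer side equals `target`,
    keeping the aspect ratio (small images are enlarged, large ones shrunk)."""
    ratio = target / max(h, w)
    return max(1, round(h * ratio)), max(1, round(w * ratio))


def pad_to_fixed(h, w, target=640):
    """Hypothetical option B: shrink only if the image is too large,
    then pad the bottom and right edges up to target x target."""
    if max(h, w) > target:
        h, w = scale_to_fit(h, w, target)
    padding = (target - h, target - w)  # (pad_bottom, pad_right)
    return (target, target), padding


# Compare both options for one small and one large image (height, width).
for h, w in [(320, 480), (1280, 960)]:
    print((h, w), "scaled:", scale_to_fit(h, w), "padded:", pad_to_fixed(h, w))
```

Which of these two (if either) matches what happens at evaluation time is exactly what I am unsure about.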
Hello.
I am training and fine-tuning the models below.
It may be necessary to adjust the size of the image for better performance.
I would like to know about the training input image size for detection, recognition, and classification.
From what I researched, it seems that the input image size in the yml file is as follows.
- Detection
Q1. What is the function of 'EastRandomCropData' under Train in detection? Since the maximum image size is 640x640, is anything exceeding that size cropped? And what happens when the image is smaller than 640x640?
Q2. What does 'DetResizeForTest' under Eval do? Is there a reason its size differs from the training image size?
- Recognition
Q3. What does each mean in image_shape? 3: Channel, 48: Height, 320: Width. Is that right?
- Classification
Q4. 3: Channel, 48: Height, 192: Width. Is that right?
Q5. Is it possible to change the size of each input image? I want to train words, not sentences.
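To show what I mean in Q3 to Q5, here is a small Python sketch of how I am reading `image_shape`. The channel, height, width order is my assumption from the yml files, `describe_shape` is a hypothetical helper, and the narrower width of 160 for single words is only an example value, not a recommended setting.

```python
def describe_shape(image_shape):
    """Unpack an image_shape list, assuming [channels, height, width] order."""
    channels, height, width = image_shape
    return {
        "channels": channels,
        "height": height,
        "width": width,
        "aspect_ratio": width / height,
    }


rec_shape = [3, 48, 320]   # recognition value as I read it in the yml file
cls_shape = [3, 48, 192]   # classification value as I read it in the yml file
word_shape = [3, 48, 160]  # hypothetical narrower width for single words

for name, shape in [("rec", rec_shape), ("cls", cls_shape), ("word", word_shape)]:
    print(name, describe_shape(shape))
```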
I would appreciate your reply.