You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey, @JaMe76 can we open this discussion about training hacks to achieve the best out of this pipeline?
As you mentioned your Layout detection model is somewhat the same as your open-source model but with training on the lesser dataset. Can we discuss which model will be better to train? Should we go with detectron2 or Sagment Anything or Yolo or cascade r-cnn or any pre-trained weights you recommend?
The table transformer is worthy of tuning as I can see it might be helpful to train grid tables.
OCR: doctr is the best OCR so far. as I am getting errors from base.py from requirements as I have already installed deepdoctection[pt]
Cell, Row & Column detection will work best with a table transformer I guess.
Training Layout detection can change the whole game for this pipeline.
Need inputs to keep in mind while going through training, Thank you
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hey, @JaMe76 can we open this discussion about training hacks to achieve the best out of this pipeline?
As you mentioned your Layout detection model is somewhat the same as your open-source model but with training on the lesser dataset. Can we discuss which model will be better to train? Should we go with detectron2 or Sagment Anything or Yolo or cascade r-cnn or any pre-trained weights you recommend?
The table transformer is worthy of tuning as I can see it might be helpful to train grid tables.
OCR: doctr is the best OCR so far. as I am getting errors from base.py from requirements as I have already installed deepdoctection[pt]
Cell, Row & Column detection will work best with a table transformer I guess.
Training Layout detection can change the whole game for this pipeline.
Need inputs to keep in mind while going through training, Thank you
Beta Was this translation helpful? Give feedback.
All reactions