
How to change the PageParsingService in the default pipeline built with get_dd_analyzer #216

Closed Answered by alior101
alior101 asked this question in Q&A

Never mind, I found it myself. The settings can be overridden through the `config_overwrite` argument of `get_dd_analyzer`:

import deepdoctection as dd

pipe = dd.get_dd_analyzer(
    reset_config_file=True,
    config_overwrite=[
        "LANGUAGE='eng'",
        "TEXT_ORDERING.FLOATING_TEXT_BLOCK_CATEGORIES=['title', 'text', 'list', 'figure']",
        "TEXT_ORDERING.TEXT_BLOCK_CATEGORIES=['title', 'text', 'list', 'figure', 'cell', 'column_header', 'projected_row_header', 'spanning', 'row_header']",
    ],
)
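
For anyone who finds the hand-written override strings error-prone, here is a minimal sketch that builds the same list from a plain dict. `build_config_overwrite` is a hypothetical helper, not part of deepdoctection. It assumes each override is simply `KEY=<Python literal>`, as in the strings above. The final call is left commented out so the snippet runs without deepdoctection installed.

```python
def build_config_overwrite(settings):
    """Hypothetical helper: turn {"KEY": value} into ["KEY=<python literal>", ...]."""
    # repr() yields single-quoted strings and bracketed lists,
    # matching the format of the hand-written overrides.
    return [f"{key}={value!r}" for key, value in settings.items()]


# Category names copied from the answer above.
FLOATING = ["title", "text", "list", "figure"]
TABLE_PARTS = ["cell", "column_header", "projected_row_header", "spanning", "row_header"]

overrides = build_config_overwrite({
    "LANGUAGE": "eng",
    "TEXT_ORDERING.FLOATING_TEXT_BLOCK_CATEGORIES": FLOATING,
    "TEXT_ORDERING.TEXT_BLOCK_CATEGORIES": FLOATING + TABLE_PARTS,
})

for line in overrides:
    print(line)

# With deepdoctection installed, pass the list exactly as in the answer:
# import deepdoctection as dd
# pipe = dd.get_dd_analyzer(reset_config_file=True, config_overwrite=overrides)
```

Keeping the category lists as Python objects makes it harder to drop a quote or comma, and the text-block list is visibly the floating list plus the table-related categories.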

Answer selected by alior101