Programmatically steer smart mode detection #181

Answered by j-mendez
blinjrm asked this question in Q&A

When crawling with smart mode detection, is there a way to tell the crawler when to use JS rendering and when to use HTTP only?

For example when crawling an e-commerce website, JS rendering may be needed to navigate catalog pages, but HTTP only can suffice to extract data from product pages, which represent >90% of pages.

Right now smart mode is fully automatic. It only loads the JS when it is needed to render the markup (for example, when the page performs a certain DOM mutation or sets up a JS framework). This is hard to handle programmatically, since there's no way to control the input beforehand.

If smart mode needs more flexibility, use the chrome flag with crawl_raw to perform HTTP crawli…
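Since smart mode makes this decision automatically, one caller-side workaround is to classify URLs yourself and send each group to a different crawl. The sketch below is only an illustration of that routing step, using the standard library alone. `FetchMode`, `pick_mode`, `split_by_mode`, and the path prefixes are hypothetical names invented for this example; none of them are part of the crawler's API, and the rule "catalog pages need JS, product pages do not" is an assumption taken from the question.

```rust
// Hypothetical routing helper. NOT part of the crawler's API: it only
// decides, per URL, which crawl strategy a caller might want to use.

/// How a page should be fetched.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum FetchMode {
    /// Plain HTTP request, no browser.
    Http,
    /// Headless Chrome rendering, for pages that need JS.
    Chrome,
}

/// Returns the path portion of a URL (no scheme, host, query, or fragment).
/// A URL without a path yields "/".
fn url_path(url: &str) -> &str {
    // Drop the scheme if present.
    let rest = url.split_once("://").map_or(url, |(_, r)| r);
    // Cut off the query string and fragment before looking for the path.
    let end = rest
        .find(|c: char| c == '?' || c == '#')
        .unwrap_or(rest.len());
    let rest = &rest[..end];
    // Everything from the first '/' onward is the path.
    rest.find('/').map_or("/", |i| &rest[i..])
}

/// Picks Chrome when the URL path starts with one of `js_prefixes`
/// (an assumed, user-supplied list), and plain HTTP otherwise.
pub fn pick_mode(url: &str, js_prefixes: &[&str]) -> FetchMode {
    let path = url_path(url);
    if js_prefixes.iter().any(|p| path.starts_with(*p)) {
        FetchMode::Chrome
    } else {
        FetchMode::Http
    }
}

/// Splits URLs into (needs Chrome, HTTP only), preserving input order.
pub fn split_by_mode<'a>(
    urls: &[&'a str],
    js_prefixes: &[&str],
) -> (Vec<&'a str>, Vec<&'a str>) {
    urls.iter()
        .copied()
        .partition(|u| pick_mode(u, js_prefixes) == FetchMode::Chrome)
}
```

With prefixes such as `["/catalog", "/search"]`, catalog URLs would land in the first list and product URLs in the second. The first list could then go to a Chrome-backed crawl and the second to the HTTP path the answer mentions (crawl_raw); how to wire that up is not shown in this thread.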

Answer selected by blinjrm