
crawl incorrect with given requestQueue #2267

Answered by B4nan
zshnb asked this question in Q&A
uniqueKey works fine. Your code looks like it is reusing a single UUID generated up front; you need to generate a new one for each request, so this needs to happen inside the transformRequestFunction. It also depends on what you are after: if it's about respecting URL fragments (the part after #), there is an option for that too, which would be better than adding random strings to the unique key.

const requestQueue = await RequestQueue.open(uuid)

This is the wrong approach: you are creating a named queue (with a random name), and those are not removed automatically on start. You just generate data that you will need to clean up manually. Using the default queue is fine here, your problem is dedupl…

Answer selected by zshnb