Retrieve a single url in headless mode #488

Sim4n6 · 2023-06-21T06:14:28Z

Sim4n6
Jun 21, 2023

I have a list of URLs stored in urls.txt file in the format https://www.ddddd.com/aaa?qq=bbb.

Can you make katana retrieve its related JS and static file and follow redirect if any for a single URL in a headless mode with no follow to http links within the retrieved web page, please?

I am looking for a simple cmd with a little help please.

Answered by olearycrew

Jun 21, 2023

@Sim4n6 thanks for your question! I think I've developed a command to get you what you want. The command is:

katana -list urls.txt -d 2 -jc -hl -sr -mr '(.*)\.js'

Where:

-list urls.txt Uses your URL list - the format here shouldn't matter
-d 2 Sets depth to 2, so you only crawl pages directly from the URLs you sent
-jc Sets JavaScript crawling on (optional) to crawl within those JS files
-hl Headless
-sr Save responses (optional)
-mr '(.*)\.js' Match a regex to only craw files ending in.js - you could modify this regex as needed

I hope this helps - and even if it isn't perfect, it at least gives you some idea of what you can do!

View full answer

olearycrew · 2023-06-21T14:27:35Z

olearycrew
Jun 21, 2023
Maintainer

@Sim4n6 thanks for your question! I think I've developed a command to get you what you want. The command is:

katana -list urls.txt -d 2 -jc -hl -sr -mr '(.*)\.js'

Where:

-list urls.txt Uses your URL list - the format here shouldn't matter
-d 2 Sets depth to 2, so you only crawl pages directly from the URLs you sent
-jc Sets JavaScript crawling on (optional) to crawl within those JS files
-hl Headless
-sr Save responses (optional)
-mr '(.*)\.js' Match a regex to only craw files ending in.js - you could modify this regex as needed

I hope this helps - and even if it isn't perfect, it at least gives you some idea of what you can do!

0 replies

Sim4n6 · 2023-06-21T16:35:00Z

Sim4n6
Jun 21, 2023
Author

thx

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Retrieve a single url in headless mode #488

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Retrieve a single url in headless mode #488

Sim4n6 Jun 21, 2023

Replies: 2 comments

olearycrew Jun 21, 2023 Maintainer

Sim4n6 Jun 21, 2023 Author

Sim4n6
Jun 21, 2023

olearycrew
Jun 21, 2023
Maintainer

Sim4n6
Jun 21, 2023
Author