A scraping tool for crawling a website and extracting its href links

Kadetron/Exticz

Exticz



Exticz is a scraping tool, written as a shell script, that extracts href links from a website. It can be used to examine websites for redirections and to find subdomains.


This is the initial build and can crawl up to a depth of 2.
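
To illustrate what a crawl depth of 2 can mean, here is a minimal sketch. It assumes that depth 1 covers the links on the starting page and depth 2 covers the links on each of those pages; how exticz itself counts depth is not confirmed here. The `fetch` and `crawl` functions and the example.com pages are hypothetical, and `fetch` is a stub that returns canned HTML so the sketch runs without network access (a real version might call `curl -s "$1"` instead).

```shell
#!/bin/sh
# Hypothetical stand-in for downloading a page; returns canned HTML.
fetch() {
    case "$1" in
        https://example.com)   echo '<a href="https://example.com/a">a</a>' ;;
        https://example.com/a) echo '<a href="https://example.com/b">b</a>' ;;
        https://example.com/b) echo '<a href="https://example.com/c">c</a>' ;;
    esac
}

# crawl URL DEPTH: print every link found, following links DEPTH levels deep.
# Positional parameters are used directly so each recursive call keeps its
# own depth value.
crawl() {
    [ "$2" -gt 0 ] || return 0
    fetch "$1" | grep -o 'href="[^"]*"' | sed 's/^href="//; s/"$//' |
    while IFS= read -r link; do
        printf '%s\n' "$link"
        crawl "$link" $(( $2 - 1 ))
    done
}

crawl https://example.com 2
```

With the stub pages above, the link to `/c` is never printed, because it only appears at the third level.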

Usage

$ exticz

[Image: exticz banner example]

$ exticz
Enter the URL: <[domain].[TLD]> | <[domain].[TLD]/directory> | <[domain].[TLD]/directory?key=value>

[Image: exticz example]

$ exticz <[domain].[TLD]> | <[domain].[TLD]/directory> | <[domain].[TLD]/directory?key=value>

[Image: exticz command-line example]

When passing the URL as a command-line argument, it is recommended to enclose it in quotes.
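
Quoting matters because characters such as `?` and `&` are special to the shell: an unquoted `?` can be treated as a glob pattern, and an unquoted `&` ends the command and runs it in the background. The following minimal sketch uses a hypothetical `show_arg` helper and a made-up URL (neither is part of exticz) to show that a quoted URL arrives as one intact argument.

```shell
#!/bin/sh
# Hypothetical helper: print the first argument exactly as received.
show_arg() {
    printf '%s\n' "$1"
}

# Quoted: the query string, including '?' and '&', is passed through unchanged.
show_arg "example.com/directory?key=value&page=2"

# The equivalent exticz call would be written the same way:
#   exticz "example.com/directory?key=value&page=2"
```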

Installation

  git clone https://github.com/Kadetron/Exticz  
  cd Exticz
  sudo mv exticz /usr/local/bin
  cd ..
  rm -r ./Exticz

Features

  • Uses the https scheme by default when the input does not specify a scheme
  • Highlights base URLs so the output has a consistent format; sub-URLs are listed under their highlighted base URL
  • Accepts any valid link
  • Can be used in a command chain
  • Removes successive repeated URLs from the result
  • Prints the number of links at the end
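
Several of these features can be sketched with the tools listed under Dependencies. The example below is an illustration under assumptions, not the exticz source: `with_scheme` and `extract_hrefs` are hypothetical names, the HTML is made up, and the href matching only handles double-quoted attributes.

```shell
#!/bin/sh
# Hypothetical helper: prepend https:// when the input has no http(s) scheme.
with_scheme() {
    case "$1" in
        http://*|https://*) printf '%s\n' "$1" ;;
        *)                  printf 'https://%s\n' "$1" ;;
    esac
}

# Hypothetical helper: read HTML on stdin, print one href value per line,
# and collapse successive repeats with uniq.
extract_hrefs() {
    grep -o 'href="[^"]*"' | sed 's/^href="//; s/"$//' | uniq
}

with_scheme example.com

# Made-up HTML with one link repeated back to back.
html='<a href="/a">A</a> <a href="/a">A again</a> <a href="https://example.org/b">B</a>'
links=$(printf '%s\n' "$html" | extract_hrefs)
printf '%s\n' "$links"
printf 'Total links: %s\n' "$(printf '%s\n' "$links" | wc -l | tr -d ' ')"
```

Because `uniq` only drops adjacent duplicates, a link that reappears later in the page would still be printed again, which matches the "successive repeated URLs" wording above.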

Dependencies

  • awk
  • curl
  • grep
  • sed
  • uniq
  • tee
  • wc
