mlucchini/wd-crawler

crawler

Run the provided assembly
	./crawl <domain-name>
Run the tests

You may need to download SBT.

	sbt test
Compile an assembly

You may need to download SBT.

	sbt assembly
Trade-offs
  • Poor error-handling strategy
  • No retry strategy
  • No graphical interface
  • Did not use the Actor model to encapsulate the mutable shared state in the parallel crawler
  • Written in Scala with a few libraries, which results in an assembly of substantial size
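
To illustrate the "No retry strategy" item above, here is a minimal sketch of what a retry wrapper around an asynchronous fetch could look like. It is not part of this crawler: `Retry` and `retry` are hypothetical names, and the sketch uses only the Scala standard library, with no backoff or error filtering.

```scala
import scala.concurrent.{ExecutionContext, Future}

// Hypothetical helper, not taken from the crawler's code base.
object Retry {
  // Evaluates `op` and, if the resulting Future fails, evaluates it again
  // until it succeeds or `attempts` total tries have been made.
  // `op` is passed by name so that each try starts a fresh Future.
  def retry[T](attempts: Int)(op: => Future[T])(implicit ec: ExecutionContext): Future[T] =
    op.recoverWith {
      // Retry only while tries remain; otherwise the last failure is returned.
      case _: Throwable if attempts > 1 => retry(attempts - 1)(op)
    }
}
```

A call site might wrap a page fetch as `Retry.retry(3)(fetch(url))`, where `fetch` stands in for whatever function returns a `Future` of the page. A real strategy would likely add a delay between tries and retry only on transient failures.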
Context

Job interview exercise

About

🕸 Simple async website crawler with Scala and Netty
